Interactive Application for Building, Running and Managing Data Pipelines for Enterprise Data Lakes.

Integrate, Prepare and Blend

Ingest data in minutes from anywhere and any type without writing code. Prepare, cleanse, and enrich using code-free data wrangler and built-in transformation plugins. Blend data from traditional RDBMS to Data Warehouse to Hadoop.

Aggregate and Analyze

Perform step-by-step aggregation and analytics in Batch or Realtime. Leverage plugins that use state-of-art Spark ML for building models and scoring models in a unified environment.

Automate and Operationalize

Use REST APIs or CLI tools for automating deployment and management of pipelines in different environments. Use built-in enterprise scheduler to schedule pipelines to run at periodic intervals, aggregate pipeline logs and metrics, and compare different runs of pipelines for diagnosing problems.

Deploy, Audit and Govern

Deploy pipelines to be executed as MapReduce or Spark or Spark Streaming in case of real-time. Catalog all of the datasets and metadata to support data governance. Secure your data with fine-grained access control, monitor and track user activities through audit logs.

Cask Data Application Platform CDAP 4 Now Generally Available!

Cask Hydrator is a code-free visual application for building complex data pipelines and managing them on your Data Lake. With Cask Hydrator, you can ingest data from varied sources, ingest CSV, XML, Excel, etc., cleanse, normalize, and transform data, build machine learning models on-fly, perform aggregations, run custom scripts, and more.

With its Live Preview feature, Cask Hydrator also provides the ability to use live data during the development process in the graphical Studio environment. And with the addition of Cask Wrangler, a graphical data wrangler, Cask Hydrator offers an improved schema definition experience to its pipelines.

Open Source and Extensible

Cask Hydrator is 100% open source and highly extensible.

Batch and Real-time on Spark

Cask Hydrator offers support for MapReduce, Spark, Spark Streaming, and Tigon Flows.

Connects to Anything

Cask Hydrator integrates with existing enterprise solutions for security, MDM, and BI, protecting past investments by the business in these solutions.

Features & Benefits

Accelerate Time to Value

Rapidly deliver reliable and operational Data Lakes and production Data Applications faster and better. Extensible libraries and components promote reuse and further accelerate the pace of innovation.

Provide Self-Service

Broaden the user base of your big data platform with a radically simplified developer experience and code-free Extensions for non-developers. Reusable libraries can be assembled and run as data pipelines through drag-and-drop interfaces.

Enable Governance

Automatic tracking of all audits and data lineage with discovery and search. Integrate into existing security and governance systems with authentication, authorization, and audit built-in automatically.

Other Cask Products

Cask Wrangler, powered by CDAP, provides an easy and interactive way to visualize, transform, and cleanse data. It helps data scientists and data engineers derive new schemas and operationalize the data preparation with a few clicks.

Learn More

Cask Tracker, powered by CDAP, helps you discover, profile, and govern data in your data lake. It has powerful features that allow IT as well as business users to manage all facets of data governance from data discovery, to metadata tracking, to data lineage, and usage analytics.

Learn More

CDAP accelerates time to value from Hadoop through standardized APIs, configurable templates, and visual interfaces, and it increases efficiencies through reusable and portable components. CDAP removes barriers to innovation as an extensible and future-proof platform that provides consistency across environments and easily integrates with existing MDM, BI, and security solutions.

Learn More

Cask Market is Cask’s “Big Data App Store” with push button deployment for applications, use cases, data pipelines, sample datapacks, and plugins from within CDAP. It provides step-by-step wizards to help configure and deploy new entities within the platform.

Learn More

Want to see Cask Hydrator in action? Click the button to request a demo >>