Data Scientist

As a data scientist, you need to find the data in your organization and set up easy access to it so that you can analyze large volumes of data.

With Hydrator, an extension built on CDAP, you can:

  • Easily build data pipelines using a rich user interface that provides self-service, drag-and-drop access to create, manage, and use data pipelines built on Hadoop.
  • Simplify the data integration and ETL process. Automate the process of moving, parsing, cleansing, and cataloging datasets.

With Tracker, also an extension built on CDAP, you can:

  • Take a structured approach to finding the data you need to analyze by searching your company's metadata, both business and technical.