About Cloudera Data Services on premises
Cloudera Data Services on premises works on top of Cloudera Base on premises and is the on-premises offering of Cloudera that brings many of the benefits of the public cloud deployments to Cloudera on premises. Cloudera Private Cloud Data Services lets you deploy and use the Cloudera Data Warehouse, Cloudera AI, and Cloudera Data Engineering Data Services.
Building on Apache Spark, Cloudera Data Engineering is an all-inclusive data engineering toolset that enables orchestration automation with Apache Airflow, advanced pipeline monitoring, visual troubleshooting, and comprehensive management tools to streamline ETL processes across enterprise analytics teams. Cloudera Data Engineering powers consistent, repeatable, and automated data engineering workflows.
Cloudera Data Warehouse enables you to create highly-performant, independent, self-service data warehouses for teams of business analysts. Cloudera’s support for an open data lakehouse, centered on Cloudera Data Warehouse, brings high-performance, self-service reporting and analytics to your business – simplifying data management for data practitioners and administrators.
Cloudera AI, Cloudera’s platform for machine learning and AI unifies self-service data science and data engineering in a single, portable service for multi-function analytics on data. It optimizes ML workflows across your business with native and robust tools for deploying, serving, and monitoring models.
Replication Manager enables you to copy and migrate HDFS data, Hive external tables, and Ozone data between Cloudera Base on premises 7.1.8 or higher clusters using Cloudera Manager version 7.7.3 or higher.
Cloudera Data Catalog is a service within Cloudera Data Platform that enables you to understand, manage, secure, and govern data assets across the enterprise. Data Catalog helps you discern data lying in your Data Lake (Base cluster).
Providing a consistent user experience to data practitioners and IT admins across Cloudera on cloud and Cloudera on premises, these data services open up avenues for a hybrid data lifecycle.