CDP Private Cloud Data Services overview

CDP Private Cloud Data Services works on top of CDP Private Cloud Base and is the on-premises offering of CDP that brings many of the benefits of the public cloud deployments to on-premises CDP deployments. CDP Private Cloud Data Services lets you deploy and use the Cloudera Data Warehouse (CDW), Cloudera Machine Learning (CML), and Cloudera Data Engineering (CDE) Data Services.

CDP Private Cloud disaggregates compute and storage, which allows independent scaling of compute and storage clusters. The Data Services provide containerized analytic applications that scale dynamically and can be upgraded independently. Through the use of containers deployed on Kubernetes, CDP Private Cloud Data Services brings both agility and predictable performance to analytic applications. CDP Private Cloud Data Services inherits unified security, governance, and metadata management through Cloudera Shared Data Experience (SDX), which is available on the CDP Private Cloud Base cluster.

CDP Private Cloud Data Services users can rapidly provision and deploy services such as Cloudera Data Warehousing, Cloudera Machine Learning, and Cloudera Data Engineering through the Management Console, and easily scale them up or down as required.

A CDP Private Cloud Data Services deployment requires a Private Cloud Base cluster, along with container-based clusters that run the Data Services. You can either use a dedicated RedHat OpenShift container cluster or deploy an Embedded Container Service (ECS) container cluster.

You can install CDP Private Cloud Base on virtual machines or bare-metal hardware. CDP Private Cloud Base provides the following components and services that are used by CDP Private Cloud Data Services:

  • SDX Data Lake cluster for security, metadata, and governance
  • HDFS or Ozone for storage
  • Cloudera Runtime components such as Ranger, Atlas, and Hive Metastore (HMS)
  • Networking infrastructure that supports network traffic between storage and compute environments