Updated CDH Components

CDH Component Changes in CDP Private Cloud Base 7.

Component Changes in CDP Private Cloud Base 7.0

Pig, Flume, Sentry, and Navigator have been removed.

  • Pig can be replaced with Hive or Spark.

  • Flume has been replaced with Cloudera Flow Management (CFM). CFM is a no-code data ingestion and management solution powered by Apache NiFi. Contact your Cloudera account team for more information about moving from Flume to CFM.

  • Sentry has been replaced with Apache Ranger. A Sentry-to-Ranger policy migration tool is available for CDP Private Cloud Base 7.1 and migrations will be supported when Replication Manager is used to migrate Hive tables from CDH to CDP.

  • Navigator has been replaced with Apache Atlas.

  • Cloudera Director is not supported with CDP Private Cloud Base.
  • MapReduce v1 was deprecated as of CDH 5.0 and is not supported in CDP.

Component Changes in CDP Private Cloud Base 7.1

Lucid and the YARN Fair Scheduler are being removed.

  • Lucidworks is being replaced with Solr.

  • The YARN Fair Scheduler is being replaced with the YARN Capacity Scheduler. A migration tool will be provided to convert the Fair Scheduler configurations to Capacity Scheduler.

  • The Sentry policy files feature option has been deprecated in CDH 5.x and is entirely removed in CDP Private Cloud Base 7.1. Prior to an upgrade to CDP Private Cloud Base 7.1, the Sentry Service role must be set up and used for enforcing access policies over Hive/Solr. To migrate the existing Sentry Policy file configuration to a Sentry Service, see: Migrating from Sentry Policy Files to the Sentry Service .
  • Navigator has been replaced with Apache Atlas.

    Navigator lineage data can be transferred to Atlas as part of the CDH to CDP Private Cloud Base upgrade process. Navigator audit data is not transferred to Atlas.

    When you upgrade a CDH cluster to CDP Private Cloud Base, the Navigator audits persist. However, services no longer produce audits for Navigator. You can continue to run Navigator to be able to access the audits. See Transitioning Navigator audits.

The following CDH components have been included in Cloudera Runtime for the first time in CDP Private Cloud Base 7.1.1.

Deprecated Items

A deprecated item is a feature, component, platform, or functionality that Cloudera is planning to remove in a future release. Cloudera supports items that are deprecated until they are removed, and the deprecation gives customers time to plan for removal.

The following table lists deprecated items:

Table 1. CDH Components, Subcomponents, and Product Functionality
Item Related Information CDH Release in Which Item Is Deprecated Release in Which Support Is Removed
Apache Crunch Apache Crunch is deprecated, and will be removed in a future release. Cloudera recommends using Spark 2 instead. Additionally, as of CDH 6.0.0, Crunch is available only as Maven artifacts from the Cloudera Maven repository. 6.0.0 Cloudera Runtime 7
AsyncHBaseSink AsyncHBaseSink is incompatible with HBase 2.0 and you can no longer use AsyncHBaseSink with Apache Flume. For information about using HBase2Sink with Apache Flume, see . 6.0.0 6.0.0, Cloudera Runtime 7
DataFu 5.9.0 6.0.0, Cloudera Runtime 7
HBaseSink HBaseSink has been replaced with HBase2Sink, which is compatible with HBase 2.0. For information about using HBase2Sink with Apache Flume. 6.0.0 6.0.0, Cloudera Runtime 7
hbck read-write repair mode hbck is only available in a read-only inconsistency identifying mode. 6.0.0 6.0.0, Cloudera Runtime 7
HFTP Use WebHDFS 5.10.1 6.0.0, Cloudera Runtime 7
Hive CLI 5.0.0 Cloudera Runtime 7
HiveServer1 5.3.0 6.0.0, Cloudera Runtime 7
Hue UI version 3 Instead use Hue UI version 4 6.0.0 6.0.0,, Cloudera Runtime 7
Key HSM Debug Startup To get debug information during start up, set the root log level to debug in the /usr/share/keytrustee-server-keyhsm/conf/logback.xml file. 6.1.0 6.1.0, Cloudera Runtime 7
Kite Dataset API Kite Dataset API is deprecated, and will be removed in a future release. 6.0.0 , Cloudera Runtime 7
Kudu Flume sink configuration parameters The producer.skipMissingColumn, producer.skipBadColumnValue, and producer.warnUnmatchedRows Kudu Flume sink configuration parameters have been deprecated in favor of producer.missingColumnPolicy, producer.badColumnValuePolicy, and producer.unmatchedRowPolicy respectively. 6.1.0 , Cloudera Runtime 7
kudu perf loadgen tool configuration options The -table_num_buckets configuration option of the kudu perf loadgen tool is now removed in favor of -table_num_hash_partitions and -table_num_range_partitions 6.1.0 , Cloudera Runtime 7
Legacy Scala clients for Kafka (consumer and producer) The legacy Scala clients (producer and consumer) that are under the kafka.producer.* and kafka.consumer.* package. CDK 3.0.0 and CDH 6.0.0 6.1.0, Cloudera Runtime 7
Llama 5.5.0 6.0.0, Cloudera Runtime 7
Mahout 5.5.0 6.0.0
Management of Key Trustee Server without Cloudera Manager 5.9.0 6.0.0, Cloudera Runtime 7
MR Pipes 5.9.0 6.0.0, Cloudera Runtime 7
MRv1, MapReduce v1 APIs, MapReduce service Migrating from MapReduce 1 to MapReduce 2 5.0.0 6.0.0, Cloudera Runtime 7
Navigator Encrypt Filesystem-Level Encryption Using eCryptfs

There is no support for creating new eCryptfs mount points. Previously existing eCryptfs mount points are not affected.

July 2015 6.0.0, Cloudera Runtime 7
Navigator Encrypt migration command The navencrypt-migration command is deprecated, and has been removed. February 1, 2018 6.0.0, Cloudera Runtime 7
Old NameNode UI 5.5.0 6.0.0, Cloudera Runtime 7
Oozie Hive Action Oozie Hive 2 Action Extension 5.7.0 6.0.0, Cloudera Runtime 7
Parquet library with group ID com.twitter 6.0.0 6.0.0, Cloudera Runtime 7
Parquet methods for reading metadata on the client side 6.0.0 6.0.0, Cloudera Runtime 7
Python 2.6 Python 2.6, packaged with RHEL6, is deprecated. Key Trustee Server uses the utilities ktadmin and keytrustee-orgtool, which use the native version of Python that is packaged with the host OS. 6.0.0 To be determined.
Sentry policy files Migrating from Sentry Policy Files to the Sentry Service 5.8.0 6.0.0, Cloudera Runtime 7
Spark 1.x 5.13 6.0.0, Cloudera Runtime 7
Spark Standalone 5.5.0 6.0.0, Cloudera Runtime 7
Sqoop2 5.9.0 6.0.0, Cloudera Runtime 7
Unmanaged (CLI-based) CDH deployments 6.0.0 6.0.0, Cloudera Runtime 7
Whirr 5.5.0 6.0.0, Cloudera Runtime 7
YARN Fair Scheduler See Migrating from Fair Scheduler to Capacity Scheduler 5.9.0 6.0.0, Cloudera Runtime 7