Cloudera Manager Agents Starting, Stopping, and Restarting Cloudera Manager Agents Configuring Cloudera Manager Agents Managing the Cloudera Manager Agent Logs βΆοΈ Overview of Parcels Advantages of Parcels Parcel Life Cycle Parcel Locations Managing Parcels Viewing Parcel Usage Parcel Configuration Settings βΆοΈ Managing Licenses Accessing the License Page Ending a CDP Private Cloud Base Trial Upgrading from a CDP Private Cloud Base Trial to CDP Private Cloud Base Renewing a License Cloudera Manager User Roles Other Tasks and Settings βΆοΈ Cloudera Management Service Starting the Cloudera Management Service Stopping the Cloudera Management Service Restarting the Cloudera Management Service Starting and Stopping Cloudera Management Service Roles Configuring Management Service Database Limits βΆοΈ Securing sensitive information using a Secure Credential Storage Provider (Technical Preview) Configuring a Secure Credential Storage Provider for Cloudera Manager (Technical Preview) Disabling or changing the Credential Storage Provider (Technical Preview) βΆοΈ Resource Management βΆοΈ Static Service Pools Enabling and Configuring Static Service Pools Disabling Static Service Pools βΆοΈ Linux Control Groups (cgroups) Enabling Resource Management with Control Groups Configuring Resource Parameters Configuring Custom Cgroups βΆοΈ Data Storage for Monitoring Data Configuring Service Monitor Data Storage Configuring Host Monitor Data Storage Viewing Host and Service Monitor Data Storage Data Granularity and Time-Series Metric Data Moving Monitoring Data on an Active Cluster βΆοΈ Host Monitor and Service Monitor Memory Configuration Configuring Memory Allocations βΆοΈ Accessing Storage Using Amazon S3 Referencing S3 Credentials for YARN, MapΒReduce, or Spark Clients Referencing Amazon S3 in URIs Using Fast Upload with Amazon S3 Enabling Fast Upload using Cloudera Manager βΆοΈ Configuring and Managing S3Guard Configuring S3Guard for Cluster Access to S3 Editing the S3Guard 
Configuration Running the Prune Command Using Cloudera Manager Admin Console Running the Prune Command Using the Cloudera Manager API How to Configure a MapΒReduce Job to Access S3 with an HDFS Credstore βΆοΈ Importing Data into Amazon S3 Using Sqoop βΆοΈ Authentication Using a Credential Provider to Secure S3 Credentials βΆοΈ Sqoop Import into Amazon S3 Import Data from RDBMS into an S3 Bucket Import Data into S3 Bucket in Incremental Mode Import Data into an External Hive Table Backed by S3 S3Guard with Sqoop βΆοΈ Accessing Storage Using Microsoft ADLS Configuring OAuth in Data Hub Configuring OAuth with core-site.Βxml Configuring OAuth with the Hadoop CredentialΒProvider Configuring Built-in TLS Acceleration βΆοΈ Importing Data into Microsoft Azure Data Lake Store (Gen1 and Gen2) Using Sqoop Prerequisites Authentication Sqoop Import into ADLS βΆοΈ Configuring Clusters Accessing the Cloudera Manager Admin Console βΆοΈ Modifying Configuration Properties Using Cloudera Manager βΆοΈ Changing the Configuration of a Service or Role Instance Searching for Properties Validation of Configuration Properties Overriding Configuration Properties Viewing and Editing Overridden Configuration Properties Resetting Configuration Properties to the Default Value Viewing and Editing Host Overrides Restarting Services and Instances after Configuration Changes βΆοΈ Suppressing Configuration and Parameter Validation Warnings Suppressing a Configuration Validation in Cloudera Manager Managing Suppressed Validations Suppressing Configuration Validations Before They Trigger Warnings Viewing a List of All Suppressed Validations Cluster-Wide Configuration Custom Configuration Setting an Advanced Configuration Snippet for a Cloudera Runtime Service Setting an Advanced Configuration Snippet for a Cluster Stale Configurations βΆοΈ Client Configuration Files How Client Configurations are Deployed Downloading Client Configuration Files Manually Redeploying Client Configuration Files βΆοΈ Viewing 
and Reverting Configuration Changes Changes for a service, role, or host Changes for a cluster Autoconfiguration βΆοΈ Using the Cloudera Manager API βΆοΈ Using the Cloudera Manager API to backup and restore clusters Backing up the Cloudera Manager configuration Restoring the Cloudera Manager configuration βΆοΈ Using the Cloudera Manager API to Manage and Configure Clusters Using the Cloudera Manager API for Cluster Automation Using the Cloudera Manager API to Obtain Configuration Files Using the Cloudera Manager API to Set Advanced Configuration Snippets (Safety Valves) Using Tags in Cloudera Manager Initiating HDFS failover using the Cloudera Manager API βΆοΈ Creating a Runtime Cluster Using a Cloudera Manager Template Exporting the Cluster Configuration Preparing a New Cluster Creating the Template Importing the Template to a New Cluster Sample Python Code Disabling Redaction of sensitive information when using the Cloudera Manager API βΆοΈ Monitoring and Diagnostics Accessing the Cloudera Manager Admin Console Monitoring and Diagnostics βΆοΈ Time Line Selecting a Point In Time or a Time Range βΆοΈ Health Tests Viewing Health Test Results Suppressing Health Test Results Suppressing a Health Test Configuring Suppression of Health Tests Before Tests Run Viewing a List of Suppressed Health Tests Unsuppressing Health Tests βΆοΈ Viewing Charts for Cluster, Service, Role, and Host Instances Exporting Data from Charts Adding and Removing Charts from a Dashboard Creating Triggers from Charts βΆοΈ Configuring Monitoring Settings Configuring Health Monitoring Configuring Service Monitoring Configuring Host Monitoring Configuring Directory Monitoring Configuring YARN Application Monitoring Configuring Impala Query Monitoring Configuring Impala Query Data Store Maximum Size Enabling Configuration Change Alerts Filtering Metrics βΆοΈ Configuring Log Events Configuring Logs Configuring Logging Thresholds Configuring Log Directories Enabling and Disabling Log Event Capture 
Configuring Which Log Messages Become Events Configuring Log Alerts Monitoring Clusters βΆοΈ Cluster Utilization Report overview Enable the Cluster Utilization Report Configure the Cluster Utilization Report βΆοΈ Use the Cluster Utilization Report to manage resources Overview Tab YARN Tab Impala Tab Download the Cluster Utilization Report βΆοΈ Creating a Custom Cluster Utilization Report Metrics and queries Impala query counter metrics Calculations for reports Retrieving metric data Querying metric data Inspecting Network Performance βΆοΈ Monitoring Services βΆοΈ Monitoring Service Status Viewing the URLs of the Client Configuration Files Viewing the Status of a Service Instance Viewing the Health and Status of a Role Instance Viewing the Maintenance Mode Status of a Cluster βΆοΈ Viewing Service Status Viewing Past Status Status Summary Service Summary Health Tests and Health History βΆοΈ Viewing Service Instance Details Role Instance Reference βΆοΈ Viewing Role Instance Status The Actions Menu Viewing Past Status Summary Health Tests and Health History Status Summary Charts The Processes Tab Running Diagnostic Commands for Roles βΆοΈ Periodic Stacks Collection Configuring Periodic Stacks Collection Viewing and Downloading Stacks Logs βΆοΈ Viewing Running and Recent Commands Viewing Running and Recent Commands For a Cluster Viewing Running and Recent Commands for a Service or Role Command Details βΆοΈ Monitoring Hosts Viewing All Hosts Role Assignments Viewing the Disks Overview Viewing the Hosts in a Cluster Viewing Individual Hosts βΆοΈ Host Details Viewing Host Details Status Processes Resources Commands Configuration Components Audits Charts Library βΆοΈ Host Inspector Running the Host Inspector Viewing Past Host Inspector Results βΆοΈ Monitoring Activities Selecting Columns to Show in the Activities List Sorting the Activities List Filtering the Activities List Activity Charts Viewing the Jobs in a Pig, Oozie, or Hive Activity βΆοΈ Task Attempts Viewing a 
Job's Task Attempts Selecting Columns to Show in the Tasks List Sorting the Tasks List Filtering the Tasks List Viewing Activity Details in a Report Format Comparing Similar Activities βΆοΈ Viewing the Distribution of Task Attempts The Task Distribution Chart TaskΒTracker Hosts βΆοΈ Monitoring Impala Queries Viewing Queries Configuring Impala Query Monitoring Impala Best Practices Results Tab Filtering Queries Filter Expressions Filter Attributes Choosing and Running a Filter Query Details βΆοΈ Monitoring YARN Applications Viewing Jobs Configuring YARN Application Monitoring Results Tab Filtering Jobs Filter Expressions Choosing and Running a Filter Filter Attributes Sending Diagnostic Data to Cloudera for YARN Applications βΆοΈ Monitoring Spark Applications Viewing and Debugging Spark Applications Using Logs Managing Spark Driver Logs Visualizing Spark Applications Using the Web Application UI Accessing the Web UI of a Running Spark Application Accessing the Web UI of a Completed Spark Application βΆοΈ Events Viewing Events βΆοΈ Filtering Events Adding an Event Filter Removing an Event Filter βΆοΈ Alerts Managing Alerts Configuring Alert Email Delivery Configuring Alert SNMP Delivery βΆοΈ Configuring Custom Alert Scripts Sample Custom Alert Script Enabling Configuration Change Alerts Enabling HBase Alerts Enabling Health Alerts Modifying the Health Threshold Configuring Alerts Transitioning Out of Alerting Health Threshold Configuring Log Alerts Configuring Alert Delivery βΆοΈ Triggers Creating a Trigger Using the Expression Editor Editing, Deleting, Suppressing, or Deleting a Trigger βΆοΈ Cloudera Manager Trigger Use Cases Creating a Trigger for Memory Capacity Creating a Trigger for CPU Capacity βΆοΈ Lifecycle and Security Auditing Viewing Audit Events βΆοΈ Filtering Audit Events Adding a Filter Removing a Filter Downloading Audit Events βΆοΈ Charting Time-Series Data Terminology Building a Chart with Time-Series Data Configuring Time-Series Query Results Using 
Context-Sensitive Variables in Charts βΆοΈ Chart Properties Changing the Chart Type Grouping (Faceting) Time Series Displaying Chart Details Editing a Chart Saving a Chart Obtaining Time-Series Data Using the API βΆοΈ Dashboards Dashboard Types Creating a Dashboard Managing Dashboards Configuring Dashboards Saving Charts to Dashboards Saving Charts to a New Dashboard Saving Charts to an Existing Dashboard Adding a New Chart to the Custom Dashboard Removing a Chart from a Custom Dashboard Moving and Resizing Charts βΆοΈ tsquery Language tsquery Syntax Metric Expressions Metric Expression Functions Predicates Discovering possible predicates Filtering by Day of Week or Hour of Day Time Series Attributes Time Series Entities and their Attributes FAQ βΆοΈ Metric Aggregation Presentation of Aggregate Data Accessing Aggregate Statistics Through tsquery Filtering Metrics βΆοΈ Logs Viewing Logs Logs List Filtering Logs Log Details βΆοΈ Viewing the Cloudera Manager Server Log Viewing Cloudera Manager Server Logs in the Logs Page Viewing the Cloudera Manager Server Log βΆοΈ Viewing the Cloudera Manager Agent Logs Viewing Cloudera Manager Agent Logs in the Logs Page Viewing the Cloudera Manager Agent Log Managing Disk Space for Log Files βΆοΈ Reports βΆοΈ Directory Usage Report Accessing the Directory Usage Report βΆοΈ Using the Directory Usage Report Filters Disk Usage Reports βΆοΈ Disk Usage Reports Viewing Current Disk Usage by User, Group, or Directory Viewing Historical Disk Usage by User, Group, or Directory Downloading Reports as CSV and XLS Files Activity, Application, and Query Reports βΆοΈ The File Browser Searching Within the File System Enabling Snapshots Setting Quotas Designating Directories to Include in Disk Usage Reports Downloading HDFS Directory Access Permission Reports Cluster Support Tokens using Cloudera Manager βΆοΈ Sending Usage and Diagnostic Data to Cloudera Configuring a Proxy Server Managing Anonymous Usage Data Collection Diagnostic Data 
Collection Log support in Cloudera Manager for ECS cluster Configuring the Frequency of Diagnostic Data Collection Configuring collection of Cloudera Manager table data Specifying the Diagnostic Data Directory Redaction of Sensitive Information from Diagnostic Bundles Disabling the Automatic Sending of Diagnostic Data from a Manually Triggered Collection Manually Triggering Collection and Transfer of Diagnostic Data to Cloudera βΆοΈ Troubleshooting Cluster Configuration and Operation Solutions to Common Problems Logs and Events βΆοΈ Replication Manager Replication Manager in CDP Private Cloud Base Support matrix for Replication Manager on CDP Private Cloud Base Port and network requirements for Replication Manager on CDP Private Cloud Base βΆοΈ Prepare to replicate using replication policies Cloudera license requirements for Replication Manager Configuring SSL/TLS certificate exchange between two Cloudera Manager instances Add source cluster as peer to use in replication policies βΆοΈ Enabling replication between clusters with Kerberos authentication Required ports in Kerberos authentication-enabled clusters for replication Considerations for realm names to use for replication Prepare Kerberos authentication-enabled clusters for replication Kerberos connectivity test Replicating from unsecure to secure clusters βΆοΈ Replication of encrypted data Encrypting data in transit between clusters Security considerations for encrypted data during replication Configuring heap size to replicate large directories using replication policies Retaining logs for Replication Manager βΆοΈ HDFS replication policies βΆοΈ HDFS replication policy considerations How HDFS replication policy works Improve network latency during replication job run Performance and scalability limitations to consider for replication policies Guidelines to use snapshot diff-based replication HDFS replication in Sentry-enabled clusters Specifying hosts to improve HDFS replication policy performance Creating 
HDFS replication policy to replicate HDFS data View HDFS replication policy details View historical details for an HDFS replication policy Monitoring the performance of HDFS replication policies βΆοΈ Hive external table replication policies βΆοΈ Hive replication policy considerations Specifying hosts to improve Hive replication policy performance Hive tables and DDL commands Disabling replication of parameters during Hive replication Accommodate HMS changes for Hive replication policies Creating a Hive external table replication policy Sentry to Ranger replication for Hive external tables Importing Sentry privileges into Ranger policies Replicating data to Impala clusters Replication of Impala and Hive User Defined Functions (UDFs) Monitoring the performance of Hive/Impala replication policies Managing replication policies Troubleshooting replication policies between on-premises clusters βΆοΈ Snapshots βΆοΈ Using snapshots with replication Snapshot policies in Replication Manager Creating and managing snapshot policies Snapshots history Hive/Impala replication using snapshots Orphaned snapshots βΆοΈ Managing HDFS snapshots in Cloudera Manager Browse HDFS directories Enabling and disabling HDFS snapshots Taking and deleting HDFS snapshots Restoring HDFS snapshots βΆοΈ Use DistΒCp to migrate HDFS data from HDP to CDP βΆοΈ Using DistΒCp to migrate data from secure HDP to unsecure CDP Step 1: Enabling hdfs user to run YARN jobs Step 2: Configuration changes on the CDP cluster Step 3: Running the DistΒCp job on the HDP cluster βΆοΈ Using DistΒCp to migrate data from secure HDP to secure CDP using DistΒCp Step 1: Configuration changes on HDP and CDP clusters Step 2: Configuring user to run YARN jobs on both the clusters Step 3: Running DistΒCp job on CDP cluster βΆοΈ How to: Next-Gen Storage βΆοΈ Storing Data Using Ozone βΆοΈ Managing storage elements by using the command-line interface βΆοΈ Commands for managing volumes Assigning administrator privileges to users 
Commands for managing buckets Commands for managing keys βΆοΈ Using Ozone S3 Gateway to work with storage elements Configuration to expose buckets under non-default volumes REST endpoints supported on Ozone S3 Gateway Configuring Ozone to work as a pure object store βΆοΈ Access Ozone S3 Gateway using the S3A filesystem Examples of using the S3A filesystem with Ozone S3 Gateway Configuring Spark access for S3A Configuring Hive access for S3A Configuring Impala access for S3A βΆοΈ Using the AWS CLI with Ozone S3 Gateway Configuring https endpoints in Ozone S3 Gateway to work with AWS CLI Examples of using the AWS CLI for Ozone S3 Gateway βΆοΈ Accessing Ozone object store with Amazon Boto3 client Obtaining resources to Ozone Obtaining client to Ozone through session βΆοΈ List of APIs verified Create a bucket List buckets Head a bucket Delete a bucket Upload a file Download a file Head an object Delete Objects Multipart upload βΆοΈ Working with Ozone File System (o3fs) Setting up o3fs βΆοΈ Working with ofs Volume and bucket management using ofs Key management using ofs βΆοΈ Ozone configuration options to work with CDP components Configuration options for Spark to work with o3fs Configuration options to store Hive managed tables on Ozone βΆοΈ Overview of the Ozone Manager in High Availability Considerations for configuring High Availability on the Ozone Manager βΆοΈ Ozone Manager nodes in High Availability Read and write requests with Ozone Manager in High Availability βΆοΈ Overview of Storage Container Manager in High Availability Considerations for configuring High Availability on Storage Container Manager Storage Container Manager operations in High Availability Offloading Application Logs to Ozone βΆοΈ Removing Ozone DataΒNodes from the cluster Decommissioning Ozone DataΒNodes Placing Ozone DataΒNodes in offline mode Configuring the number of storage container copies for a DataΒNode Recommissioning an Ozone DataΒNode Multi-Raft configuration for efficient write 
performances βΆοΈ Working with the Recon web user interface Access the Recon web user interface βΆοΈ Elements of the Recon web user interface Overview page DataΒNodes page Pipelines page Missing Containers page Configuring Ozone to work with Prometheus Ozone trash overview Configuring the Ozone trash checkpoint values βΆοΈ Configuring Ozone Security Using Ranger with Ozone βΆοΈ Kerberos configuration for Ozone Security tokens in Ozone Kerberos principal and keytab properties for Ozone service daemons Securing DataΒNodes Configure S3 credentials for working with Ozone Configuring custom Kerberos principal for Ozone Configuring Transparent Data Encryption for Ozone Configuring TLS/SSL encryption manually for Ozone Configuration for enabling mΒTLS in Ozone βΆοΈ Configuring security for Storage Container Managers in High Availability Considerations for enabling SCM HA security βΆοΈ Configuring Ozone Performance tuning for Ozone βΆοΈ How to: Storage βΆοΈ Managing Data Storage βΆοΈ Optimizing data storage βΆοΈ Balancing data across disks of a DataΒNode βΆοΈ Plan the data movement across disks Parameters to configure the Disk Balancer Run the Disk Balancer plan Disk Balancer commands βΆοΈ Erasure coding overview Understanding erasure coding policies Comparing replication and erasure coding Best practices for rack and node setup for EC Prerequisites for enabling erasure coding Limitations of erasure coding Using erasure coding for existing data Using erasure coding for new data Advanced erasure coding configuration Erasure coding CLI command Erasure coding examples βΆοΈ Increasing storage capacity with HDFS compression Enable GZipΒCodec as the default compression codec Use GZipΒCodec with a one-time job βΆοΈ Set HDFS quotas Setting HDFS quotas in Cloudera Manager βΆοΈ Configuring heterogeneous storage in HDFS HDFS storage types HDFS storage policies Commands for configuring storage policies Set up a storage policy for HDFS Set up SSD storage using Cloudera Manager 
Configure archival storage The HDFS mover command βΆοΈ Balancing data across an HDFS cluster Why HDFS data becomes unbalanced βΆοΈ Configurations and CLI options for the HDFS Balancer Properties for configuring the Balancer Balancer commands Recommended configurations for the Balancer βΆοΈ Configuring and running the HDFS balancer using Cloudera Manager Configuring the balancer threshold Configuring concurrent moves Recommended configurations for the balancer Running the balancer Configuring block size βΆοΈ Cluster balancing algorithm Storage group classification Storage group pairing Block move scheduling Block move execution Exit statuses for the HDFS Balancer HDFS βΆοΈ Optimizing performance βΆοΈ Improving performance with centralized cache management Benefits of centralized cache management in HDFS Use cases for centralized cache management Centralized cache management architecture Caching terminology Properties for configuring centralized caching Commands for using cache pools and directives βΆοΈ Specifying racks for hosts Viewing racks assigned to cluster hosts Editing rack assignments for hosts βΆοΈ Customizing HDFS Customize the HDFS home directory Properties to set the size of the NameΒNode edits directory βΆοΈ Optimizing NameΒNode disk space with Hadoop archives Overview of Hadoop archives Hadoop archive components Create a Hadoop archive List files in Hadoop archives Format for using Hadoop archives with MapΒReduce βΆοΈ Detecting slow DataΒNodes Enable disk IO statistics Enable detection of slow DataΒNodes βΆοΈ Allocating DataΒNode memory as storage HDFS storage types LAZY_ΒPERSIST memory storage policy Configure DataΒNode memory as storage βΆοΈ Improving performance with short-circuit local reads Prerequisites for configuring short-ciruit local reads Properties for configuring short-circuit local reads on HDFS βΆοΈ Configure mountable HDFS Add HDFS system mount Optimize mountable HDFS Configuring Proxy Users to Access HDFS βΆοΈ Using DistΒCp to copy 
files Using DistΒCp Distcp syntax and examples Using DistΒCp with Highly Available remote clusters βΆοΈ Using DistΒCp with Amazon S3 Using a credential provider to secure S3 credentials Examples of DistΒCp commands using the S3 protocol and hidden credentials Kerberos setup guidelines for Distcp between secure clusters βΆοΈ Distcp between secure clusters in different Kerberos realms Configure source and destination realms in krb5.conf Configure HDFS RPC protection Specify truststore properties Set HADOOP_ΒCONF to the destination cluster Launch distcp Copying data between a secure and an insecure cluster using DistΒCp and WebΒHDFS Post-migration verification Using DistΒCp between HA clusters using Cloudera Manager βΆοΈ Using the NFS Gateway for accessing HDFS Install the NFS Gateway βΆοΈ Start and stop the NFS Gateway services Start the NFS Gateway services Stop the NFS Gateway services Verify validity of the NFS services βΆοΈ Access HDFS from the NFS Gateway How NFS Gateway authenticates and maps users βΆοΈ APIs for accessing HDFS Set up WebΒHDFS on a secure cluster βΆοΈ Using HttpΒFS to provide access to HDFS Add the HttpΒFS role Using Load Balancer with HttpΒFS βΆοΈ HttpΒFS authentication Use curl to access a URL protected by Kerberos HTTP SPNEGO βΆοΈ Data storage metrics Using JMX for accessing HDFS metrics HDFS Metrics βΆοΈ Using HdfsΒFindΒTool to find files Downloading Hdfsfindtool from the CDH archives βΆοΈ Configuring Data Protection βΆοΈ Data protection βΆοΈ Backing up HDFS metadata βΆοΈ Introduction to HDFS metadata files and directories βΆοΈ Files and directories NameΒNodes JournalΒNodes DataΒNodes βΆοΈ HDFS commands for metadata files and directories Configuration properties βΆοΈ Back up HDFS metadata Prepare to back up the HDFS metadata Backing up NameΒNode metadata Back up HDFS metadata using Cloudera Manager Restoring NameΒNode metadata Restore HDFS metadata from a backup using Cloudera Manager Perform a backup of the HDFS metadata βΆοΈ Configuring 
HDFS trash Trash behavior with HDFS Transparent Encryption enabled Enabling and disabling trash Setting the trash interval βΆοΈ Using HDFS snapshots for data protection Considerations for working with HDFS snapshots Enable snapshot creation on a directory Create snapshots on a directory Recover data from a snapshot Options to determine differences between contents of snapshots CLI commands to perform snapshot operations βΆοΈ Managing snapshot policies using Cloudera Manager Create a snapshot policy Edit or delete a snapshot policy Enable and disable snapshot creation using Cloudera Manager Create snapshots using Cloudera Manager Delete snapshots using Cloudera Manager Preventing inadvertent deletion of directories βΆοΈ Accessing Cloud Data Cloud storage connectors overview The Cloud Storage Connectors βΆοΈ Working with Amazon S3 Limitations of Amazon S3 βΆοΈ Configuring Access to S3 Configuring Access to S3 on CDP Public Cloud βΆοΈ Configuring Access to S3 on Cloudera Private Cloud Base Using Configuration Properties to Authenticate Using Per-Bucket Credentials to Authenticate Using Environment Variables to Authenticate Using EC2 Instance Metadata to Authenticate Referencing S3 Data in Applications βΆοΈ Configuring Per-Bucket Settings Customizing Per-Bucket Secrets Held in Credential Files Configuring Per-Bucket Settings to Access Data Around the World βΆοΈ Encrypting Data on S3 βΆοΈ SSE-S3: Amazon S3-Managed Encryption Keys Enabling SSE-S3 βΆοΈ SSE-KMS: Amazon S3-KMS Managed Encryption Keys Enabling SSE-KMS IAM Role permissions for working with SSE-KMS βΆοΈ SSE-C: Server-Side Encryption with Customer-Provided Encryption Keys Enabling SSE-C Configuring Encryption for Specific Buckets Encrypting an S3 Bucket with Amazon S3 Default Encryption Performance Impact of Encryption βΆοΈ Safely Writing to S3 Through the S3A Committers Introducing the S3A Committers Configuring Directories for Intermediate Data Using the Directory Committer in MapΒReduce Verifying That an S3A 
Committer Was Used Cleaning up after failed jobs βΆοΈ Advanced Committer Configuration Enabling Speculative Execution Using Unique Filenames to Avoid File Update Inconsistency Speeding up Job Commits by Increasing the Number of Threads Securing the S3A Committers The S3A Committers and Third-Party Object Stores Limitations of the S3A Committers Troubleshooting the S3A Committers Security Model and Operations on S3 S3A and Checksums (Advanced Feature) A List of S3A Configuration Properties Working with versioned S3 buckets Working with Third-party S3-compatible Object Stores βΆοΈ Improving Performance for S3A Working with S3 buckets in the same AWS region βΆοΈ Configuring and tuning S3A block upload Tuning S3A Uploads Thread Tuning for S3A Data Upload Optimizing S3A read performance for different file types S3 Performance Checklist Troubleshooting S3 βΆοΈ Working with Google Cloud Storage βΆοΈ Configuring Access to Google Cloud Storage Create a GCP Service Account Create a Custom Role Modify GCS Bucket Permissions Configure Access to GCS from Your Cluster Additional Configuration Options for GCS βΆοΈ Working with the ABFS Connector βΆοΈ Introduction to Azure Storage and the ABFS Connector Feature Comparisons Setting up and configuring the ABFS connector βΆοΈ Configuring the ABFS Connector βΆοΈ Authenticating with ADLS Gen2 Configuring Access to Azure on CDP Public Cloud Configuring Access to Azure on Cloudera Private Cloud Base ADLS Proxy Setup βΆοΈ Performance and Scalability Hierarchical namespaces vs. 
HBase for use with Phoenix βΆοΈ Using Apache Phoenix to Store and Access Data βΆοΈ Mapping Apache Phoenix schemas to Apache HBase namespaces Enable namespace mapping βΆοΈ Associating tables of a schema to a namespace Associate table in a customized Kerberos environment Associate a table in a non-customized environment without Kerberos βΆοΈ Using secondary indexing Use strongly consistent indexing Migrate to strongly consistent indexing βΆοΈ Using transactions Configure transaction support Use transactions with tables βΆοΈ Using JDBC API Connecting to PQS using JDBC Connect to Phoenix Query Server Connect to Phoenix Query Server through Apache Knox Launching Apache Phoenix Thin Client Using non-JDBC drivers βΆοΈ Using Apache Phoenix-Spark connector Configuring Phoenix-Spark connector when both are on same cluster Configuring Phoenix-Spark connector when Phoenix is on remote cluster Phoenix-Spark connector usage examples βΆοΈ Using Apache Phoenix-Hive connector Configure Phoenix-Hive connector Apache Phoenix-Hive usage examples Limitations of Phoenix-Hive connector βΆοΈ Managing Apache Phoenix Security Managing Apache Phoenix security Enable Phoenix ACLs Configure TLS encryption manually for Phoenix Query Server βΆοΈ Managing Operational Database powered by Apache Accumulo Change root user password Find latest OpΒDB keytab Relax WAL durability βΆοΈ How to: Data Science βΆοΈ Configuring Apache Spark βΆοΈ Configuring dynamic resource allocation Customize dynamic resource allocation settings Configure a Spark job for dynamic resource allocation Dynamic resource allocation properties βΆοΈ Spark security Enabling Spark authentication Enabling Spark Encryption Running Spark applications on secure clusters Configuring HSTS for Spark Accessing compressed files in Spark Sample script to connect Spark to Ozone βΆοΈ Developing Apache Spark Applications Introduction Spark application model Spark execution model Developing and running an Apache Spark WordΒCount application Using 
the Spark DataΒFrame API βΆοΈ Building Spark Applications Best practices for building Apache Spark applications Building reusable modules in Apache Spark applications Packaging different versions of libraries with an Apache Spark application βΆοΈ Using Spark SQL SQLContext and HiveΒContext Querying files into a DataΒFrame Spark SQL example Interacting with Hive views Performance and storage considerations for Spark SQL DROP TABLE PURGE TIMESTAMP compatibility for Parquet files Accessing Spark SQL through the Spark shell Calling Hive user-defined functions (UDFs) βΆοΈ Using Spark Streaming Spark Streaming and Dynamic Allocation Spark Streaming Example Enabling fault-tolerant processing in Spark Streaming Configuring authentication for long-running Spark Streaming jobs Building and running a Spark Streaming application Sample pom.Βxml file for Spark Streaming with Kafka βΆοΈ Accessing external storage from Spark βΆοΈ Accessing data stored in Amazon S3 through Spark Examples of accessing Amazon S3 data from Spark Accessing Hive from Spark Accessing HDFS Files from Spark βΆοΈ Accessing ORC Data in Hive Tables Accessing ORC files from Spark Predicate push-down optimization Loading ORC data into DataΒFrames using predicate push-down Optimizing queries using partition pruning Enabling vectorized query execution Reading Hive ORC tables Accessing Avro data files from Spark SQL applications Accessing Parquet files from Spark SQL applications βΆοΈ Using Spark MLlib Running a Spark MLlib example Enabling Native Acceleration For MLlib Using custom libraries with Spark βΆοΈ Running Apache Spark Applications Introduction Running your first Spark application Running Spark 3 Applications Updating Spark 2 apps for Spark 3.x Running sample Spark applications βΆοΈ Configuring Spark Applications Configuring Spark application properties in spark-defaults.Βconf Configuring Spark application logging properties βΆοΈ Submitting Spark applications spark-submit command options Spark cluster 
execution overview Canary test for pyspark command Fetching Spark Maven dependencies Accessing the Spark History Server βΆοΈ Running Spark applications on YARN Spark on YARN deployment modes Submitting Spark Applications to YARN Monitoring and Debugging Spark Applications Example: Running SparkΒPi on YARN Configuring Spark on YARN Applications Dynamic allocation βΆοΈ Submitting Spark applications using Livy Configuring the Livy Thrift Server Connecting to the Apache Livy Thrift Server Using Livy with Spark Using Livy with interactive notebooks Using the Livy API to run Spark jobs βΆοΈ Running an interactive session with the Livy API Livy objects for interactive sessions Setting Python path variables for Livy Livy API reference for interactive sessions βΆοΈ Submitting batch applications using the Livy API Livy batch object Livy API reference for batch jobs βΆοΈ Using PyΒSpark Running PyΒSpark in a virtual environment Running Spark Python applications Automating Spark Jobs with Oozie Spark Action βΆοΈ Tuning Apache Spark Introduction Check Job Status Check Job History Improving Software Performance βΆοΈ Tuning Apache Spark Applications Tuning Spark Shuffle Operations Choosing Transformations to Minimize Shuffles When Shuffles Do Not Occur When to Add a Shuffle Transformation Secondary Sort Tuning Resource Allocation Resource Tuning Example Tuning the Number of Partitions Reducing the Size of Data Structures Choosing Data Formats βΆοΈ CDS 3 Powered by Apache Spark CDS 3.2.3 Overview CDS 3.2.3 Requirements Installing CDS 3.2.3 Enabling Spark rolling event log files in CDP Enabling CDS 3.2.3 with GPU Support Updating Spark 2 apps for Spark 3 Running Spark 3 Applications with CDS 3.2.3 Running applications with CDS 3.2.3 with GPU Support CDS 3.2.3 Packaging, and Download Using the CDS 3.2.3 Maven Repo CDS 3.2.3 Maven Artifacts βΆοΈ Cumulative hotfixes for CDS Cumulative hotfix CDS 3.2.7172000.3-3 Cumulative hotfix CDS 3.2.7172000.6-1 Cumulative hotfix CDS 3.2.7172000.8-1 
Cumulative hotfix CDS 3.2.7172000.9-1 Cumulative hotfix CDS 3.2.7172000.10-1 Cumulative hotfix CDS 3.2.7172000.12-1 Cumulative hotfix CDS 3.2.7172000.13-4 Cumulative hotfix CDS 3.2.7172000.14-1 Cumulative hotfix CDS 3.2.7172000.15-1 Cumulative hotfix CDS 3.2.7172000.16-1 Cumulative hotfix CDS 3.2.7173000.2-1 Cumulative hotfix CDS 3.2.7173000.3-1 Cumulative hotfix CDS 3.2.7173000.4-1 βΆοΈ Configuring Apache Zeppelin Introduction Configuring Zeppelin caching Configuring Livy Configure User Impersonation for Access to Hive Configure User Impersonation for Access to Phoenix βΆοΈ Enabling Access Control for Zeppelin Elements Enable Access Control for Interpreter, Configuration, and Credential Settings Enable Access Control for Notebooks Enable Access Control for Data βΆοΈ Shiro Settings: Reference Active Directory Settings LDAP Settings General Settings shiro.Βini Example βΆοΈ Using Apache Zeppelin Introduction Launch Zeppelin βΆοΈ Working with Zeppelin Notes Create and Run a Note Import a Note Export a Note Using the Note Toolbar Import External Packages βΆοΈ Configuring and Using Zeppelin Interpreters Modify interpreter settings Using Zeppelin Interpreters Customize interpreter settings in a note Use the JDBC interpreter to access Hive Use the Livy interpreter to access Spark Using Spark Hive Warehouse and HBase Connector Client .jar files with Livy βΆοΈ How to: Security βΆοΈ Configuring Authentication in Cloudera Manager Overview Kerberos Security Artifacts Overview Kerberos Configuration Strategies for CDP βΆοΈ Configuring Authentication in Cloudera Manager Cloudera Manager user accounts βΆοΈ Configuring external authentication and authorization for Cloudera Manager Configuring PAM authentication with LDAP and SSSD Configuring PAM authentication with Linux users Configuring PAM authentication using Apache Knox Configure authentication using Active Directory Configure authentication using an LDAP-compliant identity service Configure authentication using Kerberos 
(SPNEGO) Configure authentication using an external program Configure authentication using SAML βΆοΈ Enabling Kerberos Authentication for CDP Step 1: Install Cloudera Manager and CDP Step 2: Create the Kerberos Principal for Cloudera Manager Server Step 3: Enable Kerberos using the wizard Step 4: Create the HDFS superuser Step 5: Get or create a Kerberos principal for each user account Step 6: Prepare the cluster for each user Step 7: Verify that Kerberos security is working Step 8: (Optional) Enable authentication for HTTP web consoles for Hadoop roles Kerberos authentication for non-default users βΆοΈ Customizing Kerberos principals Configuring custom Kerberos principal for Atlas Configuring custom Kerberos principal for Cruise Control Configuring custom Kerberos principal for Apache Flink Configuring custom Kerberos principal for HBase Configuring custom Kerberos principal for HDFS Configuring custom Kerberos principal for Hive and Hive-on-Tez Configuring custom Kerberos principal for HttpΒFS Configuring custom Kerberos principal for Hue Configuring Kerberos Authentication Configuring custom Kerberos principal for Kafka Configuring custom Kerberos principal for Knox Configuring custom Kerberos principal for Kudu Configuring custom Kerberos principal for Livy Configuring custom Kerberos principal for NiΒFi and NiΒFi Registry Configuring custom Kerberos principal for Omid Configuring custom Kerberos principal for Oozie Configuring custom Kerberos principal for Ozone Configuring custom Kerberos principal for Phoenix Configuring custom Kerberos principal for Schema Registry Configuring custom Kerberos principals and custom system users for Solr Configuring custom Kerberos principal for Spark Configuring custom Kerberos principal for Streams Messaging Manager Configuring custom Kerberos principal for SQL Stream Builder Configuring custom Kerberos principal for Streams Replication Manager Enabling custom Kerberos principal support in YARN Enabling custom Kerberos 
principal support in a Queue Manager cluster Configuring custom Kerberos principal for Zeppelin Configuring custom Kerberos principal for ZooΒKeeper Managing Kerberos credentials using Cloudera Manager Using a custom Kerberos keytab retrieval script Adding trusted realms to the cluster Using auth-to-local rules to isolate cluster users Configuring a dedicated MIT KDC for cross-realm trust Integrating MIT Kerberos and Active Directory Hadoop Users (user:group) and Kerberos Principals Mapping Kerberos Principals to Short Names βΆοΈ Cloudera Authorization Overview Configuring LDAP Group Mappings Using Ranger to Provide Authorization in CDP βΆοΈ Encrypting Data in Transit Encrypting Data in Transit Understanding Keystores and Truststores Disabling TLS protocols on JMX ports Choosing manual TLS or Auto-TLS SAN Certificates βΆοΈ Configuring TLS Encryption for Cloudera Manager Using Auto-TLS Use case 1: Use Cloudera Manager to generate internal CA and corresponding certificates βΆοΈ Use case 2: Enabling Auto-TLS with an intermediate CA signed by an existing Root CA Certmanager Options - Using CM's GenerateΒCMCA API Use case 3: Enabling Auto-TLS with Existing Certificates Manually Configuring TLS Encryption for Cloudera Manager βΆοΈ Configuring TLS/SSL encryption manually for CDP Services Configuring TLS encryption manually for Apache Atlas Enable security for Cruise Control Configuring TLS/SSL encryption manually for DAS using Cloudera Manager Enabling security for Apache Flink βΆοΈ Configuring TLS/SSL for HBase Prerequisites to configure TLS/SSL for HBase Configuring TLS/SSL for HBase Web UIs Configuring TLS/SSL for HBase REST Server Configuring TLS/SSL for HBase Thrift Server Enabling TLS/SSL for HiveΒServer βΆοΈ Configuring TLS/SSL for Hue Creating a truststore file in PEM format Configuring Hue as a TLS/SSL client Enabling Hue as a TLS/SSL client Configuring Hue as a TLS/SSL server Enabling Hue as a TLS/SSL server using Cloudera Manager Enabling TLS/SSL for Hue Load 
Balancer Enabling TLS/SSL communication with HiveΒServer2 Enabling TLS/SSL communication with Impala Securing database connections with TLS/SSL Configuring Impala TLS/SSL βΆοΈ Channel encryption Configure Kafka brokers Configure Kafka MirrorΒMaker Configuring TLS/SSL encryption Configure Kafka clients Configure Zookeeper TLS/SSL support for Kafka βΆοΈ Authentication βΆοΈ TLS/SSL client authentication Configure Kafka brokers Configure Kafka clients Principal name mapping Inter-broker security Configuring multiple listeners βΆοΈ Configuring TLS/SSL encryption manually for Key Trustee Server Key Trustee Server Properties for TLS βΆοΈ Configuring TLS/SSL encryption manually for Apache Knox Knox Properties for TLS Configuring TLS/SSL encryption for Kudu using Cloudera Manager Configure Lily HBase Indexer to use TLS/SSL Configuring TLS/SSL encryption manually for Livy βΆοΈ Configuring TLS/SSL manually TLS/SSL certificate requirements and recommendations Configuring TLS/SSL encryption manually for NiΒFi and NiΒFi Registry NiΒFi TLS/SSL properties NiΒFi Registry TLS/SSL Properties Configure TLS/SSL for Oozie Configure TLS encryption manually for Phoenix Query Server Configure TLS/SSL encryption manually for Apache Ranger βΆοΈ Configure TLS/SSL encryption manually for Ranger KMS Overriding custom keystore alias on a Ranger KMS Server Configure TLS/SSL encryption manually for Ranger RMS Configuring TLS encryption manually for Schema Registry βΆοΈ Configure TLS/SSL encryption for Solr Using a load balancer Configuring TLS/SSL encryption manually for Spark Encryption in SSB Enabling TLS/SSL for the SRM service βΆοΈ Enabling TLS Encryption for SMM on CDP Private Cloud TLS/SSL settings for Streams Messaging Manager βΆοΈ Configuring TLS/SSL for Core Hadoop Services Configuring TLS/SSL for HDFS Configuring TLS/SSL for YARN Configuring TLS/SSL encryption manually for Zeppelin Configure ZooΒKeeper TLS/SSL using Cloudera Manager Manually Configuring TLS Encryption on the Agent 
Listening Port βΆοΈ Encrypting Data at Rest Encrypting Data at Rest Data at Rest Encryption Reference Architecture Data at Rest Encryption Requirements Resource Planning for Data at Rest Encryption βΆοΈ HDFS Transparent Encryption βΆοΈ Key Concepts and Architecture Keystores and the Key Management Server Data Encryption Components and Solutions Encryption Zones and Keys Accessing Files Within an Encryption Zone Optimizing Performance for HDFS Transparent Encryption βΆοΈ Managing Encryption Keys and Zones Validating Hadoop Key Operations Creating Encryption Zones Adding Files to an Encryption Zone Deleting Encryption Zones Backing Up Encryption Keys Rolling Encryption Keys Deleting Encryption Zone Keys βΆοΈ Re-encrypting Encrypted Data Encryption Keys (EDEKs) Benefits and Capabilities Prerequisites and Assumptions Limitations Re-encrypting an EDEK Managing Re-encryption Operations βΆοΈ Securing the Key Management System (KMS) Enabling Kerberos Authentication for the KMS Configuring TLS/SSL for the KMS Migrating Keys from a Java KeyΒStore to Cloudera Navigator Key Trustee Server βΆοΈ Migrating Ranger Key Management Server Role Instances to a New Host Migrate the Ranger Admin role instance to a new host Migrate the Ranger KMS db role instance to a new host Migrate the Ranger KMS KTS role instance to a new host βΆοΈ Migrating ACLs from Key Trustee KMS to Ranger KMS Key Trustee KMS operations not supported by Ranger KMS ACLs supported by Ranger KMS and Ranger KMS Mapping βΆοΈ Configuring CDP Services for HDFS Encryption Transparent Encryption Recommendations for HBase βΆοΈ Transparent Encryption Recommendations for Hive Changed Behavior after HDFS Encryption is Enabled KMS ACL Configuration for Hive Transparent Encryption Recommendations for Hue Transparent Encryption Recommendations for Impala Transparent Encryption Recommendations for MapΒReduce and YARN Transparent Encryption Recommendations for Search Transparent Encryption Recommendations for Spark Transparent 
Encryption Recommendations for Sqoop βΆοΈ Integrating Components for Encrypting Data at Rest Set up Luna 7 HSM for Ranger KMS w/database Set up Luna 6 HSM for Ranger KMS, KTS, and KeyΒHSM Set up Luna 7 HSM for Ranger KMS, KTS, and KeyΒHSM Set up GCP Cloud HSM for Ranger KMS, KTS, and KeyΒHSM Setting up CipherΒTrust HSM for Ranger KMS, KTS, and KeyΒHSM Integrating Ranger KMS DB with Google Cloud HSM Integrating Ranger KMS DB with CipherΒTrust Manager HSM Integrating Ranger KMS DB with SafeΒNet Keysecure HSM Connecting KeyΒSecure HSM to CipherΒTrust Manager after migration from Key Secure HSM βΆοΈ Using the Ranger Key Management Service Accessing the Ranger KMS Web UI List and Create Keys Roll Over an Existing Key Delete a Key βΆοΈ Navigator Key Trustee Server βΆοΈ Cloudera Navigator Key Trustee Server Overview Key Trustee Server System Requirements Cloudera Navigator Key Trustee Server βΆοΈ Backing up Key Trustee Server and clients Back up Key Trustee Server using Cloudera Manager Back up Key Trustee Server using the ktbackup.Βsh script Back up Key Trustee Server manually Back up Key Trustee Server clients βΆοΈ Restoring Navigator Key Trustee Server Restore Key Trustee Server in parcel-based installations Restore Key Trustee Server in package-based installations Restore Key Trustee Server from ktbackup.Βsh backups βΆοΈ Initializing Standalone Key Trustee Server Initializing Standalone Key Trustee Server Using Cloudera Manager Specifying TLS/SSL Minimum Allowed Version and Ciphers Configuring a Mail Transfer Agent for Key Trustee Server Verifying Cloudera Navigator Key Trustee Server Operations Managing Key Trustee Server Organizations βΆοΈ Managing Key Trustee Server Certificates Generating a New Certificate Replacing Key Trustee Server Certificates βΆοΈ Setting Up Key Trustee Server High Availability Configuring Key Trustee Server High Availability Using Cloudera Manager Recovering a Key Trustee Server βΆοΈ Navigator Encrypt Navigator Encrypt Overview Registering 
Cloudera Navigator Encrypt with Key Trustee Server Preparing for Encryption Using Cloudera Navigator Encrypt Encrypting and Decrypting Data Using Cloudera Navigator Encrypt Converting from Device Names to UUIDs for Encrypted Devices Navigator Encrypt Access Control List Maintaining Cloudera Navigator Encrypt βΆοΈ Navigator Key HSM Cloudera Navigator Key HSM Overview Initializing Navigator Key HSM HSM-Specific Setup for Cloudera Navigator Key HSM Validating Key HSM Settings Managing the Navigator Key HSM Service Integrating Key HSM with Key Trustee Server βΆοΈ Apache Ranger Access Control and Auditing βΆοΈ Apache Ranger Auditing Audit Overview βΆοΈ Managing Auditing with Ranger View audit details Create a read-only Admin user (Auditor) Configuring Ranger audit properties for Solr Configuring Ranger audit properties for HDFS βΆοΈ Ranger Audit Filters Default Ranger audit filters Configuring a Ranger audit filter policy How to set audit filters in Ranger Admin Web UI Filter service access logs from Ranger UI Excluding audits for specific users, groups, and roles Changing Ranger audit storage location and migrating data Configuring Ranger audits to show actual client IP address βΆοΈ Apache Ranger Authorization Using Ranger to Provide Authorization in CDP Ranger special entities βΆοΈ Ranger Policies Overview Ranger tag-based policies Tags and policy evaluation Ranger access conditions βΆοΈ Using the Ranger Console Accessing the Ranger console Ranger console navigation βΆοΈ Resource-based Services and Policies βΆοΈ Configuring resource-based services Configure a resource-based service: Atlas Configure a resource-based service: HBase Configure a resource-based service: HDFS Configure a resource-based service: HadoopΒSQL Configure a resource-based service: Kafka Configure a resource-based service: Knox Configure a resource-based service: NiΒFi Configure a resource-based service: NiΒFi Registry Configure a resource-based service: Solr Configure a resource-based service: 
YARN βΆοΈ Configuring resource-based policies Configure a resource-based policy: Atlas Configure a resource-based policy: HBase Configure a resource-based policy: HDFS Configure a resource-based policy: HadoopΒSQL Configure a resource-based storage handler policy: HadoopΒSQL Configure a resource-based policy: Kafka Configure a resource-based policy: Knox Configure a resource-based policy: NiΒFi Configure a resource-based policy: NiΒFi Registry Configure a resource-based policy: Solr Configure a resource-based policy: YARN Wildcards and variables in resource-based policies Adding a policy label to a resource-based policy Preloaded resource-based services and policies βΆοΈ Importing and exporting resource-based policies Import resource-based policies for a specific service Import resource-based policies for all services Export resource-based policies for a specific service Export all resource-based policies for all services βΆοΈ Row-level filtering and column masking in Hive Row-level filtering in Hive with Ranger policies Dynamic resource-based column masking in Hive with Ranger policies Dynamic tag-based column masking in Hive with Ranger policies βΆοΈ Tag-based Services and Policies Adding a tag-based service βΆοΈ Adding tag-based policies Using tag attributes and values in Ranger tag-based policy conditions Adding a tag-based PII policy Default EXPIRES ON tag policy βΆοΈ Importing and exporting tag-based policies Import tag-based policies Export tag-based policies Create a time-bound policy Create a Hive authorizer URL policy βΆοΈ Ranger Security Zones Security Zones Administration Security Zones Example Use Cases Adding a Ranger security zone βΆοΈ Administering Ranger Users, Groups, Roles, and Permissions Add a user Edit a user Delete a user Add a group Edit a group Delete a group Add a role through Ranger Add a role through Hive Edit a role Delete a role Add or edit permissions βΆοΈ Administering Ranger Reports View Ranger reports Search Ranger reports Export 
Ranger reports Using Ranger client libraries Using session cookies to validate Ranger policies βΆοΈ Apache Ranger User Management Ranger Usersync Configure Usersync assignment of Admin users Configure Ranger Usersync for Deleted Users and Groups Configure Ranger Usersync for invalid usernames Adding default service users and roles for Ranger Set credentials for Ranger Usersync Ranger user management βΆοΈ Configuring Ranger Authentication with UNIX, LDAP, or AD βΆοΈ Configuring Ranger Authentication with UNIX, LDAP, AD, or PAM Configure Ranger authentication for UNIX Configure Ranger authentication for AD Configure Ranger authentication for LDAP Configure Ranger authentication for PAM βΆοΈ Ranger AD Integration Ranger UI authentication Ranger UI authorization βΆοΈ Configuring Advanced Security Options for Apache Ranger Configuring the server work directory path for a Ranger service Configure session timeout for Ranger Admin Web UI Configure Kerberos authentication for Apache Ranger Configure TLS/SSL encryption manually for Apache Ranger βΆοΈ Configure TLS/SSL encryption manually for Ranger KMS Overriding custom keystore alias on a Ranger KMS Server Configure TLS/SSL encryption manually for Ranger RMS βΆοΈ Configuring Apache Ranger High Availability Configure Ranger Admin High Availability Configure Ranger Admin High Availability with a Load Balancer Migrating Ranger Usersync and Tagsync role groups Configuring JVM options and system properties for Ranger services How to pass JVM options to Ranger KMS services How to clear Ranger Admin access logs Enable Ranger Admin login using kerberos authentication How to configure Ranger HDFS plugin configs per (NameΒNode) Role Group How to add a coarse URI check for Hive agent How to suppress database connection notifications How to change the password for Ranger users βΆοΈ Configuring and Using Hive-HDFS ACL Sync Ranger RMS - HIVE-HDFS ACL Sync Overview Analyzing Ranger RMS resources How to full sync the Ranger RMS database 
Configure High Availability for Hive-HDFS ACL Sync Configure Hive-HDFS ACL Sync Hive-HDFS ACL Sync Use Cases Hive-HDFS ACL Sync Reference βΆοΈ Configuring and Using Ranger KMS βΆοΈ Configuring Ranger KMS High Availability Configure High Availability for Ranger KMS with DB Configure High Availability for Ranger KMS with KTS βΆοΈ Apache Knox Authentication βΆοΈ Apache Knox Overview Securing Access to Hadoop Cluster: Apache Knox Apache Knox Gateway Overview Knox Supported Services Matrix Knox Topology Management in Cloudera Manager Considerations for Knox Proxy Cloudera Manager through Apache Knox βΆοΈ Installing Apache Knox Apache Knox Install Role Parameters βΆοΈ Management of Knox shared providers in Cloudera Manager Configure Apache Knox authentication for PAM Configure Apache Knox authentication for AD/LDAP Configure Apache Knox authentication for SAML Add a new shared provider configuration TLS Mutual Authentication βΆοΈ Management of existing Apache Knox shared providers Add a new provider in an existing provider configuration Modify a provider in an existing provider configuration Disable a provider in an existing provider configuration Remove a provider parameter in an existing provider configuration Saving aliases Configuring Kerberos authentication in Apache Knox shared providers βΆοΈ Management of services for Apache Knox via Cloudera Manager Enable proxy for a known service in Apache Knox Disable proxy for a known service in Apache Knox Add custom service to existing descriptor in Apache Knox Proxy Add a custom descriptor to Apache Knox βΆοΈ Management of Service Parameters for Apache Knox via Cloudera Manager Add custom service parameter to descriptor Modify custom service parameter in descriptor Remove custom service parameter from descriptor βΆοΈ Additional Security Topics How to Add Root and Intermediate CAs to Truststore for TLS/SSL Amazon S3 Security How to Authenticate Kerberos Principals Using Java Check Cluster Security Settings Configure 
Antivirus Software on CDP Hosts Configure Browser-based Interfaces to Require Authentication (SPNEGO) Configure Browsers for Kerberos Authentication (SPNEGO) Configure Cluster to Use Kerberos Authentication Convert DER, JKS, PEM Files for TLS/SSL Artifacts Configure Authentication for Amazon S3 Configure Encryption for Amazon S3 Configure AWS Credentials Enable Sensitive Data Redaction Log a Security Support Case Obtain and Deploy Keys and Certificates for TLS/SSL Renew and Redistribute Certificates Set Up a Gateway Host to Restrict Access to the Cluster Set Up Access to Cloudera EDH (Microsoft Azure Marketplace) Use Self-Signed Certificates for TLS βΆοΈ Configuring Infra Solr Configure Ranger authorization for Infra Solr Configuring custom Kerberos principals and custom system users for Solr βΆοΈ How to: Governance βΆοΈ Searching with Metadata Searching overview Using Basic Search Using Search filters Using Free-text Search Saving searches Using advanced search Atlas index repair configuration βΆοΈ Working with Classifications and Labels Working with Atlas classifications and labels Creating classifications Creating labels Adding attributes to classifications Associating classifications with entities Propagating classifications through lineage Searching for entities using classifications βΆοΈ Exploring using Lineage Lineage overview Viewing lineage Lineage lifecycle βΆοΈ Leveraging Business Metadata Business Metadata overview Creating Business Metadata Adding attributes to Business Metadata Associating Business Metadata attributes with entities Importing Business Metadata associations in bulk Searching for entities using Business Metadata attributes βΆοΈ Managing Business Terms with Atlas Glossaries Glossaries overview Creating glossaries Creating terms Associating terms with entities Defining related terms Creating categories Assigning terms to categories Searching using terms βΆοΈ Importing Glossary terms in bulk Enhancements related to bulk glossary terms 
Accessing ORC files from Spark Accessing Ozone object store with Amazon Boto3 client Accessing Parquet files from Spark SQL applications Accessing Spark SQL through the Spark shell Accessing Storage Using Amazon S3 Accessing Storage Using Microsoft ADLS Accessing the Cloudera Manager Admin Console Accessing the Cloudera Manager Admin Console Accessing the Cloudera Manager Admin Console Accessing the Directory Usage Report Accessing the License Page Accessing the Oozie server with a browser Accessing the Oozie server with the Oozie Client Accessing the Ranger console Accessing the Ranger KMS Web UI Accessing the Spark History Server Accessing the tracing web interface Accessing the Web UI of a Completed Spark Application Accessing the Web UI of a Running Spark Application Accommodate HMS changes for Hive replication policies Accumulo Metrics Achieving cross-cluster availability through Hive Load Balancer failover ACID operations ACL examples ACLS on HDFS features ACLs supported by Ranger KMS and Ranger KMS Mapping Activate read replicas on a table Activating Hive query editor on Hue UI Activating the Hive web UI Active / Active Architecture Active / Stand-by Architecture Active Database Health Tests Active Database Metrics Active Directory Settings Active Key Trustee Server Health Tests Active Key Trustee Server Metrics Activity Charts Activity Metrics Activity Monitor - Unsupported Since 7.0.0 Health Tests Activity Monitor - Unsupported Since 7.0.0 Metrics Activity, Application, and Query Reports ACTIVITY_EVENT Category Add a custom coprocessor Add a custom descriptor to Apache Knox Add a group Add a new provider in an existing provider configuration Add a new shared provider configuration Add a role through Hive Add a role through Ranger Add a user Add a ZooKeeper service Add Accumulo on CDP service Add Accumulo on CDP service Add Accumulo on CDP service Add custom service parameter to descriptor Add custom service to existing descriptor in Apache Knox Proxy Add 
HDFS system mount Add or edit permissions Add Queue Add queues using YARN Queue Manager UI Add secure Accumulo on CDP service to your cluster Add secure Accumulo on CDP service to your cluster Add secure Accumulo on CDP service to your cluster Add source cluster as peer to use in replication policies Add storage directories using Cloudera Manager Add Streams Replication Manager to an existing cluster Add the HttpFS role Add the user or group to a pre-defined access policy Add unsecure Accumulo on CDP service to your cluster Add unsecure Accumulo on CDP service to your cluster Add unsecure Accumulo on CDP service to your cluster Add-on Services Adding a Cluster Using Currently Managed Hosts Adding a Cluster Using New Hosts Adding a Compute Cluster and Data Context Adding a custom banner in Hue Adding a Filter Adding a HiveServer role Adding a HiveServer role Adding a Host to a Cluster Adding a Hue role instance with Cloudera Manager Adding a Hue service with Cloudera Manager Adding a load balancer Adding a New Chart to the Custom Dashboard Adding a new schema Adding a policy label to a resource-based policy Adding a Ranger security zone Adding a Role Instance Adding a Service Adding a splash screen in Hue Adding a tag-based PII policy Adding a tag-based service Adding an Event Filter Adding and Configuring Record Reader and Writer Controller Services Adding and Deleting Clusters Adding and Removing Charts from a Dashboard Adding and Removing Range Partitions Adding attributes to Business Metadata Adding attributes to classifications Adding clusters to SRM's configuration Adding Cruise Control as a service Adding default service users and roles for Ranger Adding Files to an Encryption Zone Adding schema to Oozie using Cloudera Manager Adding self-healing goals to Cruise Control in Cloudera Manager Adding tag-based policies Adding the Lily HBase Indexer Service Adding the Oozie service using Cloudera Manager Adding trusted realms to the cluster Additional 
Configuration Options for GCS Additional considerations when configuring TLS/SSL for Oozie HA Additional HDFS haadmin commands to administer the cluster Additional Security Topics Additional Steps for Apache Ranger Adjust the Solr replication factor for index files stored in HDFS ADLS Connector Properties in Cloudera Runtime 7.1.7 ADLS Proxy Setup ADLS Trash Folder Behavior Admin ACLs Administering Hue Administering Ranger Reports Administering Ranger Users, Groups, Roles, and Permissions Administrative commands Administrative tools for Hive Metastore integration Admission Control and Query Queuing Admission Control Sample Scenario Advanced Committer Configuration Advanced configuration for write-heavy workloads Advanced erasure coding configuration Advanced ORC properties Advanced partitioning Advantages of defining a schema for production use Advantages of Parcels Advantages of Separating Compute and Data Resources After Evaluating Trial Software After You Install Agent Hosts Agent Metrics Aggregate functions Aggregating and grouping data Aggregation for Analytics Alert Publisher Alert Publisher Health Tests Alert Publisher Metrics Alerts Allocating DataNode memory as storage Allocating Hosts for Key Trustee Server and Key Trustee KMS Already present: FS layout already exists Alter a table ALTER DATABASE statement ALTER MATERIALIZED VIEW REBUILD ALTER MATERIALIZED VIEW REWRITE ALTER TABLE statement ALTER VIEW statement Amazon S3 Security Amazon S3 Sink Connector Amazon S3 Sink Connector Properties Reference Analysing event analysis Analytic functions Analyzing Ranger RMS resources Apache Atlas Advanced Search language reference Apache Atlas dashboard tour Apache Atlas metadata attributes Apache Atlas metadata collection overview Apache Atlas Reference Apache Atlas Statistics reference Apache Atlas technical metadata migration reference Apache Hadoop HDFS Overview Apache Hadoop YARN Overview Apache Hadoop YARN Reference Apache HBase Overview Apache Hive 3 ACID 
transactions Apache Hive 3 architectural overview Apache Hive 3 tables Apache Hive content roadmap Apache Hive features Apache Hive Materialized View Commands Apache Hive Metastore Overview Apache Hive Overview Apache Hive Performance Tuning Apache Hive query basics Apache Hive Reference Apache Hive-Kafka integration Apache Impala Overview Apache Impala Reference Apache Impala SQL Overview Apache Impala SQL Reference Apache Kafka Overview Apache Knox Authentication Apache Knox Gateway Overview Apache Knox Install Role Parameters Apache Knox Install Role Parameters Apache Knox Overview Apache Kudu Background Operations Apache Kudu Overview Apache Kudu usage limitations Apache Ozone Overview Apache Phoenix and SQL Apache Phoenix Command Reference Apache Phoenix Frequently Asked Questions Apache Phoenix Performance Tuning Apache Phoenix SQL command reference Apache Phoenix-Hive usage examples Apache Ranger Access Control and Auditing Apache Ranger Auditing Apache Ranger Authorization Apache Ranger User Management Apache Spark executor task statistics Apache Spark Overview Apache Spark Overview Apache Zeppelin Overview API Compatibility changes in 7.1.7 SP3 for Spark API Compatibility changes in 7.1.7 SP3 for Zookeeper APIs for accessing HDFS Application ACL evaluation Application ACLs Application logs' ACLs Application not running message Application reservations Applications and permissions reference Applying a Host Template to a Host APPX_MEDIAN function Architecture Architecture ARRAY complex type ASCII codec error Assign or unassign a node to a partition Assign Roles Assigning administrator privileges to users Assigning superuser status to an LDAP user Assigning terms to categories Associate a table in a non-customized environment without Kerberos Associate partitions with queues Associate table in a customized Kerberos environment Associating Business Metadata attributes with entities Associating classifications with entities Associating tables of a schema to a 
namespace Associating terms with entities Atlas Atlas Atlas Atlas Atlas classifications drive Ranger policies Atlas Export and Import Operations Atlas Health Tests Atlas Hook for Sqoop Atlas index repair configuration Atlas metadata model overview Atlas Metrics Atlas NiFi audit entries Atlas NiFi relationships Atlas Properties in Cloudera Runtime 7.1.7 Atlas Server Health Tests Atlas Server Metrics Atlas Server Operations Atlas Type Definitions Attempt Metrics Audit enhancements Audit Operations Audit Overview Auditing Atlas Entities Auditing purged entities Audits AUDIT_EVENT Category Authenticating with ADLS Gen2 Authentication Authentication Authentication Authentication Authentication Authentication and Kerberos Issues Authentication Server Health Tests Authentication Server Load Balancer Health Tests Authentication Server Load Balancer Metrics Authentication Server Metrics Authentication Service Health Tests Authentication Service Metrics Authentication using Kerberos Authentication using Knox SSO Authentication using LDAP Authentication using SAML Authorization Authorization Authorization Authorization Exception error Authorizing external tables Auto-TLS Agent File Locations Auto-TLS Requirements and Limitations Autoconfiguration Automatic Invalidation of Metadata Cache Automatic Invalidation of Metadata Cache Automatic Invalidation/Refresh of Metadata Automatic Invalidation/Refresh of Metadata Automatic Logout Automatic Logout Automating partition discovery and repair Automating Spark Jobs with Oozie Spark Action AVG AVG function Avro Avro AWS S3 entity metadata migration Back up HDFS metadata Back up HDFS metadata using Cloudera Manager Back up Key Trustee Server clients Back up Key Trustee Server manually Back up Key Trustee Server using Cloudera Manager Back up Key Trustee Server using the ktbackup.sh script Back up tables Backing up a collection from HDFS Backing up a collection from local file system Backing up and Recovering Apache Kudu Backing up and 
restoring data Backing up Cloudera Manager databases Backing Up Encryption Keys Backing up HDFS metadata Backing up Key Trustee Server and clients Backing up NameNode metadata Backing up the Cloudera Manager configuration Backing up the Hue database Backup directory structure Backup tools Balancer commands Balancing data across an HDFS cluster Balancing data across disks of a DataNode Basic partitioning Basics Batch Indexing Batch indexing into offline Solr shards Batch indexing into online Solr servers using GoLive Batch indexing to Solr using SparkApp framework Before You Begin a Trial Installation Before You Install Before You Install Behavioral changes in Apache HBase Behavioral changes in Apache Hive Behavioral changes in Apache Hive Behavioral changes in Apache Hive Behavioral changes in Apache Impala Behavioral changes in Cloudera Runtime 7.1.7 Behavioral changes in Cloudera Runtime 7.1.7 SP1 Behavioral changes in Cloudera Runtime 7.1.7 SP2 Behavioral changes in Cloudera Runtime 7.1.7 SP3 Behavioral changes in Cloudera Runtime 7.1.7 SP3 CHF5 Behavioral changes in Cloudera Runtime 7.1.7 SP3 CHF6 Behavioral Changes in Cloudera Search Behavioral Changes in Cloudera Search Benefits and Capabilities Benefits of centralized cache management in HDFS Best practices for building Apache Spark applications Best practices for performance tuning Best practices for rack and node setup for EC Best practices when adding new tablet servers Best practices when using RegionServer grouping Bidirectional replication example of two active clusters Bidirectional Replication Flows BIGINT data type Bit functions Block cache size Block move execution Block move scheduling BOOLEAN data type Bring a tablet that has lost a majority of replicas back online Broker garbage collection log configuration Broker log management Broker migration Broker Tuning Brokers Browse HBase tables Browse HDFS directories BucketCache IO engine Bucketed tables in Hive Building a Chart with Time-Series Data 
Building and deploying UDFs Building and running a Spark Streaming application Building Cloudera Manager charts with Kafka metrics Building reusable modules in Apache Spark applications Building Spark Applications Building the project and upload the JAR Built-in functions Bulk Write Access Business Metadata overview Bypass the BlockCache Cache eviction priorities Caching terminology Calculating Infra Solr resource needs Calculations for reports Calling Hive user-defined functions (UDFs) Calling the UDF in a query Canary test for pyspark command Cancelling a Query Cannot alter compressed tables in Hue Cannot see databases, or the query editor is missing Cannot see the DAG ID and the Application ID Cannot view queries of other users Catalog operations CDP 7.1.7 SP2 and 7.1.7 SP3 Components with API differences CDP Private Cloud Base CDP Private Cloud Base API Modifications and Removals CDP Private Cloud Base Installation Guide CDP Private Cloud Base Requirements and Supported Versions CDP Private Cloud Base service groups and component reference CDP Private Cloud Base Trial Download Information CDP PVC Base - Data Engineering CDP PVC Base - Data Warehouse CDP PVC Base - Enterprise Essentials CDP PVC Base - Operational Database CDP Security Overview CDS 3 Powered by Apache Spark CDS 3.2.3 Maven Artifacts CDS 3.2.3 Overview CDS 3.2.3 Packaging, and Download CDS 3.2.3 Requirements Centralized cache management architecture Certmanager Options - Using CM's GenerateCMCA API Change master hostnames Change Queue Capacities Change Queue Properties Change resource allocation mode Change root user password Change the HDFS trash settings in Cloudera Manager Changed Behavior after HDFS Encryption is Enabled Changes for a cluster Changes for a service, role, or host Changing a nameservice name for Highly Available HDFS using Cloudera Manager Changing directory configuration Changing Embedded PostgreSQL Database Passwords Changing Hostnames Changing Ranger audit storage location 
and migrating data Changing the Anomaly Notifier Class value to self-healing Changing the Chart Type Changing the Configuration of a Service or Role Instance Changing the Hive warehouse location Changing the page logo in Hue Changing the retention period of DAS event logs Changing the Upgrade Domain for hosts Channel encryption Channel encryption CHAR data type CHAR data type support Chart Properties Charting Time-Series Data Charts Charts Library Check Cluster Security Settings Check Job History Check Job Status Check MySQL isolation configuration Check trace table Check trace table Checking Host Heartbeats Checking query execution Choose the right import method Choosing and Configuring Data Compression Choosing and Running a Filter Choosing and Running a Filter Choosing Data Formats Choosing manual TLS or Auto-TLS Choosing the number of partitions for a topic Choosing the Sufficient Security Level for Your Environment Choosing Transformations to Minimize Shuffles ClassNotFoundException: com.cloudera.kudu.hive.KuduStorageHandler Cleaning up after failed jobs Cleaning up old data to improve performance Cleaning up old queries, DAG information, and reports data Cleaning up old queries, DAG information, and reports data using Ambari CLI commands to perform snapshot operations CLI tool support Client and broker compatibility across Kafka versions Client authentication to secure Kudu clusters Client authentication using delegation tokens Client Configuration Files Client connections to HiveServer Client examples Client examples Closing HiveWarehouseSession operations Cloud storage connectors overview Cloudera Authorization Cloudera license requirements for Replication Manager Cloudera Logging is now available in CDP Private Cloud Base 7.1.7 SP1 Cloudera Management Service Cloudera Management Service Cloudera Management Service Cloudera Management Service Health Tests Cloudera Management Service Metrics Cloudera Manager Cloudera Manager Cloudera Manager 7.11.3 
Cumulative hotfix 4 (CDP Private Cloud Base 7.1.7 SP3) Cloudera Manager 7.4.4 Release Notes Cloudera Manager 7.6.1 Cumulative hotfix 1 Cloudera Manager 7.6.1 Cumulative hotfix 2 Cloudera Manager 7.6.1 Cumulative hotfix 3 Cloudera Manager 7.6.1 Cumulative hotfix 4 Cloudera Manager 7.6.1 Cumulative hotfix 5 Cloudera Manager 7.6.1 Cumulative hotfix 6 Cloudera Manager 7.6.1 Cumulative hotfix 7 Cloudera Manager 7.6.1 Cumulative hotfix 8 Cloudera Manager 7.6.1 Cumulative hotfix 9 Cloudera Manager 7.6.1 Release Notes (CDP Private Cloud Base 7.1.7 SP1) Cloudera Manager 7.6.7 Cumulative hotfix 1 Cloudera Manager 7.6.7 Cumulative hotfix 10 Cloudera Manager 7.6.7 Cumulative hotfix 11 Cloudera Manager 7.6.7 Cumulative hotfix 12 Cloudera Manager 7.6.7 Cumulative hotfix 13 Cloudera Manager 7.6.7 Cumulative hotfix 2 Cloudera Manager 7.6.7 Cumulative hotfix 3 Cloudera Manager 7.6.7 Cumulative hotfix 4 Cloudera Manager 7.6.7 Cumulative hotfix 5 Cloudera Manager 7.6.7 Cumulative hotfix 6 Cloudera Manager 7.6.7 Cumulative hotfix 7 Cloudera Manager 7.6.7 Cumulative hotfix 8 Cloudera Manager 7.6.7 Cumulative hotfix 9 Cloudera Manager 7.6.7 Release Notes (CDP Private Cloud Base 7.1.7 SP2) Cloudera Manager Admin Console Cloudera Manager Agents Cloudera Manager Agents Cloudera Manager API Cloudera Manager Configuration Properties Reference Cloudera Manager Configuration Properties Reference for Cloudera Runtime 7.1.7 Cloudera Manager Download Information Cloudera Manager Entities Reference Cloudera Manager Entity Type Attributes Cloudera Manager Entity Types Cloudera Manager Entity Types and Attributes Cloudera Manager Event Schema Reference Cloudera Manager Health Tests Cloudera Manager Health Tests Reference Cloudera Manager Metrics Cloudera Manager metrics overview Cloudera Manager Metrics Reference Cloudera Manager Overview Cloudera Manager Release Notes Cloudera Manager Server Cloudera Manager Server Metrics Cloudera Manager Server Properties Cloudera Manager sudo command options 
Cloudera Manager support for Cloudera Runtime and CDH Cloudera Manager Trigger Use Cases Cloudera Manager user accounts Cloudera Manager User Roles Cloudera Manager Version Information Cloudera Navigator Key HSM Overview Cloudera Navigator Key Trustee Server Cloudera Navigator Key Trustee Server Overview Cloudera Runtime Cloudera Runtime 7.1.7 SP1 component versions Cloudera Runtime 7.1.7 SP2 component versions Cloudera Runtime 7.1.7 SP3 component versions Cloudera Runtime component versions Cloudera Runtime Download Information Cloudera Runtime Release Notes Cloudera Runtime Version Information Cloudera Search and CDP Cloudera Search architecture Cloudera Search authentication Cloudera Search config templates Cloudera Search configuration and log files Cloudera Search configuration files Cloudera Search configuration files Cloudera Search ETL Cloudera Search log files Cloudera Search Morphlines Reference Cloudera Search Overview Cloudera Search security aspects Cloudera Search solrctl Reference Cloudera Search tasks and processes Cluster balancing algorithm Cluster Configuration Overview Cluster Lifecycle Management with Cloudera Manager Cluster management limitations Cluster management limitations Cluster Metrics Cluster Migration Architectures Cluster sizing Cluster Support Tokens using Cloudera Manager Cluster Utilization Report overview Cluster-Wide Configuration Collecting metrics through HTTP Column compression Column design Column encoding Command Details Command Details Command Line Tools Commands Commands for configuring storage policies Commands for managing buckets Commands for managing keys Commands for managing volumes Commands for using cache pools and directives COMMENT statement Comments Committing a transaction for Direct Reader Common replication topologies Common web interface pages common.service.type.docker Metrics common.service.type.ecs Metrics Communication encryption Compacting on-disk data Compaction prerequisites Compaction tasks 
Compactor properties Compare queries Comparing Configurations for a Service Between Clusters Comparing replication and erasure coding Comparing Similar Activities Comparing tables using ANY/SOME/ALL Comparison of Fair Scheduler with Capacity Scheduler Compatibility Considerations for Virtual Private Clusters Compatibility policies Completed Hue query shows executing on CM Complex types Component types and metrics for alert policies Components Components Compose queries Compound operators COMPUTE STATS statement Conditional functions Configuration Configuration Example Configuration example for writing data to HDFS Configuration example for writing data to Ozone FS Configuration examples Configuration for enabling mTLS in Ozone Configuration options for Spark to work with o3fs Configuration options to store Hive managed tables on Ozone Configuration parameters migrated to Core Settings Service Configuration properties Configuration Properties Reference for Properties not Available in Cloudera Manager Configuration to expose buckets under non-default volumes Configurations and CLI options for the HDFS Balancer Configure a resource-based policy: Atlas Configure a resource-based policy: HadoopSQL Configure a resource-based policy: HBase Configure a resource-based policy: HDFS Configure a resource-based policy: Kafka Configure a resource-based policy: Knox Configure a resource-based policy: NiFi Configure a resource-based policy: NiFi Registry Configure a resource-based policy: Solr Configure a resource-based policy: YARN Configure a resource-based service: Atlas Configure a resource-based service: HadoopSQL Configure a resource-based service: HBase Configure a resource-based service: HDFS Configure a resource-based service: Kafka Configure a resource-based service: Knox Configure a resource-based service: NiFi Configure a resource-based service: NiFi Registry Configure a resource-based service: Solr Configure a resource-based service: YARN Configure a resource-based 
storage handler policy: HadoopSQL Configure a Spark job for dynamic resource allocation Configure Access to GCS from Your Cluster Configure Antivirus Software on CDP Hosts Configure Apache Knox authentication for AD/LDAP Configure Apache Knox authentication for PAM Configure Apache Knox authentication for SAML Configure archival storage Configure Atlas authentication for AD Configure Atlas authentication for LDAP Configure Atlas file-based authentication Configure Atlas PAM authentication Configure Authentication for Amazon S3 Configure authentication using Active Directory Configure authentication using an external program Configure authentication using an LDAP-compliant identity service Configure authentication using Kerberos (SPNEGO) Configure authentication using SAML Configure AWS Credentials Configure Browser-based Interfaces to Require Authentication (SPNEGO) Configure Browsers for Kerberos Authentication (SPNEGO) Configure BucketCache IO engine Configure bulk load replication Configure clients on a producer or consumer level Configure clients on an application level Configure Cloudera Manager for FIPS Configure cluster capacity with queues Configure Cluster to Use Kerberos Authentication Configure columns to store MOBs Configure CPU scheduling and isolation Configure Cross-Origin Support for YARN UIs and REST APIs Configure data locality Configure DataNode memory as storage Configure Debug Delay Configure dynamic queue properties Configure Encryption for Amazon S3 Configure encryption in HBase Configure four-letter-word commands in ZooKeeper Configure FPGA scheduling and isolation Configure GPU scheduling and isolation Configure HBase for use with Phoenix Configure HBase garbage collection Configure HBase in Cloudera Manager to store snapshots in Amazon S3 Configure HBase servers to authenticate with a secure HDFS cluster Configure HBase-Spark connector using Cloudera Manager Configure HDFS RPC protection Configure High Availability for Hive-HDFS ACL Sync 
Configure High Availability for Ranger KMS with DB Configure High Availability for Ranger KMS with KTS Configure Hive-HDFS ACL Sync Configure HMS properties for authorization Configure HSTS for HBase Web UIs Configure JMX ephemeral ports Configure Kafka brokers Configure Kafka brokers Configure Kafka brokers Configure Kafka brokers Configure Kafka brokers Configure Kafka brokers Configure Kafka clients Configure Kafka clients Configure Kafka clients Configure Kafka clients Configure Kafka clients Configure Kafka clients Configure Kafka MirrorMaker Configure Kafka MirrorMaker Configure Kerberos authentication for Apache Atlas Configure Kerberos authentication for Apache Ranger Configure Kerberos authentication for Solr Configure Kerberos Authentication for the Kafka Connect role Configure Kudu processes Configure Lily HBase Indexer Service to Use Kerberos Authentication Configure Lily HBase Indexer to use TLS/SSL Configure Lily HBase Indexer to use TLS/SSL Configure Log Aggregation Configure memory settings Configure mountable HDFS Configure Network Names Configure NodeManager heartbeat Configure Oozie client when TLS/SSL is enabled Configure Oracle Database Configure Partitions Configure Per Queue Properties Configure Phoenix-Hive connector Configure PostgreSQL as the backend database for Hue Configure PostgreSQL for Streaming Components Configure preemption Configure queue ordering policies Configure Ranger Admin High Availability Configure Ranger Admin High Availability with a Load Balancer Configure Ranger authentication for AD Configure Ranger authentication for LDAP Configure Ranger authentication for PAM Configure Ranger authentication for UNIX Configure Ranger authorization for Infra Solr Configure Ranger Usersync for Deleted Users and Groups Configure Ranger Usersync for invalid usernames Configure Ranger with SSL/TLS enabled PostgreSQL Database Configure read replicas using Cloudera Manager Configure RegionServer grouping Configure S3 credentials for 
working with Ozone Configure Scheduler Properties at the Global Level Configure secure HBase replication Configure secure HBase replication Configure secure replication Configure session timeout for Ranger Admin Web UI Configure snapshots Configure source and destination realms in krb5.conf Configure SRM for Failover and Failback Configure storage balancing for DataNodes using Cloudera Manager Configure the blocksize for a column family Configure the Cluster Utilization Report Configure the compaction speed using Cloudera Manager Configure the dynamic resource pool used for exporting and importing snapshots in Amazon S3 Configure the graceful shutdown timeout property Configure the HBase canary Configure the HBase client TGT renewal period Configure the HBase thrift server role Configure the MOB cache using Cloudera Manager Configure the off-heap BucketCache using Cloudera Manager Configure the off-heap BucketCache using the command line Configure the PostgreSQL server Configure the resource-based Ranger service used for authorization Configure the scanner heartbeat using Cloudera Manager Configure the storage policy for WALs using Cloudera Manager Configure the storage policy for WALs using the Command Line Configure TLS encryption manually for Phoenix Query Server Configure TLS encryption manually for Phoenix Query Server Configure TLS Encryption Manually for Schema Registry Configure TLS/SSL encryption for Solr Configure TLS/SSL encryption for Solr Configure TLS/SSL Encryption for the Kafka Connect Role Configure TLS/SSL encryption manually for Apache Ranger Configure TLS/SSL encryption manually for Apache Ranger Configure TLS/SSL encryption manually for Ranger KMS Configure TLS/SSL encryption manually for Ranger KMS Configure TLS/SSL encryption manually for Ranger RMS Configure TLS/SSL encryption manually for Ranger RMS Configure TLS/SSL for Core Hadoop Services Configure TLS/SSL for HBase REST Server Configure TLS/SSL for HBase Thrift Server Configure TLS/SSL 
for HBase Web UIs Configure TLS/SSL for Oozie Configure TLS/SSL for Oozie Configure TLS/SSL for YARN Configure transaction support Configure ulimit for HBase using Cloudera Manager Configure ulimit using Pluggable Authentication Modules using the Command Line Configure User Impersonation for Access to Hive Configure User Impersonation for Access to Phoenix Configure Usersync assignment of Admin users Configure work preserving recovery on NodeManager Configure work preserving recovery on ResourceManager Configure YARN ResourceManager high availability Configure YARN Security for Long-Running Applications Configure YARN Services API to Manage Long-running Applications Configure YARN Services using Cloudera Manager Configure ZooKeeper client shell for Kerberos authentication Configure ZooKeeper server for Kerberos authentication Configure Zookeeper TLS/SSL support for Kafka Configure Zookeeper TLS/SSL support for Kafka Configure ZooKeeper TLS/SSL using Cloudera Manager Configure ZooKeeper TLS/SSL using Cloudera Manager Configuring /tmp directory for cluster hosts Configuring a database for Ranger or Ranger KMS Configuring a dedicated MIT KDC for cross-realm trust Configuring a Local Package Repository Configuring a Local Parcel Repository Configuring a Mail Transfer Agent for Key Trustee Server Configuring a PostgreSQL Database for Ranger or Ranger KMS Configuring a Proxy Server Configuring a Ranger audit filter policy Configuring a Ranger or Ranger KMS Database: MySQL/MariaDB Configuring a Ranger or Ranger KMS Database: Oracle Configuring a Ranger or Ranger KMS Database: Oracle using /ServiceName format Configuring a Secure Credential Storage Provider for Cloudera Manager (Technical Preview) Configuring a secure Kudu cluster using Cloudera Manager Configuring Access to Azure on CDP Public Cloud Configuring Access to Azure on Cloudera Private Cloud Base Configuring Access to Google Cloud Storage Configuring access to Hive on YARN Configuring Access to S3 Configuring 
Access to S3 on CDP Public Cloud Configuring Access to S3 on Cloudera Private Cloud Base Configuring ACLs on HDFS Configuring Advanced Security Options for Apache Ranger Configuring Alert Delivery Configuring Alert Email Delivery Configuring Alert SNMP Delivery Configuring Alerts Transitioning Out of Alerting Health Threshold Configuring an external database for Oozie Configuring and Managing S3Guard Configuring and Monitoring Atlas Configuring and running the HDFS balancer using Cloudera Manager Configuring and Starting the PostgreSQL Server Configuring and tuning S3A block upload Configuring and Using Hive-HDFS ACL Sync Configuring and using Queue Manager REST API Configuring and Using Ranger KMS Configuring and Using Zeppelin Interpreters Configuring Apache Hadoop YARN High Availability Configuring Apache Hadoop YARN Log Aggregation Configuring Apache Hadoop YARN Security Configuring Apache HBase Configuring Apache HBase for Apache Phoenix Configuring Apache HBase High Availability Configuring Apache Hive Configuring Apache Impala Configuring Apache Kafka Configuring Apache Kudu Configuring Apache Ranger High Availability Configuring Apache Spark Configuring Apache Zeppelin Configuring Apache ZooKeeper Configuring Atlas Authentication Configuring Atlas Authorization Configuring Atlas Authorization using Ranger Configuring Atlas using Cloudera Manager Configuring authentication for long-running Spark Streaming jobs Configuring Authentication in Cloudera Manager Configuring Authentication in Cloudera Manager Configuring authentication with LDAP and Direct Bind Configuring authentication with LDAP and Search Bind Configuring Authorization Configuring auto split policy in an HBase table Configuring automatic group offset synchronization Configuring block size Configuring Built-in TLS Acceleration Configuring capacity estimations Configuring CDP Services for HDFS Encryption Configuring Client Access to Impala Configuring Cloudera Manager Configuring Cloudera Manager 
Agents Configuring Cloudera Manager Server Ports Configuring Cloudera Manager to Use an Internal Remote Parcel Repository Configuring Clusters Configuring coarse-grained authorization with ACLs Configuring collection of Cloudera Manager table data Configuring compaction in Cloudera Manager Configuring compaction using table properties Configuring concurrent moves Configuring Cruise Control Configuring Custom Alert Scripts Configuring Custom Cgroups Configuring custom Kerberos principal for Apache Flink Configuring custom Kerberos principal for Atlas Configuring custom Kerberos principal for Cruise Control Configuring custom Kerberos principal for Cruise Control Configuring custom Kerberos principal for HBase Configuring custom Kerberos principal for HDFS Configuring custom Kerberos principal for Hive and Hive-on-Tez Configuring custom Kerberos principal for HttpFS Configuring custom Kerberos principal for Hue Configuring custom Kerberos principal for Kafka Configuring custom Kerberos principal for Kafka Configuring custom Kerberos principal for Knox Configuring custom Kerberos principal for Kudu Configuring custom Kerberos principal for Kudu Configuring custom Kerberos principal for Livy Configuring custom Kerberos principal for NiFi and NiFi Registry Configuring custom Kerberos principal for Omid Configuring custom Kerberos principal for Oozie Configuring custom Kerberos principal for Oozie Configuring custom Kerberos principal for Ozone Configuring custom Kerberos principal for Ozone Configuring custom Kerberos principal for Phoenix Configuring custom Kerberos principal for Schema Registry Configuring custom Kerberos principal for Schema Registry Configuring custom Kerberos principal for Spark Configuring custom Kerberos principal for SQL Stream Builder Configuring custom Kerberos principal for Streams Messaging Manager Configuring custom Kerberos principal for Streams Replication Manager Configuring custom Kerberos principal for Streams Replication Manager 
Configuring custom Kerberos principal for Zeppelin Configuring custom Kerberos principal for ZooKeeper Configuring custom Kerberos principals and custom system users for Solr Configuring custom Kerberos principals and custom system users for Solr Configuring custom Kerberos principals and custom system users for Solr Configuring Dashboards Configuring Data Protection Configuring Dedicated Coordinators and Executors Configuring dedicated Impala coordinator Configuring Delegation for Clients Configuring Directories for Intermediate Data Configuring Directory Monitoring Configuring dynamic resource allocation Configuring Dynamic Resource Pool Configuring Encryption for Specific Buckets Configuring Event Based Automatic Metadata Sync Configuring Event Based Automatic Metadata Sync Configuring external authentication and authorization for Cloudera Manager Configuring Fault Tolerance Configuring file and directory permissions for Hue Configuring for HDFS high availability Configuring for Kudu Tables Configuring goals Configuring group permissions Configuring HBase BlockCache Configuring HBase Hive integration Configuring HBase MultiWAL Configuring HBase snapshots Configuring HBase to use HDFS HA Configuring HBase-Spark connector when both are on same cluster Configuring HBase-Spark connector when HBase is on remote cluster Configuring HDFS ACLs Configuring HDFS High Availability Configuring HDFS properties to optimize log collection Configuring HDFS trash Configuring Health Monitoring Configuring heap size to replicate large directories using replication policies Configuring heterogeneous storage in HDFS Configuring high availability for Hue Configuring Hive access for S3A Configuring Hive and Impala for high availability with Hue Configuring Hive to use with HBase Configuring HiveServer for ETL using YARN queues Configuring HiveServer high availability using a load balancer Configuring HiveServer high availability using Dynamic Service Discovery Configuring HMS for high 
availability Configuring Host Monitor Data Storage Configuring Host Monitoring Configuring Hosts Configuring Hosts to Use the Internal Repository Configuring HSTS for HDFS Web UIs Configuring HSTS for Spark Configuring HTTPS encryption Configuring https endpoints in Ozone S3 Gateway to work with AWS CLI Configuring Hue as a TLS/SSL client Configuring Hue as a TLS/SSL client Configuring Hue as a TLS/SSL server Configuring Hue as a TLS/SSL server Configuring Impala Configuring Impala access for S3A Configuring Impala Query Data Store Maximum Size Configuring Impala Query Monitoring Configuring Impala Query Monitoring Configuring Impala TLS/SSL Configuring Impala TLS/SSL Configuring Impala to work with HDFS HA Configuring Impala Web UI Configuring Impyla for Impala Configuring Infra Solr Configuring JDBC for Impala Configuring JVM options and system properties for Ranger services Configuring Kafka ZooKeeper chroot Configuring Kerberos Authentication Configuring Kerberos Authentication Configuring Kerberos authentication in Apache Knox shared providers Configuring Kerberos properties Configuring Key Trustee Server High Availability Using Cloudera Manager Configuring LDAP Authentication Configuring LDAP Group Mappings Configuring LDAP on unmanaged clusters Configuring legacy CREATE TABLE behavior Configuring Lily HBase Indexer Security Configuring Livy Configuring Load Balancer for Impala Configuring Local Package and Parcel Repositories Configuring Log Alerts Configuring Log Alerts Configuring Log Directories Configuring Log Events Configuring log levels for command line tools Configuring Logging Thresholds Configuring Logs Configuring Management Service Database Limits Configuring MariaDB as the backend database for Hue Configuring MariaDB for Oozie Configuring MariaDB server Configuring Maximum File Descriptors Configuring Memory Allocations Configuring metastore database properties Configuring metastore location Configuring Monitoring Settings Configuring multiple 
listeners Configuring multiple listeners Configuring MultiWAL support using Cloudera Manager Configuring MySQL 5 for Oozie Configuring MySQL 8 for Oozie Configuring MySQL as the backend database for Hue Configuring MySQL for Streaming Components Configuring MySQL server Configuring Network Settings for a Proxy Server Configuring Nginx for basic authentication Configuring OAuth in Data Hub Configuring OAuth with core-site.xml Configuring OAuth with the Hadoop CredentialProvider Configuring ODBC for Impala Configuring Oozie data purge settings using Cloudera Manager Configuring Oozie High Availability using Cloudera Manager Configuring Oozie Sqoop1 Action workflow JDBC drivers Configuring Oozie to enable MapReduce jobs to read or write from Amazon S3 Configuring oozie to use HDFS HA Configuring Oozie to use HDFS HA Configuring Oracle as backend database for Hue Configuring Oracle for Oozie Configuring Oracle for Streaming Components Configuring other CDP components to use HDFS HA Configuring Ozone Configuring Ozone Security Configuring Ozone to work as a pure object store Configuring Ozone to work with Prometheus Configuring PAM authentication using Apache Knox Configuring PAM authentication with LDAP and SSSD Configuring PAM authentication with Linux users Configuring partitions for transactions Configuring Per-Bucket Settings Configuring Per-Bucket Settings to Access Data Around the World Configuring Periodic Stacks Collection Configuring Phoenix-Spark connector when both are on same cluster Configuring Phoenix-Spark connector when Phoenix is on remote cluster Configuring PostgreSQL for Oozie Configuring properties for non-Kerberos authentication mechanisms Configuring properties not exposed in Cloudera Manager Configuring Proxy Users to Access HDFS Configuring queue mapping to use the user name from the application tag using Cloudera Manager Configuring queue mapping to use the user name from the application tag using Cloudera Manager Configuring quotas 
Configuring Ranger audit properties for HDFS Configuring Ranger audit properties for Solr Configuring Ranger audits to show actual client IP address Configuring Ranger Authentication with UNIX, LDAP, AD, or PAM Configuring Ranger Authentication with UNIX, LDAP, or AD Configuring Ranger authorization Configuring Ranger Authorization for Atlas Configuring Ranger KMS High Availability Configuring replication specific REST servers Configuring replications Configuring Resource Parameters Configuring resource-based policies Configuring resource-based services Configuring Roles to Use a Custom Garbage Collection Parameter Configuring S3Guard for Cluster Access to S3 Configuring SAML authentication on managed clusters Configuring Schema Registry instance in NiFi Configuring secure access between Solr and Hue Configuring security for Storage Container Managers in High Availability Configuring Service Monitor Data Storage Configuring Service Monitoring Configuring Services to Use LZO Compression Configuring Simple Authorization in Atlas Configuring SMM for basic authentication Configuring SMM for monitoring Kafka cluster replications Configuring SMM to recognize Prometheus's TLS certificate Configuring Spark access for S3A Configuring Spark application logging properties Configuring Spark application properties in spark-defaults.conf Configuring Spark Applications Configuring Spark on YARN Applications Configuring SRM Driver for performance tuning Configuring srm-control Configuring SSL/TLS certificate exchange between two Cloudera Manager instances Configuring storage balancing for DataNodes Configuring Streams Messaging Manager for Kafka Connect Configuring Streams Replication Manager Configuring Suppression of Health Tests Before Tests Run Configuring tablet servers Configuring temporary table storage Configuring the ABFS Connector Configuring the Atlas hook in Kafka Configuring the balancer threshold Configuring the compaction check interval Configuring the Database for 
Streaming Components Configuring the driver role target clusters Configuring the Frequency of Diagnostic Data Collection Configuring the Hive Delegation Token Store Configuring the Hive Metastore to use HDFS HA Configuring the HiveServer load balancer Configuring the Hue Server to Store Data in the Oracle database Configuring the Kafka Connect Role Configuring the Kudu master Configuring the Livy Thrift Server Configuring the number of objects displayed in Hue Configuring the number of storage container copies for a DataNode Configuring the Ozone trash checkpoint values Configuring the resource capacity of root queue Configuring the server work directory path for a Ranger service Configuring the service role target cluster Configuring the SRM client's secure storage Configuring the storage policy for the Write-Ahead Log (WAL) Configuring Time-Series Query Results Configuring timezone for Hue Configuring TLS Encryption for Cloudera Manager Using Auto-TLS Configuring TLS encryption manually for Apache Atlas Configuring TLS encryption manually for Apache Atlas Configuring TLS encryption manually for Schema Registry Configuring TLS/SSL encryption Configuring TLS/SSL encryption for Kudu using Cloudera Manager Configuring TLS/SSL encryption for Kudu using Cloudera Manager Configuring TLS/SSL encryption manually for Apache Knox Configuring TLS/SSL encryption manually for CDP Services Configuring TLS/SSL encryption manually for DAS using Cloudera Manager Configuring TLS/SSL encryption manually for DAS using Cloudera Manager Configuring TLS/SSL encryption manually for Key Trustee Server Configuring TLS/SSL encryption manually for Livy Configuring TLS/SSL encryption manually for NiFi and NiFi Registry Configuring TLS/SSL encryption manually for Ozone Configuring TLS/SSL encryption manually for Spark Configuring TLS/SSL encryption manually for Zeppelin Configuring TLS/SSL for Core Hadoop Services Configuring TLS/SSL for HBase Configuring TLS/SSL for HBase Configuring TLS/SSL 
for HBase REST Server Configuring TLS/SSL for HBase Thrift Server Configuring TLS/SSL for HBase Web UIs Configuring TLS/SSL for HDFS Configuring TLS/SSL for HDFS Configuring TLS/SSL for Hue Configuring TLS/SSL for Hue Configuring TLS/SSL for the KMS Configuring TLS/SSL for YARN Configuring TLS/SSL manually Configuring TLS/SSL properties Configuring TLSv1.2-enforced MySQL server Configuring Transparent Data Encryption for Ozone Configuring ulimit for HBase Configuring Upgrade Domains Configuring Upgrade Domains Configuring user authentication Configuring user authentication using LDAP Configuring user authentication using SPNEGO Configuring Which Log Messages Become Events Configuring YARN Application Monitoring Configuring YARN Application Monitoring Configuring Zeppelin caching Confirm the election status of a ZooKeeper service Connect to Phoenix Query Server Connect to Phoenix Query Server through Apache Knox Connect workers Connecting Hive to BI tools using a JDBC/ODBC driver Connecting KeySecure HSM to CipherTrust Manager after migration from Key Secure HSM Connecting to an Apache Hive endpoint through Apache Knox Connecting to Impala Daemon in Impala Shell Connecting to PQS using JDBC Connecting to the Apache Livy Thrift Server Connection failed error when accessing the Search app (Solr) from Hue Connectors Connectors Considerations for backfill inserts Considerations for configuring High Availability on Storage Container Manager Considerations for configuring High Availability on the Ozone Manager Considerations for enabling SCM HA security Considerations for Knox Considerations for Oozie to work with AWS Considerations for realm names to use for replication Considerations for working with HDFS snapshots ContainerExecutor Error Codes (YARN) Contents of the BlockCache Control access to queues using ACLs Controlling Data Access with Tags Conversion functions Convert DER, JKS, PEM Files for TLS/SSL Artifacts Converting a managed non-transactional table to 
external Converting a queue to a Managed Parent Queue Converting an HDFS file to ORC Converting from an NFS-mounted shared edits directory to Quorum-Based Storage Converting from Device Names to UUIDs for Encrypted Devices Converting Hive CLI scripts to Beeline Converting instance directories to configs Copy sample tweets to HDFS Copying data between a secure and an insecure cluster using DistCp and WebHDFS Copying data with Hadoop DistCp Core Configuration Metrics Core Configuration Properties in Cloudera Runtime 7.1.7 Core Settings Service Corruption: checksum error on CFile block COUNT COUNT function Create a bucket Create a collection for tweets Create a Collection in Cloudera Search Create a Collection in Cloudera Search Create a Custom Access Policy Create a Custom Role Create a custom YARN service Create a GCP Service Account Create a Hadoop archive Create a Hive authorizer URL policy Create a Kafka Topic to Store your Events Create a new Kudu table from Impala Create a read-only Admin user (Auditor) Create a snapshot policy Create a standard YARN service Create a Streams Cluster on CDP Private Cloud Base Create a table in Hive Create a test collection Create a time-bound policy Create a topology map Create a topology script Create a user-defined function Create and Run a Note CREATE DATABASE statement Create empty table on the destination cluster CREATE FUNCTION statement Create indexer Maven project CREATE MATERIALIZED VIEW Create new YARN services using UI Create partitions Create placement rules CREATE ROLE statement Create snapshots on a directory Create snapshots using Cloudera Manager CREATE TABLE statement CREATE VIEW statement Creating a connector using Kafka Connect in SMM Creating a CRUD transactional table Creating a Custom Cluster Utilization Report Creating a Dashboard Creating a default directory for managed tables Creating a group in Hue Creating a Hive external table replication policy Creating a Host Template Creating a Hue user Creating a 
JAAS configuration file Creating a Kafka topic Creating a Lily HBase Indexer Configuration File Creating a Lily HBase Indexer Configuration File Creating a Morphline Configuration File Creating a Morphline Configuration File Creating a new Dynamic Configuration Creating a notifier Creating a Permanent Internal Repository Creating a Pre-Deployed Cloudera Manager Host Creating a Pre-Deployed Worker Host Creating a replica of an existing shard Creating a Role Group Creating a Runtime Cluster Using a Cloudera Manager Template Creating a Solr collection Creating a Sqoop import command Creating a table for a Kafka stream Creating a Temporary Internal Repository Creating a temporary table Creating a trace user in unsecure Accumulo deployment Creating a Trigger for CPU Capacity Creating a Trigger for Memory Capacity Creating a Trigger Using the Expression Editor Creating a truststore file in PEM format Creating a truststore file in PEM format Creating a view from Spark Creating an alert policy Creating an insert-only transactional table Creating an Ozone-based external table Creating and managing snapshot policies Creating and using a materialized view Creating and using a partitioned materialized view Creating Business Metadata Creating categories Creating classifications Creating Encryption Zones Creating glossaries Creating HDFS replication policy to replicate HDFS data Creating Hue Schema in Oracle database Creating labels Creating partitions dynamically Creating Static Pools Creating system tables to run query on Hive and Tez DAG events Creating tables Creating terms Creating the Hue database Creating the Hue database Creating the tables and view Creating the Template Creating the UDF class Creating trace user in unsecure OpDB deployment Creating Triggers from Charts Creating Virtual Images of Cluster Hosts Creating, using, and dropping an external table Cross Data Center Replication Cross data center replication example of multiple clusters Cruise Control Cruise 
Control Cruise Control Cruise Control Cruise Control Health Tests Cruise Control Metrics Cruise Control Overview Cruise Control REST API endpoints Cruise Control Server Health Tests Cruise Control Server Metrics CUME_DIST Cumulative hotfix CDP Private Cloud Base 7.1.7.3008-2 (SP3 Cumulative hotfix1) Cumulative hotfix CDP Private Cloud Base 7.1.7.3010-1 (SP3 Cumulative hotfix2) Cumulative hotfix CDP Private Cloud Base 7.1.7.3011-1 (SP3 Cumulative hotfix3) Cumulative hotfix CDP Private Cloud Base 7.1.7.3013-1 (SP3 Cumulative hotfix4) Cumulative hotfix CDP Private Cloud Base 7.1.7.3014-1 (SP3 Cumulative hotfix5) Cumulative hotfix CDP Private Cloud Base 7.1.7.3016-1 (SP3 Cumulative hotfix6) Cumulative hotfix CDP Private Cloud Base 7.1.7.3017-1 (SP3 Cumulative hotfix7) Cumulative hotfix CDP Private Cloud Base 7.1.7.3018-1 (SP3 Cumulative hotfix8) Cumulative hotfix CDP PvC Base 7.1.7.2002-1 (SP2 cumulative hotfix1) Cumulative hotfix CDP PvC Base 7.1.7.2009-1 (SP2 cumulative hotfix2) Cumulative hotfix CDP PvC Base 7.1.7.2010-1 (SP2 cumulative hotfix3) Cumulative hotfix CDP PvC Base 7.1.7.2011-1 (SP2 cumulative hotfix4) Cumulative hotfix CDP PvC Base 7.1.7.2013-1 (SP2 cumulative hotfix5) Cumulative hotfix CDP PvC Base 7.1.7.2016-1 (SP2 cumulative hotfix6) Cumulative hotfix CDP PvC Base 7.1.7.2021-1 (SP2 cumulative hotfix7) Cumulative hotfix CDP PvC Base 7.1.7.2023-1 (SP2 cumulative hotfix8) Cumulative hotfix CDP PvC Base 7.1.7.2024-1 (SP2 cumulative hotfix9) Cumulative hotfix CDP PvC Base 7.1.7.2025-2 (SP2 cumulative hotfix10) Cumulative hotfix CDP PvC Base 7.1.7.2026-3 (SP2 cumulative hotfix11) Cumulative hotfix CDP PvC Base 7.1.7.2030-1 (SP2 cumulative hotfix12) Cumulative hotfix CDP PvC Base 7.1.7.2032-1 (SP2 cumulative hotfix13) Cumulative hotfix CDP PvC Base 7.1.7.2035-2 (SP2 cumulative hotfix14) Cumulative hotfix CDP PvC Base 7.1.7.2038-1 (SP2 cumulative hotfix15) Cumulative hotfix CDP PvC Base 7.1.7.2040-4 (SP2 cumulative hotfix16) Cumulative hotfix CDP PvC Base 
7.1.7.2046-1 (SP2 cumulative hotfix17) Cumulative hotfix CDP PvC Base 7.1.7.2047-1 (SP2 cumulative hotfix18) Cumulative hotfix CDP PvC Base 7.1.7.2050-1 (SP2 cumulative hotfix19) Cumulative hotfix CDS 3.2.7172000.10-1 Cumulative hotfix CDS 3.2.7172000.12-1 Cumulative hotfix CDS 3.2.7172000.13-4 Cumulative hotfix CDS 3.2.7172000.14-1 Cumulative hotfix CDS 3.2.7172000.15-1 Cumulative hotfix CDS 3.2.7172000.16-1 Cumulative hotfix CDS 3.2.7172000.3-3 Cumulative hotfix CDS 3.2.7172000.6-1 Cumulative hotfix CDS 3.2.7172000.8-1 Cumulative hotfix CDS 3.2.7172000.9-1 Cumulative hotfix CDS 3.2.7173000.2-1 Cumulative hotfix CDS 3.2.7173000.3-1 Cumulative hotfix CDS 3.2.7173000.4-1 Cumulative hotfixes Cumulative hotfixes Cumulative hotfixes Cumulative hotfixes Cumulative hotfixes Cumulative hotfixes for CDS Custom Configuration Custom Installation Scenarios Custom Installation Solutions Customize dynamic resource allocation settings Customize interpreter settings in a note Customize the HDFS home directory Customizing HDFS Customizing Kerberos principals Customizing Per-Bucket Secrets Held in Credential Files Customizing the Hue web UI Customizing time zones DAS DAS DAS administration using Ambari in CDP DAS administration using Cloudera Manager in CDP DAS architecture Dashboard Types Dashboards Data Access Data Access Data Analytics Studio (DAS) Data Analytics Studio Eventprocessor Health Tests Data Analytics Studio Eventprocessor Metrics Data Analytics Studio Metrics Data Analytics Studio Overview Data Analytics Studio overview Data Analytics Studio Properties in Cloudera Runtime 7.1.7 Data Analytics Studio Webapp Server Health Tests Data Analytics Studio Webapp Server Metrics Data at Rest Encryption Reference Architecture Data at Rest Encryption Requirements Data at Rest Encryption Requirements Data compaction Data Context Connector Properties in Cloudera Runtime 7.1.7 Data Discovery Service Agent Health Tests Data Discovery Service Agent Metrics Data Encryption Components 
and Solutions Data Granularity and Time-Series Metric Data Data migration to Apache Hive Data protection Data Science Data Stewardship with Apache Atlas Data Storage for Monitoring Data Data storage metrics Data types Database Requirements Databases Databases and Table Names DataNode Health Tests DataNode Metrics DataNodes DataNodes DataNodes page Date and time functions DATE data type DDL statements Deactivate and Remove Parcels Debug Web UI for Catalog Server Debug Web UI for Impala Daemon Debug Web UI for StateStore Decide to use the BucketCache DECIMAL data type Decimal type Decommission or remove a tablet server Decommissioning Hosts Decommissioning Ozone DataNodes Decommissioning Role Instances Decrease Reserve Space Dedicated Coordinator Default EXPIRES ON tag policy Default ports of OpDB Default Ranger audit filters Default User Roles Default view of Kafka Connect in the SMM UI Defining a backup target in solr.xml Defining and adding clusters for replication Defining Apache Atlas enumerations Defining co-located Kafka clusters using a service dependency Defining co-located Kafka clusters using Kafka credentials Defining external Kafka clusters Defining related terms Delegation token based authentication Delete a bucket Delete a group Delete a Key Delete a role Delete a user Delete data Delete HBase snapshots from Amazon S3 Delete Objects Delete placement rules Delete Queue Delete queues Delete snapshots using Cloudera Manager DELETE statement Delete the Cluster Deleting a Cluster Deleting a collection Deleting a connector using Kafka Connect in SMM Deleting a Host from Cloudera Manager Deleting a Host Template Deleting a Kafka topic Deleting a notifier Deleting a schema Deleting all documents in a collection Deleting an alert policy Deleting data from a table Deleting dynamically created child queues Deleting Encryption Zone Keys Deleting Encryption Zones Deleting Hosts Deleting partitions Deleting Role Instances Deleting Services Deleting tables Deletion 
Dell EMC PowerScale DENSE_RANK Deploy and manage services on YARN Deploy HBase replication Deploying and configuring Oozie Sqoop1 Action JDBC drivers Deploying Atlas service Deploying Clients Deployment Planning for Cloudera Search Deprecation notices in Cloudera Manager 7.11.3 CHF4 Deprecation notices in Cloudera Runtime 7.1.7 Deprecation notices in Cloudera Runtime 7.1.7 SP3 DESCRIBE EXTENDED and DESCRIBE FORMATTED DESCRIBE statement Describing a materialized view Designating Directories to Include in Disk Usage Reports Detecting slow DataNodes Determining the table type Developing and running an Apache Spark WordCount application Developing Apache Kafka Applications Developing Apache Spark Applications Developing Applications with Apache Kudu Diagnostic Data Collection Diagnostics logging Dimensioning guidelines Direct Reader configuration properties Direct Reader limitations Direct Reader mode introduction Directory configurations Directory Metrics Directory Usage Report Disable a provider in an existing provider configuration Disable loading of coprocessors Disable OpDB's use of HDFS trash Disable proxy for a known service in Apache Knox Disable RegionServer grouping Disable replication at the peer level Disable the BoundedByteBufferPool Disable the Firewall Disabling an alert policy Disabling and redeploying HDFS HA Disabling auto queue deletion Disabling automatic compaction Disabling impersonation (doas) Disabling Oozie High Availability Disabling Oozie UI using Cloudera Manager Disabling or changing the Credential Storage Provider (Technical Preview) Disabling redaction Disabling Redaction of sensitive information when using the Cloudera Manager API Disabling replication of parameters during Hive replication Disabling Static Service Pools Disabling the Automatic Sending of Diagnostic Data from a Manually Triggered Collection Disabling the Firewall Disabling the reporting feature Disabling the share option in Hue Disabling the web metric collection for Hue 
Disabling TLS protocols on JMX ports Disabling Transparent Hugepages (THP) Disassociate partitions from queues Discovering possible predicates Disk Balancer commands Disk management Disk Metrics Disk Removal Disk Replacement Disk space usage issue Disk space versus namespace Disk Usage Reports Disk Usage Reports Displaying Chart Details DistCp and Proxy Settings Distcp between secure clusters in different Kerberos realms Distcp syntax and examples DISTINCT operator DML statements DOCKER Health Tests Docker Server Health Tests Docker Server Metrics Documentation Errata in Cloudera Manager 7.11.3 Cumulative hotfix 4 (CDP Private Cloud Base 7.1.7 SP3) Documentation Errata in Cloudera Manager 7.6.1 (CDP Private Cloud Base 7.1.7 SP1) Documentation Errata in Cloudera Runtime 7.1.7 SP1 Documentation Errata in Cloudera Runtime 7.1.7 SP2 Documentation Errata in Cloudera Runtime 7.1.7 SP3 DOUBLE data type Download a file Download and install PostgreSQL Download the Cluster Utilization Report Download the Trial version of CDP Private Cloud Base Download the Trial version of CDP Private Cloud Base Downloading and exporting data from Hue Downloading and installing MariaDB database Downloading and installing MySQL database Downloading and Publishing the Package Repository Downloading and Publishing the Parcel Repository Downloading Audit Events Downloading Client Configuration Files Downloading HDFS Directory Access Permission Reports Downloading Hdfsfindtool from the CDH archives Downloading query results from Hue takes time Downloading Reports as CSV and XLS Files Driver inter-node coordination Drop a Kudu table DROP DATABASE statement DROP FUNCTION statement DROP MATERIALIZED VIEW DROP ROLE statement DROP STATS statement DROP TABLE statement DROP VIEW statement Dropping a materialized view Dropping an external table along with data Dumping the Oozie database Dynamic allocation Dynamic Queue Scheduling [Technical Preview] Dynamic resource allocation properties Dynamic Resource 
Pool Settings Dynamic resource-based column masking in Hive with Ranger policies Dynamic tag-based column masking in Hive with Ranger policies Dynamically loading a custom filter Ecs Agent Health Tests Ecs Agent Metrics ECS Health Tests Ecs Server Health Tests Ecs Server Metrics Edit a group Edit a role Edit a user Edit or delete a snapshot policy Editing a Chart Editing a Host Template Editing rack assignments for hosts Editing tables Editing the S3Guard Configuration Editing, Deleting, Suppressing, or Deleting a Trigger Effects of WAL rolling on replication Elements of the Recon web user interface Enable Access Control for Data Enable Access Control for Interpreter, Configuration, and Credential Settings Enable Access Control for Notebooks Enable an NTP Service Enable an NTP Service Enable and disable snapshot creation using Cloudera Manager Enable asynchronous scheduler Enable authorization for additional HDFS web UIs Enable authorization for HDFS web UIs Enable authorization in Kafka with Ranger Enable bulk load replication using Cloudera Manager Enable Cgroups Enable core dump Enable detection of slow DataNodes Enable disk IO statistics Enable document-level authorization Enable garbage collector logging Enable GZipCodec as the default compression codec Enable HBase high availability using Cloudera Manager Enable HBase indexing Enable hedged reads for HBase Enable high availability Enable HTTPS communication Enable Intra-Queue Preemption for a specific queue Enable Kerberos authentication Enable Kerberos authentication in Solr Enable LDAP authentication in Solr Enable multi-threaded faceting Enable namespace mapping Enable node label on a cluster to configure partition Enable or disable authentication with delegation tokens Enable override of default queue mappings Enable Phoenix ACLs Enable proxy for a known service in Apache Knox Enable Ranger Admin login using kerberos authentication Enable Ranger authorization in Solr Enable RegionServer grouping using 
Names Mapping Sentry permissions for Solr to Ranger policies MapReduce Health Tests MapReduce indexing MapReduce Job ACLs MapReduce Metrics MapReduceIndexerTool MapReduceIndexerTool input splits MapReduceIndexerTool metadata MapReduceIndexerTool usage syntax Master Health Tests Master Metrics Materialized View Engine Health Tests Materialized View Engine Metrics Materialized views Mathematical functions Maven Artifacts for Cloudera Runtime 7.1.7 SP1 Maven Artifacts for Cloudera Runtime 7.1.7.0 MAX MAX function Memory Memory limits Merge process stops during Sqoop incremental imports Merging data in tables Metric Aggregation Metric Expression Functions Metric Expressions Metrics Metrics and Insight Metrics and queries Migrate brokers by modifying broker IDs in meta.properties Migrate data on the same host Migrate Databases from the Embedded Database Server to the External PostgreSQL Database Server Migrate from the Cloudera Manager External PostgreSQL Database Server to a MySQL/Oracle Database Server Migrate ResourceManager to another host Migrate the Ranger Admin role instance to a new host Migrate the Ranger KMS db role instance to a new host Migrate the Ranger KMS KTS role instance to a new host Migrate to multiple Kudu masters Migrate to strongly consistent indexing Migrating ACLs from Key Trustee KMS to Ranger KMS Migrating Consumer Groups Between Clusters Migrating Data Using Sqoop Migrating database configuration to a new location Migrating Embedded PostgreSQL Database to External PostgreSQL Database Migrating from PostgreSQL Database Server to MySQL/Oracle Database Server Migrating from Sentry to Ranger Migrating from the Cloudera Manager Embedded PostgreSQL Database Server to an External PostgreSQL Database Migrating Hue service by adding new role instances Migrating Hue service using Add Service wizard Migrating Keys from a Java KeyStore to Cloudera Navigator Key Trustee Server Migrating Ranger Key Management Server Role Instances to a New Host Migrating 
Ranger Usersync and Tagsync role groups Migrating Solr replicas Migration from Fair Scheduler to Capacity Scheduler Migration Guide MIN MIN function Minimize cluster distruption during planned downtime Miscellaneous functions Missing Containers page MOB cache properties Modify a provider in an existing provider configuration Modify custom service parameter in descriptor Modify GCS Bucket Permissions Modify interpreter settings Modifying a collection configuration generated using an instance directory Modifying a connector using Kafka Connect in SMM Modifying a Kafka topic Modifying Configuration Properties Using Cloudera Manager Modifying Impala Startup Options Modifying the Health Threshold Modifying the session cookie timeout value Monitor cluster health with ksck Monitor Health Tests Monitor Metrics Monitor RegionServer grouping Monitor the BlockCache Monitor your Cluster from the SMM UI Monitoring Monitoring a Cluster Using Cloudera Manager Monitoring Activities Monitoring and Debugging Spark Applications Monitoring and Diagnostics Monitoring and Diagnostics Monitoring Apache Impala Monitoring Apache Kudu Monitoring checkpoint latency for cluster replication Monitoring cluster profile using Kafka Connect in SMM Monitoring Clusters Monitoring connector profile using Kafka Connect in SMM Monitoring connector settings using Kafka Connect in SMM Monitoring connectors using Kafka Connect in SMM Monitoring end to end latency for Kafka topic Monitoring End-to-End Latency using Streams Messaging Manager Monitoring heap memory usage Monitoring Hosts Monitoring Impala Queries Monitoring Kafka brokers Monitoring Kafka cluster replications by quick ranges Monitoring Kafka Cluster Replications using Streams Messaging Manager Monitoring Kafka clusters Monitoring Kafka Clusters using Streams Messaging Manager Monitoring Kafka Connect using Streams Messaging Manager Monitoring Kafka consumers Monitoring Kafka producers Monitoring Kafka topics Monitoring replication latency for 
cluster replication Monitoring replication throughput and latency by values Monitoring Replication with Streams Messaging Manager Monitoring Service Status Monitoring Services Monitoring Spark Applications Monitoring status of the clusters to be replicated Monitoring the performance of HDFS replication policies Monitoring the performance of Hive/Impala replication policies Monitoring throughput for cluster replication Monitoring topics to be replicated Monitoring YARN Applications More Resources Morphline commands overview Move HBase Master Role to another host Moving a Host Between Clusters Moving a NameNode to a different host using Cloudera Manager Moving and Resizing Charts Moving highly available NameNode, failover controller, and JournalNode roles using the Migrate Roles wizard Moving Monitoring Data on an Active Cluster Moving NameNode roles Moving the Cloudera Manager Server to a New Host Moving the Hue service to a different host Moving the JournalNode edits directory for a role group using Cloudera Manager Moving the JournalNode edits directory for a role instance using Cloudera Manager Moving the Oozie service to a different host Multi-Raft configuration for efficient write performances Multi-server LDAP/AD autentication Multilevel partitioning Multipart upload MySQL: 1040, 'Too many connections' exception NameNode architecture NameNode Health Tests NameNode Metrics NameNodes NameNodes Navigator Audit Server Health Tests Navigator Audit Server Metrics Navigator Encrypt Navigator Encrypt Navigator Encrypt Navigator Encrypt Navigator Encrypt Access Control List Navigator Encrypt Overview Navigator HSM KMS backed by SafeNet Luna HSM Metrics Navigator HSM KMS backed by Thales HSM Metrics Navigator Key HSM Navigator Key Trustee Server Navigator Luna KMS Metastore Health Tests Navigator Luna KMS Metastore Metrics Navigator Luna KMS Proxy Health Tests Navigator Luna KMS Proxy Metrics Navigator Metadata Server Health Tests Navigator Metadata Server Metrics 
Navigator Thales KMS Metastore Health Tests Navigator Thales KMS Metastore Metrics Navigator Thales KMS Proxy Health Tests Navigator Thales KMS Proxy Metrics NDV function Near Real Time Indexing Network and I/O threads Network Interface Metrics Networking and Security Requirements Networking Considerations for Virtual Private Clusters Networking parameters New topic and consumer group discovery NFS Gateway Health Tests NFS Gateway Metrics Nginx configuration for Prometheus Nginx installtion Nginx proxy configuration over Prometheus NiFi lineage NiFi metadata collection NiFi Registry TLS/SSL Properties NiFi TLS/SSL properties NodeManager Health Tests NodeManager Metrics Non-covering range partitions Notes about replication Notifiers NTILE Number-of-Regions Quotas Number-of-Tables Quotas Obtain and Deploy Keys and Certificates for TLS/SSL Obtaining client to Ozone through session Obtaining resources to Ozone Obtaining Time-Series Data Using the API Off-heap BucketCache Offloading Application Logs to Ozone OFFSET clause Offsets Subcommand Omid Health Tests Omid Metrics Omid tso server Health Tests Omid tso server Metrics On-demand Metadata On-demand Metadata On-premise to Cloud and Kafka Version Upgrade Oozie Oozie Oozie Oozie configurations with CDP services Oozie database configurations Oozie Health Tests Oozie High Availability Oozie Load Balancer configuration Oozie Metrics Oozie Properties in Cloudera Runtime 7.1.7 Oozie scheduling examples Oozie security enhancements Oozie Server Health Tests Oozie Server Metrics OpDB overview Operating System Requirements Operating system requirements Operational Database Operational Database Operational Database Overview Operational Database overview Operational Database powered by Apache Accumulo Overview Operational Database powered by Apache Accumulo Reference Operators Optimize mountable HDFS Optimize performance for evaluating SQL predicates Optimizer hints Optimizing data storage Optimizing HBase I/O Optimizing NameNode 
disk space with Hadoop archives Optimizing performance Optimizing Performance for HDFS Transparent Encryption Optimizing Performance in Cloudera Runtime Optimizing queries using partition pruning Optimizing S3A read performance for different file types Options to determine differences between contents of snapshots Options to rerun Oozie workflows in Hue ORC file format ORC vs Parquet formats Orchestrate a rolling restart with no downtime ORDER BY clause Orphaned snapshots Other known issues Other Tasks and Settings OVER Overriding Configuration Properties Overriding custom keystore alias on a Ranger KMS Server Overriding custom keystore alias on a Ranger KMS Server Overview Overview Overview Overview Overview Overview Overview Overview of Hadoop archives Overview of HDFS Overview of Oozie Overview of Parcels Overview of proxy usage and load balancing for Search Overview of Storage Container Manager in High Availability Overview of the Ozone Manager in High Availability Overview page Overview Tab Ozone Ozone Ozone Ozone Ozone architecture Ozone configuration options to work with CDP components Ozone DataNode Health Tests Ozone DataNode Metrics Ozone Health Tests Ozone Manager Health Tests Ozone Manager Metrics Ozone Manager nodes in High Availability Ozone Metrics Ozone Prometheus Health Tests Ozone Prometheus Metrics Ozone Properties in Cloudera Runtime 7.1.7 Ozone Recon Health Tests Ozone Recon Metrics Ozone security architecture Ozone trash overview Packaging different versions of libraries with an Apache Spark application PAM authentication Parameters to configure the Disk Balancer Parcel Configuration Settings Parcel Life Cycle Parcel Locations Parcels Parquet Parquet Partition pruning Partition Pruning for Queries Partition refresh and configuration Partitioning Partitioning Partitioning examples Partitioning for Kudu Tables Partitioning guidelines Partitioning limitations Partitioning limitations Partitioning tables Partitions Partitions and performance 
Passive Database Health Tests Passive Database Metrics Passive Key Trustee Server Health Tests Passive Key Trustee Server Metrics Pausing a Cluster in AWS PERCENT_RANK Perform a backup of the HDFS metadata Perform a disk hot swap for DataNodes using Cloudera Manager Perform ETL by ingesting data from Kafka into Hive Perform master hostname changes Perform scans using HBase Shell Perform the migration Perform the recovery Perform the removal Performance and Scalability Performance and scalability limitations to consider for replication policies Performance and storage considerations for Spark SQL DROP TABLE PURGE Performance Best Practices Performance comparison between Cloudera Manager and Prometheus Performance Considerations Performance considerations Performance Considerations Performance considerations for UDFs Performance Impact of Encryption Performance improvement using partitions Performance issues Performance Management Performance Trade Offs Performance tuning Performance tuning for Ozone Performant .NET producer Performing Maintenance on a Cluster Host Periodic Stacks Collection Periodically rebuilding a materialized view Phoenix Phoenix Phoenix Phoenix Phoenix Health Tests Phoenix Metrics Phoenix Properties in Cloudera Runtime 7.1.7 Phoenix-Spark connector usage examples Physical backups of an entire node Pillars of Security Pipelines page Placement rule policies Placing Ozone DataNodes in offline mode Plan the data movement across disks Planning for Apache Impala Planning for Apache Kudu Planning for Infra Solr Planning for Streams Replication Manager Planning overview Platform and OS Platform and OS Pluggable authentication modules in HiveServer Populating an HBase Table Port and network requirements for Replication Manager on CDP Private Cloud Base Ports Ports Used by Cloudera Manager Ports Used by Cloudera Navigator Key Trustee Server Ports Used by Cloudera Runtime Components Ports Used by DistCp Ports Used by Impala Ports Used by Third-Party 
Components POST /admin/audits/ API Post-migration verification Pre-defined Access Policies for Schema Registry Predicate push-down optimization Predicates Preloaded resource-based services and policies Prepare for master hostname changes Prepare for removal Prepare for the migration Prepare for the recovery Prepare Kerberos authentication-enabled clusters for replication Prepare to back up the HDFS metadata Prepare to replicate using replication policies Preparing a New Cluster Preparing a thrift server and client Preparing for Encryption Using Cloudera Navigator Encrypt Preparing the hardware resources for HDFS High Availability Prerequisite Prerequisites Prerequisites Prerequisites Prerequisites Prerequisites Prerequisites and Assumptions Prerequisites and exceptions for the example configuration Prerequisites for configuring short-ciruit local reads Prerequisites for configuring TLS/SSL for Oozie Prerequisites for enabling erasure coding Prerequisites for enabling HDFS HA using Cloudera Manager Prerequisites for installing Atlas Prerequisites for Prometheus configuration Prerequisites for setting up Atlas HA Prerequisites to configure TLS/SSL for HBase Prerequisites to configure TLS/SSL for HBase Presentation of Aggregate Data Preventing inadvertent deletion of directories Previewing tables using Data Preview Primary key design Primary key index Principal name mapping Principal name mapping Privileged commands for Cloudera Manager installation Problem area: Compose page Problem area: Queries page Problem area: Reports page Process Management Processes Production Installation Profiler Admin Agent Health Tests Profiler Admin Agent Metrics Profiler Manager Metrics Profiler Metrics Agent Health Tests Profiler Metrics Agent Metrics Profiler Scheduler Agent Health Tests Profiler Scheduler Agent Metrics Profiler Scheduler Metrics Prometheus configuration for SMM Prometheus for SMM limitations Prometheus metrics overview Prometheus properties configuration Propagating 
classifications through lineage Propagation of tags as deferred actions Properties for configuring centralized caching Properties for configuring short-circuit local reads on HDFS Properties for configuring the Balancer Properties to set the size of the NameNode edits directory Protocol between consumer and broker Provide Read-only access to Queue Manager UI Provide user permissions Provide user permissions Proxy Cloudera Manager through Apache Knox Purging deleted entities Purposely using a stale materialized view PUT /admin/purge/ API Putting all Hosts in an Upgrade Domain group into Maintenance Mode Queries are not appearing on the Queries page Query an existing Kudu table from Impala Query column is empty but you can see the DAG ID and Application ID Query Details Query fails with "Counters limit exceeded" error message Query Join Performance Query options Query Processor Health Tests Query Processor Metrics Query results cache Query sample data Query scheduling Query Server Health Tests Query Server Metrics Query vectorization Query vectorization properties Querying Querying a schema Querying correlated data Querying files into a DataFrame Querying Kafka data Querying live data from Kafka Querying metric data Querying the information_schema database Queue ACLs Quick Start Deployment for a Streams Cluster Quota enforcement Quota violation policies Quotas Rack awareness Rack awareness (Location awareness) Range partitioning Range partitioning Ranger Ranger Ranger Ranger Ranger Ranger access conditions Ranger AD Integration Ranger Admin Health Tests Ranger Admin Metrics Ranger Audit Filters Ranger audit schema reference Ranger console navigation Ranger database schema reference Ranger Health Tests Ranger KMS Ranger KMS Health Tests Ranger KMS Metrics Ranger KMS Server Health Tests Ranger KMS Server Metrics Ranger KMS Server with KTS Health Tests Ranger KMS Server with KTS Metrics Ranger KMS with Key Trustee Server Health Tests Ranger KMS with Key Trustee Server 
Metrics Ranger Metrics Ranger policies allowing create privilege for Hadoop_SQL databases Ranger policies allowing create privilege for Hadoop_SQL tables Ranger policies for Kudu Ranger Policies Overview Ranger Properties in Cloudera Runtime 7.1.7 Ranger Raz Health Tests Ranger Raz Metrics Ranger Raz Server Health Tests Ranger Raz Server Metrics Ranger RMS - HIVE-HDFS ACL Sync Overview Ranger RMS Health Tests Ranger RMS Metrics Ranger RMS Server Health Tests Ranger RMS Server Metrics Ranger Security Zones Ranger special entities Ranger tag-based policies Ranger Tagsync Health Tests Ranger Tagsync Metrics Ranger UI authentication Ranger UI authorization Ranger user management Ranger Usersync Ranger Usersync Health Tests Ranger Usersync Metrics RANK Re-encrypting an EDEK Re-encrypting Encrypted Data Encryption Keys (EDEKs) Read access Read and write operations Read and write requests with Ozone Manager in High Availability Read operations (scans) Read replica properties Read the Events Reading and writing Hive tables in R Reading and writing Hive tables in Zeppelin Reading data from HBase Reading data through HWC Reading Hive ORC tables Reads (scans) REAL data type Reassigning replicas between log directories Reassignment examples Rebalance after adding Kafka broker Rebalance after demoting Kafka broker Rebalance after removing Kafka broker Rebalancing partitions Rebalancing with Cruise Control Rebuild a Kudu filesystem layout Recommendations for client development Recommended configurations for the Balancer Recommended configurations for the balancer Recommended deployment architecture Recommended Hive configurations when using Ozone Recommissioning an Ozone DataNode Recommissioning Hosts Recommissioning Role Instances Record management Record order and assignment Record User Data Paths Records Recover data from a snapshot Recover from a dead Kudu master Recover from disk failure Recover from full disks Recovering a Key Trustee Server Redaction of Sensitive 
Information from Diagnostic Bundles Redeploying the Oozie ShareLib Redeploying the Oozie sharelib using Cloudera Manager Reducing the Size of Data Structures Refer to a table using dot notation Reference architecture Referencing Amazon S3 in URIs Referencing S3 Credentials for YARN, MapReduce, or Spark Clients Referencing S3 Data in Applications Referer checking failed Refining query search using filters REFRESH AUTHORIZATION statement REFRESH FUNCTIONS statement REFRESH statement RegionServer Health Tests RegionServer Metrics Registering a Lily HBase Indexer Configuration with the Lily HBase Indexer Service Registering Cloudera Navigator Encrypt with Key Trustee Server Registering the UDF Relax WAL durability Release notes Reloading, viewing, and filtering functions Remote Topics Remove a DataNode Remove a provider parameter in an existing provider configuration Remove a RegionServer from RegionServer grouping Remove Cloudera Manager, User Data, and Databases Remove custom service parameter from descriptor Remove Kudu masters Remove or add storage directories for NameNode data directories Remove storage directories using Cloudera Manager Removing a Chart from a Custom Dashboard Removing a Filter Removing a Host From a Cluster Removing an Event Filter Removing Ozone DataNodes from the cluster Removing scratch directories Renaming a Cluster Renaming a Service Renew and Redistribute Certificates Renewing a License Reorder placement rules Repairing partitions manually using MSCK repair Replace a disk on a DataNode host Replace a ZooKeeper disk Replace a ZooKeeper role on an unmanaged cluster Replace a ZooKeeper role with ZooKeeper service downtime Replace a ZooKeeper role without ZooKeeper service downtime Replacing Key Trustee Server Certificates Replicate pre-exist data in an active-active deployment Replicating Data Replicating data to Impala clusters Replicating from unsecure to secure clusters Replication Replication across three or more clusters Replication 
caveats Replication Flows Overview Replication Manager Replication Manager in CDP Private Cloud Base Replication of encrypted data Replication of Impala and Hive User Defined Functions (UDFs) Replication requirements Report craches using breakpad Reports Reports Manager Reports Manager Health Tests Reports Manager Metrics Repository Configuration Files Request a timeline-consistent read Required Databases Required ports in Kerberos authentication-enabled clusters for replication Requirements for compressing and extracting files using Hue File Browser Requirements for Oozie High Availability Reserved words Resetting Configuration Properties to the Default Value Resetting Hue user password Resolving "The user authorized on the connection does not match the session username" error Resolving "You are accessing a non-optimized Hue" error Resource allocation overview Resource distribution workflow Resource Management Resource Management Resource Planning for Data at Rest Encryption Resource Scheduling and Management Resource Tuning Example Resource-based Services and Policies ResourceManager Health Tests ResourceManager Metrics Resources REST endpoints supported on Ozone S3 Gateway Restarting a Cloudera Runtime Service Restarting Services and Instances after Configuration Changes Restarting the Cloudera Management Service Restore an HBase snapshot from Amazon S3 Restore an HBase snapshot from Amazon S3 with a new name Restore data from a replica Restore HDFS metadata from a backup using Cloudera Manager Restore Key Trustee Server from ktbackup.sh backups Restore Key Trustee Server in package-based installations Restore Key Trustee Server in parcel-based installations Restore tables from backups Restoring a collection Restoring HDFS snapshots Restoring NameNode metadata Restoring Navigator Key Trustee Server Restoring the Cloudera Manager configuration Restricting access to Kafka metadata in Zookeeper Restricting classifications based on user permission Restricting 
supported ciphers for Hue Restricting user login Results Tab Results Tab Retaining logs for Replication Manager Retries Retrieving log directory replica assignment information Retrieving metric data Retrieving the clusterstate.json file Review Changes REVOKE ROLE statement REVOKE statement Role Assignments Role Groups Role Instance Reference Role Instances ROLE statements Roll Over an Existing Key Rolling Encryption Keys Rolling Restart Rotate Auto-TLS Certificate Authority and Host Certificates Rotate the master key/secret Row-level filtering and column masking in Hive Row-level filtering in Hive with Ranger policies Row-level filtering in Impala with Ranger policies ROW_NUMBER RPC timeout traces Run a tablet rebalancing tool in Cloudera Manager Run a tablet rebalancing tool in command line Run a tablet rebalancing tool on a rack-aware cluster Run the Cloudera Manager Server Installer Run the Cloudera Manager Server Installer Run the Disk Balancer plan Run the spark-submit job Run the tablet rebalancing tool Running a Hive command Running a MapReduce Job Running a query on a different Hive instance Running a query on a different Hive instance Running a Spark MLlib example Running an interactive session with the Livy API Running Apache Spark Applications Running applications with CDS 3.2.3 with GPU Support Running Commands and SQL Statements in Impala Shell Running Diagnostic Commands for Roles Running HBaseMapReduceIndexerTool Running PySpark in a virtual environment Running sample Spark applications Running shell commands Running Spark 3 Applications Running Spark 3 Applications with CDS 3.2.3 Running Spark applications on secure clusters Running Spark applications on YARN Running Spark Python applications Running the balancer Running the HBCK2 tool Running the Host Inspector Running the Prune Command Using Cloudera Manager Admin Console Running the Prune Command Using the Cloudera Manager API Running YARN Services Running your first Spark application Runtime 
7.1.7.2000-305 Runtime 7.1.7.2002-1 Runtime 7.1.7.2009-1 Runtime 7.1.7.2010-1 Runtime 7.1.7.2011-1 Runtime 7.1.7.2013-1 Runtime 7.1.7.2016-1 Runtime 7.1.7.2021-1 Runtime 7.1.7.2023-1 Runtime 7.1.7.2024-1 Runtime 7.1.7.2025-2 Runtime 7.1.7.2026-3 Runtime 7.1.7.2030-1 Runtime 7.1.7.2032-1 Runtime 7.1.7.2035-2 Runtime 7.1.7.2038-1 Runtime 7.1.7.2040-4 Runtime 7.1.7.2046-1 Runtime 7.1.7.2047-1 Runtime 7.1.7.2050-1 Runtime 7.1.7.3000-77 Runtime 7.1.7.3008-2 Runtime 7.1.7.3010-1 Runtime 7.1.7.3011-1 Runtime 7.1.7.3013-1 Runtime 7.1.7.3014-1 Runtime 7.1.7.3016-1 Runtime 7.1.7.3017-1 Runtime 7.1.7.3018-1 Runtime Cluster Hosts and Role Assignments Runtime environment for UDFs Runtime error: Could not create thread: Resource temporarily unavailable (error 11) Runtime Filtering S3 Connector Properties in Cloudera Runtime 7.1.7 S3 Gateway Health Tests S3 Gateway Metrics S3 Performance Checklist S3A and Checksums (Advanced Feature) S3Guard with Sqoop Safely Writing to S3 Through the S3A Committers SAML properties Sample Custom Alert Script Sample pom.xml file for Spark Streaming with Kafka Sample Python Code Sample script to connect Spark to Ozone SAN Certificates Save a YARN service definition Saving a Chart Saving aliases Saving Charts to a New Dashboard Saving Charts to an Existing Dashboard Saving Charts to Dashboards Saving searches Saving the search results Scalability Considerations Scaling Kudu Scaling Limits and Guidelines Scaling recommendations and limitations Scaling recommendations and limitations Scheduler performance improvements Scheduling among queues Scheduling in Oozie using cron-like syntax Schema alterations Schema design limitations Schema design limitations Schema Entities Schema objects Schema Registry Schema Registry Schema Registry Schema Registry Authorization through Ranger Access Policies Schema Registry Component Architecture Schema Registry Concepts Schema Registry Health Tests Schema Registry Metrics Schema Registry Overview Schema Registry 
Core Configuration Properties in Cloudera Runtime 7.1.7
Gateway
Advanced
Deploy Directory
Description
The directory where the client configs will be deployed
Related Name
Default Value
/etc/hadoop
API Name
client_config_root_dir
Required
true
Core Configuration Client Environment Advanced Configuration Snippet (Safety Valve) for hadoop-env.sh
Description
For advanced use only, key-value pairs (one on each line) to be inserted into the client configuration for hadoop-env.sh
Related Name
Default Value
API Name
core_client_env_safety_valve
Required
false
Client Java Configuration Options
Description
These are Java command-line arguments. Commonly, garbage collection flags, PermGen, or extra debugging flags would be passed here.
Related Name
Default Value
-Djava.net.preferIPv4Stack=true
API Name
core_client_java_opts
Required
false
Gateway Logging Advanced Configuration Snippet (Safety Valve)
Description
For advanced use only, a string to be inserted into log4j.properties for this role only.
Related Name
Default Value
API Name
log4j_safety_valve
Required
false
Logs
Gateway Logging Threshold
Description
The minimum log level for Gateway logs
Related Name
Default Value
INFO
API Name
log_threshold
Required
false
Monitoring
Enable Configuration Change Alerts
Description
When set, Cloudera Manager will send alerts when this entity's configuration changes.
Related Name
Default Value
false
API Name
enable_config_alerts
Required
false
Other
Alternatives Priority
Description
The priority level that the client configuration will have in the Alternatives system on the hosts. Higher priority levels will cause Alternatives to prefer this configuration over any others.
Related Name
Default Value
90
API Name
client_config_priority
Required
true
Resource Management
Client Java Heap Size in Bytes
Description
Maximum size in bytes for the Java process heap memory. Passed to Java -Xmx.
Related Name
Default Value
256 MiB
API Name
core_client_java_heapsize
Required
false
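The default above is displayed as 256 MiB, while the description states that the value is a byte count passed to Java -Xmx. The following is a minimal sketch of that conversion; the helper names and the unit table are illustrative assumptions, not part of Cloudera Manager.

```python
# Hypothetical helpers: convert a displayed size such as "256 MiB" into
# the byte count and the corresponding -Xmx argument string.
UNITS = {"B": 1, "KiB": 1024, "MiB": 1024 ** 2, "GiB": 1024 ** 3}


def heap_size_bytes(display_value):
    """Parse '<number> <unit>' and return the size in bytes."""
    number, unit = display_value.split()
    return int(number) * UNITS[unit]


def xmx_flag(size_bytes):
    """Format a byte count as a JVM maximum-heap argument."""
    return f"-Xmx{size_bytes}"


default_bytes = heap_size_bytes("256 MiB")
print(default_bytes, xmx_flag(default_bytes))
```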
Suppressions
Suppress Configuration Validator: CDH Version Validator
Description
Whether to suppress configuration warnings produced by the CDH Version Validator configuration validator.
Related Name
Default Value
false
API Name
role_config_suppression_cdh_version_validator
Required
true
Suppress Parameter Validation: Deploy Directory
Description
Whether to suppress configuration warnings produced by the built-in parameter validation for the Deploy Directory parameter.
Related Name
Default Value
false
API Name
role_config_suppression_client_config_root_dir
Required
true
Suppress Parameter Validation: Core Configuration Client Environment Advanced Configuration Snippet (Safety Valve) for hadoop-env.sh
Description
Whether to suppress configuration warnings produced by the built-in parameter validation for the Core Configuration Client Environment Advanced Configuration Snippet (Safety Valve) for hadoop-env.sh parameter.
Related Name
Default Value
false
API Name
role_config_suppression_core_client_env_safety_valve
Required
true
Suppress Parameter Validation: Client Java Configuration Options
Description
Whether to suppress configuration warnings produced by the built-in parameter validation for the Client Java Configuration Options parameter.
Related Name
Default Value
false
API Name
role_config_suppression_core_client_java_opts
Required
true
Suppress Parameter Validation: Gateway Logging Advanced Configuration Snippet (Safety Valve)
Description
Whether to suppress configuration warnings produced by the built-in parameter validation for the Gateway Logging Advanced Configuration Snippet (Safety Valve) parameter.
Related Name
Default Value
false
API Name
role_config_suppression_log4j_safety_valve
Required
true
Service-Wide
Advanced
Core Configuration Service Environment Advanced Configuration Snippet (Safety Valve)
Description
For advanced use only, key-value pairs (one on each line) to be inserted into a role's environment. Applies to configurations of all roles in this service except client configuration.
Related Name
Default Value
API Name
CORE_SETTINGS_service_env_safety_valve
Required
false
Cluster-wide Advanced Configuration Snippet (Safety Valve) for core-site.xml
Description
For advanced use only, a string to be inserted into core-site.xml. Applies to all roles and client configurations in this HDFS service as well as all its dependent services. Any configs added here will be overridden by their default values in HDFS (which can be found in hdfs-default.xml).
Related Name
Default Value
API Name
core_site_safety_valve
Required
false
HDFS Advanced Configuration Snippet (Safety Valve) for ssl-client.xml
Description
For advanced use only, a string to be inserted into ssl-client.xml. Applies cluster-wide, but can be overridden by individual services.
Related Name
Default Value
API Name
hdfs_ssl_client_safety_valve
Required
false
System Group
Description
The group that this service's processes should run as (except the HttpFS server, which has its own group)
Related Name
Default Value
hdfs
API Name
process_groupname
Required
true
System User
Description
The user that this service's processes should run as.
Related Name
Default Value
hdfs
API Name
process_username
Required
true
Monitoring
Enable Service Level Health Alerts
Description
When set, Cloudera Manager will send alerts when the health of this service reaches the threshold specified by the EventServer setting eventserver_health_events_alert_threshold
Related Name
Default Value
true
API Name
enable_alerts
Required
false
Enable Configuration Change Alerts
Description
When set, Cloudera Manager will send alerts when this entity's configuration changes.
Related Name
Default Value
false
API Name
enable_config_alerts
Required
false
Service Triggers
Description
The configured triggers for this service. This is a JSON-formatted list of triggers. These triggers are evaluated as part of the health system. Every trigger expression is parsed, and if the trigger condition is met, the list of actions provided in the trigger expression is executed. Each trigger has the following fields:
triggerName (mandatory) - The name of the trigger. This value must be unique for the specific service.
triggerExpression (mandatory) - A tsquery expression representing the trigger.
streamThreshold (optional) - The maximum number of streams that can satisfy a condition of a trigger before the condition fires. By default set to 0, and any stream returned causes the condition to fire.
enabled (optional) - By default set to 'true'. If set to 'false', the trigger is not evaluated.
expressionEditorConfig (optional) - Metadata for the trigger editor. If present, the trigger should only be edited from the Edit Trigger page; editing the trigger here can lead to inconsistencies.
For example, the following JSON-formatted trigger fires if there are more than 10 DataNodes with more than 500 file descriptors opened:
[{"triggerName": "sample-trigger",
"triggerExpression": "IF (SELECT fd_open WHERE roleType = DataNode and last(fd_open) > 500) DO health:bad",
"streamThreshold": 10, "enabled": "true"}]
See the trigger rules documentation for more details on how to write triggers using tsquery.
The JSON format is evolving and may change; as a result, backward compatibility is not guaranteed between releases.
Related Name
Default Value
[]
API Name
service_triggers
Required
true
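The field rules for Service Triggers (two mandatory fields, names unique per service) can be checked before a value is saved. The sketch below is one way to do that in Python; the function name `validate_triggers` and the wording of the messages are assumptions made for illustration, and it does not parse the tsquery expression itself.

```python
import json

# Fields the description marks as mandatory.
MANDATORY = ("triggerName", "triggerExpression")


def validate_triggers(triggers):
    """Return a list of problems found in a list of trigger dicts."""
    problems = []
    seen = set()
    for i, trig in enumerate(triggers):
        for field in MANDATORY:
            if not trig.get(field):
                problems.append(f"trigger {i}: missing {field}")
        name = trig.get("triggerName")
        if name in seen:
            problems.append(f"trigger {i}: duplicate name {name!r}")
        elif name:
            seen.add(name)
    return problems


# The sample trigger from the description above.
sample = [{
    "triggerName": "sample-trigger",
    "triggerExpression": "IF (SELECT fd_open WHERE roleType = DataNode "
                         "and last(fd_open) > 500) DO health:bad",
    "streamThreshold": 10,
    "enabled": "true",
}]

# Serialize to the JSON list form that the property expects.
service_triggers_value = json.dumps(sample)
print(validate_triggers(sample), service_triggers_value)
```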
Service Monitor Derived Configs Advanced Configuration Snippet (Safety Valve)
Description
For advanced use only, a list of derived configuration properties that will be used by the Service Monitor instead of the default ones.
Related Name
Default Value
API Name
smon_derived_configs_safety_valve
Required
false
Other
Default Filesystem
Description
The defaultFs to use in the cluster. Leave this blank if the cluster has a storage service which should be used as the defaultFs.
Related Name
core.defaultFs
Default Value
API Name
core_defaultfs
Required
false
Object Store Service
Description
Select an Object Store service to enable cloud storage support. Once enabled, the cloud storage can be used in Impala and Hue services, via fully-qualified URIs.
Related Name
Default Value
API Name
object_store_service
Required
false
Set Rules to Map Kerberos Principals to Lower Case Short Names
Description
Adds mapping rules to map Kerberos principals to lower case short names that will be inserted before the default rule. After changing this value and restarting the service, any services depending on this one must be restarted as well.
Related Name
Default Value
false
API Name
set_auth_to_local_to_lowercase
Required
false
Proxy
HDFS Proxy User Groups
Description
Comma-delimited list of groups to allow the HDFS user to impersonate. The default '*' allows all groups. To disable entirely, use a string that does not correspond to a group name, such as '_no_group_'.
Related Name
hadoop.proxyuser.hdfs.groups
Default Value
*
API Name
hdfs_proxy_user_groups_list
Required
false
HDFS Proxy User Hosts
Description
Comma-delimited list of hosts where you want to allow the HDFS user to impersonate other users. The default '*' allows all hosts. To disable entirely, use a string that doesn't correspond to a host name, such as '_no_host'.
Related Name
hadoop.proxyuser.hdfs.hosts
Default Value
*
API Name
hdfs_proxy_user_hosts_list
Required
false
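Each proxy-user setting in this group maps to a pair of Hadoop properties named hadoop.proxyuser.<user>.groups and hadoop.proxyuser.<user>.hosts. As a rough sketch of what the resulting core-site.xml entries look like, the Python below builds such a fragment; the function name and the example host names are hypothetical, and the exact XML that Cloudera Manager emits may differ.

```python
import xml.etree.ElementTree as ET


def proxyuser_properties(user, groups="*", hosts="*"):
    """Build a <configuration> element with the two proxy-user properties."""
    config = ET.Element("configuration")
    for suffix, value in (("groups", groups), ("hosts", hosts)):
        prop = ET.SubElement(config, "property")
        ET.SubElement(prop, "name").text = f"hadoop.proxyuser.{user}.{suffix}"
        ET.SubElement(prop, "value").text = value
    return config


# Disable group impersonation entirely and restrict hosts to two
# (hypothetical) gateway machines.
fragment = proxyuser_properties(
    "hdfs",
    groups="_no_group_",
    hosts="gw1.example.com,gw2.example.com",
)
print(ET.tostring(fragment, encoding="unicode"))
```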
Hive Proxy User Groups
Description
Comma-delimited list of groups that you want to allow the Hive user to impersonate. The default '*' allows all groups. To disable entirely, use a string that doesn't correspond to a group name, such as '_no_group_'.
Related Name
hadoop.proxyuser.hive.groups
Default Value
*
API Name
hive_proxy_user_groups_list
Required
false
Hive Proxy User Hosts
Description
Comma-delimited list of hosts where you want to allow the Hive user to impersonate other users. The default '*' allows all hosts. To disable entirely, use a string that doesn't correspond to a host name, such as '_no_host'.
Related Name
hadoop.proxyuser.hive.hosts
Default Value
*
API Name
hive_proxy_user_hosts_list
Required
false
HTTP Proxy User Groups
Description
Comma-delimited list of groups that you want to allow the HTTP user to impersonate. The default '*' allows all groups. To disable entirely, use a string that doesn't correspond to a group name, such as '_no_group_'. This is used by WebHCat.
Related Name
hadoop.proxyuser.HTTP.groups
Default Value
*
API Name
HTTP_proxy_user_groups_list
Required
false
HTTP Proxy User Hosts
Description
Comma-delimited list of hosts where you want to allow the HTTP user to impersonate other users. The default '*' allows all hosts. To disable entirely, use a string that doesn't correspond to a host name, such as '_no_host'. This is used by WebHCat.
Related Name
hadoop.proxyuser.HTTP.hosts
Default Value
*
API Name
HTTP_proxy_user_hosts_list
Required
false
HttpFS Proxy User Groups
Description
Comma-delimited list of groups to allow the HttpFS user to impersonate. The default '*' allows all groups. To disable entirely, use a string that does not correspond to a group name, such as '_no_group_'.
Related Name
hadoop.proxyuser.httpfs.groups
Default Value
*
API Name
httpfs_proxy_user_groups_list
Required
false
HttpFS Proxy User Hosts
Description
Comma-delimited list of hosts where you allow the HttpFS user to impersonate other users. The default '*' allows all hosts. To disable entirely, use a string that doesn't correspond to a host name, such as '_no_host'.
Related Name
hadoop.proxyuser.httpfs.hosts
Default Value
*
API Name
httpfs_proxy_user_hosts_list
Required
false
Hue Proxy User Groups
Description
Comma-delimited list of groups that you want to allow the Hue user to impersonate. The default '*' allows all groups. To disable entirely, use a string that doesn't correspond to a group name, such as '_no_group_'.
Related Name
hadoop.proxyuser.hue.groups
Default Value
*
API Name
hue_proxy_user_groups_list
Required
false
Hue Proxy User Hosts
Description
Comma-delimited list of hosts where you want to allow the Hue user to impersonate other users. The default '*' allows all hosts. To disable entirely, use a string that doesn't correspond to a host name, such as '_no_host'.
Related Name
hadoop.proxyuser.hue.hosts
Default Value
*
API Name
hue_proxy_user_hosts_list
Required
false
Impala Proxy User Groups
Description
Comma-delimited list of groups that you want to allow the Impala user to impersonate. The default '*' allows all groups. To disable entirely, use a string that doesn't correspond to a group name, such as '_no_group_'.
Related Name
hadoop.proxyuser.impala.groups
Default Value
*
API Name
impala_proxy_user_groups_list
Required
false
Impala Proxy User Hosts
Description
Comma-delimited list of hosts where you want to allow the Impala user to impersonate other users. The default '*' allows all hosts. To disable entirely, use a string that doesn't correspond to a host name, such as '_no_host'.
Related Name
hadoop.proxyuser.impala.hosts
Default Value
*
API Name
impala_proxy_user_hosts_list
Required
false
Knox Proxy User Groups
Description
Comma-delimited list of groups that you want to allow the Knox user to impersonate. The default '*' allows all groups. To disable entirely, use a string that doesn't correspond to a group name, such as '_no_group_'.
Related Name
hadoop.proxyuser.knox.groups
Default Value
*
API Name
knox_proxy_user_groups_list
Required
false
Knox Proxy User Hosts
Description
Comma-delimited list of hosts where you want to allow the Knox user to impersonate other users. The default '*' allows all hosts. To disable entirely, use a string that doesn't correspond to a host name, such as '_no_host'.
Related Name
hadoop.proxyuser.knox.hosts
Default Value
*
API Name
knox_proxy_user_hosts_list
Required
false
Livy Proxy User Groups
Description
Comma-delimited list of groups that you want to allow the Livy user to impersonate. The default '*' allows all groups. To disable entirely, use a string that doesn't correspond to a group name, such as '_no_group_'.
Related Name
hadoop.proxyuser.livy.groups
Default Value
*
API Name
livy_proxy_user_groups_list
Required
false
Livy Proxy User Hosts
Description
Comma-delimited list of hosts where you want to allow the Livy user to impersonate other users. The default '*' allows all hosts. To disable entirely, use a string that doesn't correspond to a host name, such as '_no_host'.
Related Name
hadoop.proxyuser.livy.hosts
Default Value
*
API Name
livy_proxy_user_hosts_list
Required
false
Oozie Proxy User Groups
Description
Allows the oozie superuser to impersonate any members of a comma-delimited list of groups. The default '*' allows all groups. To disable entirely, use a string that doesn't correspond to a group name, such as '_no_group_'.
Related Name
hadoop.proxyuser.oozie.groups
Default Value
*
API Name
oozie_proxy_user_groups_list
Required
false
Oozie Proxy User Hosts
Description
Comma-delimited list of hosts where you want to allow the oozie user to impersonate other users. The default '*' allows all hosts. To disable entirely, use a string that doesn't correspond to a host name, such as '_no_host'.
Related Name
hadoop.proxyuser.oozie.hosts
Default Value
*
API Name
oozie_proxy_user_hosts_list
Required
false
Phoenix Proxy User Groups
Description
Comma-delimited list of groups that you want to allow the Phoenix user to impersonate. The default '*' allows all groups. To disable entirely, use a string that doesn't correspond to a group name, such as '_no_group_'.
Related Name
hadoop.proxyuser.phoenix.groups
Default Value
*
API Name
phoenix_proxy_user_groups_list
Required
false
Phoenix Proxy User Hosts
Description
Comma-delimited list of hosts where you want to allow the Phoenix user to impersonate other users. The default '*' allows all hosts. To disable entirely, use a string that doesn't correspond to a host name, such as '_no_host'.
Related Name
hadoop.proxyuser.phoenix.hosts
Default Value
*
API Name
phoenix_proxy_user_hosts_list
Required
false
Service Monitor Proxy User Groups
Description
Allows the Cloudera Service Monitor user to impersonate any members of a comma-delimited list of groups. The default '*' allows all groups. This property is used only if Service Monitor is using a different Kerberos principal than the Hue service. To disable entirely, use a string that does not correspond to a group name, such as '_no_group_'.
Related Name
hadoop.proxyuser.smon.groups
Default Value
*
API Name
smon_proxy_user_groups_list
Required
false
Service Monitor Proxy User Hosts
Description
Comma-delimited list of hosts where you want to allow the Cloudera Service Monitor user to impersonate other users. The default '*' allows all hosts. This property is used only if Service Monitor is using a different Kerberos principal than the Hue service. To disable entirely, use a string that does not correspond to a host name, such as '_no_host'.
Related Name
hadoop.proxyuser.smon.hosts
Default Value
*
API Name
smon_proxy_user_hosts_list
Required
false
Telemetry Publisher Proxy User Groups
Description
Allows the Cloudera Telemetry Publisher user to impersonate any members of a comma-delimited list of groups. The default '*' allows all groups. This property is used only if Telemetry Publisher is using a different Kerberos principal than the Hue service. To disable entirely, use a string that does not correspond to a group name, such as '_no_group_'.
Related Name
hadoop.proxyuser.telepub.groups
Default Value
*
API Name
telepub_proxy_user_groups_list
Required
false
Telemetry Publisher Proxy User Hosts
Description
Comma-delimited list of hosts where you want to allow the Cloudera Telemetry Publisher user to impersonate other users. The default '*' allows all hosts. This property is used only if Telemetry Publisher is using a different Kerberos principal than the Hue service. To disable entirely, use a string that does not correspond to a host name, such as '_no_host'.
Related Name
hadoop.proxyuser.telepub.hosts
Default Value
*
API Name
telepub_proxy_user_hosts_list
Required
false
YARN Proxy User Groups
Description
Comma-delimited list of groups that you want to allow the YARN user to impersonate. The default '*' allows all groups. To disable entirely, use a string that does not correspond to a group name, such as '_no_group_'.
Related Name
hadoop.proxyuser.yarn.groups
Default Value
*
API Name
yarn_proxy_user_groups_list
Required
false
YARN Proxy User Hosts
Description
Comma-delimited list of hosts where you want to allow the YARN user to impersonate other users. The default '*' allows all hosts. To disable entirely, use a string that does not correspond to a host name, such as '_no_host'.
Related Name
hadoop.proxyuser.yarn.hosts
Default Value
*
API Name
yarn_proxy_user_hosts_list
Required
false
Security
Additional Rules to Map Kerberos Principals to Short Names
Description
Additional mapping rules that will be inserted before rules generated from the list of trusted realms and before the default rule. After changing this value and restarting the service, any services depending on this one must be restarted as well. The hadoop.security.auth_to_local property is configured using this information. Default rules are generated by Cloudera Manager and substituted in place of the literal {DEFAULT_RULES} if it is specified in this value.
Related Name
Default Value
{DEFAULT_RULES}
API Name
extra_auth_to_local_rules
Required
false
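The description above says that the generated default rules are substituted in place of the literal {DEFAULT_RULES}. The following sketch illustrates that substitution only. The rule strings are hypothetical stand-ins for whatever Cloudera Manager generates, and the behavior when the placeholder is absent (extra rules first, generated rules after) is an assumption based on the "inserted before" wording.

```python
PLACEHOLDER = "{DEFAULT_RULES}"


def expand_rules(extra_rules, generated_rules):
    """Combine user-supplied rules with generated ones, one rule per line."""
    generated = "\n".join(generated_rules)
    if PLACEHOLDER in extra_rules:
        # Generated rules take the place of the literal placeholder.
        return extra_rules.replace(PLACEHOLDER, generated)
    if not extra_rules.strip():
        return generated
    # Assumption: without the placeholder, the extra rules simply come first.
    return extra_rules.rstrip("\n") + "\n" + generated


# Hypothetical stand-ins for the rules generated from the trusted realms.
generated = ["RULE:[1:$1@$0](.*@EXAMPLE.COM)s/@.*//", "DEFAULT"]
extra = "RULE:[2:$1@$0](.*@CORP.EXAMPLE.COM)s/@.*//\n{DEFAULT_RULES}"
print(expand_rules(extra, generated))
```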
Authorized Admin Groups
Description
Comma-separated list of groups authorized to perform admin operations on Hadoop. This is emitted only if authorization is enabled.
Related Name
Default Value
API Name
hadoop_authorized_admin_groups
Required
false
Authorized Admin Users
Description
Comma-separated list of users authorized to perform admin operations on Hadoop. This is emitted only if authorization is enabled.
Related Name
Default Value
*
API Name
hadoop_authorized_admin_users
Required
false
Authorized Groups
Description
Comma-separated list of groups authorized to use Hadoop. This is emitted only if authorization is enabled.
Related Name
Default Value
API Name
hadoop_authorized_groups
Required
false
Authorized Users
Description
Comma-separated list of users authorized to use Hadoop. This is emitted only if authorization is enabled.
Related Name
Default Value
*
API Name
hadoop_authorized_users
Required
false
Hadoop User Group Mapping Search Base
Description
The search base for the LDAP connection. This is a distinguished name, and will typically be the root of the LDAP directory.
Related Name
hadoop.security.group.mapping.ldap.base
Default Value
API Name
hadoop_group_mapping_ldap_base
Required
false
Hadoop User Group Mapping LDAP Bind User Password
Description
The password of the bind user.
Related Name
hadoop.security.group.mapping.ldap.bind.password
Default Value
API Name
hadoop_group_mapping_ldap_bind_passwd
Required
false
Hadoop User Group Mapping LDAP Bind User Distinguished Name
Description
Distinguished name of the user to bind to AD as for user authentication search/bind and group lookup for role authorization. For OpenLDAP-based directories this should be a DN string; for Active Directory this can be just a username, combined with the "Active Directory Domain" value for login. For example, username in this field and example.com in the Active Directory domain field results in the User Principal Name (UPN) value username@example.com being used to bind. If you put a UPN value here, do not also set the "Active Directory Domain" field; otherwise you will end up presenting username@example.com@example.com for binds.
AD accepts either a UPN value or a DN value as a valid Bind DN.
An example of a Distinguished Name (DN): CN=cdh admin,OU=svcaccount,DC=example,DC=com
An example of a UPN value: cdhadmin@example.com
Related Name
hadoop.security.group.mapping.ldap.bind.user
Default Value
API Name
hadoop_group_mapping_ldap_bind_user
Required
false
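The double-suffix pitfall described for the bind user can be made concrete with a short sketch. The two helpers below are hypothetical: the first mimics the described behavior of appending the Active Directory domain to the bind user, and the second flags the combination that produces username@example.com@example.com. Neither reflects actual Cloudera Manager or Hadoop code.

```python
def effective_bind_name(bind_user, ad_domain=""):
    """Mimic the described behavior: the domain, if set, is appended."""
    return f"{bind_user}@{ad_domain}" if ad_domain else bind_user


def is_double_suffixed(bind_user, ad_domain=""):
    """True when a UPN-style bind user is combined with a domain value."""
    return bool(ad_domain) and "@" in bind_user


# A bare username plus a domain yields a UPN.
print(effective_bind_name("username", "example.com"))
# A UPN plus a domain yields the malformed value the description warns about.
print(effective_bind_name("username@example.com", "example.com"))
print(is_double_suffixed("username@example.com", "example.com"))
```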
Hadoop User Group Mapping LDAP Group Search Filter
Description
An additional filter to use when searching for groups.
Related Name
hadoop.security.group.mapping.ldap.search.filter.group
Default Value
(objectClass=group)
API Name
hadoop_group_mapping_ldap_group_filter
Required
false
Hadoop User Group Mapping LDAP Group Name Attribute
Description
The attribute of the group object that identifies the group name. The default will usually be appropriate for all LDAP systems.
Related Name
hadoop.security.group.mapping.ldap.search.attr.group.name
Default Value
cn
API Name
hadoop_group_mapping_ldap_group_name_attr
Required
false
Hadoop User Group Mapping LDAP TLS/SSL Truststore
Description
File path to a JKS-format truststore containing the TLS/SSL certificate used to sign the LDAP server's certificate. Note that in previous releases this was erroneously referred to as a "keystore".
Related Name
hadoop.security.group.mapping.ldap.ssl.keystore
Default Value
API Name
hadoop_group_mapping_ldap_keystore
Required
false
Hadoop User Group Mapping LDAP TLS/SSL Truststore Password
Description
The password for the TLS/SSL truststore.
Related Name
hadoop.security.group.mapping.ldap.ssl.keystore.password
Default Value
API Name
hadoop_group_mapping_ldap_keystore_passwd
Required
false
Hadoop User Group Mapping LDAP Group Membership Attribute
Description
The attribute of the group object that identifies the users that are members of the group. The default will usually be appropriate for any LDAP installation.
Related Name
hadoop.security.group.mapping.ldap.search.attr.member
Default Value
member
API Name
hadoop_group_mapping_ldap_member_attr
Required
false
Hadoop User Group Mapping LDAP URL
Description
The URL of the LDAP server. The URL must be prefixed with ldap:// or ldaps://. The URL can optionally specify a custom port; by default, ldap:// connects to port 389 and ldaps:// connects to port 636. Note that passwords are sent in the clear if ldap:// is used, and that by fall 2020 Active Directory servers will no longer allow non-LDAPS connections to bind to AD hosts with LDAP signing enabled. See Microsoft knowledge document 935834 for more information.
Related Name
hadoop.security.group.mapping.ldap.url
Default Value
API Name
hadoop_group_mapping_ldap_url
Required
false
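The URL rules above (required ldap:// or ldaps:// prefix, optional port, defaults of 389 and 636) can be checked with the standard library. This is a minimal sketch; the function name `parse_ldap_url` and the host name are assumptions for illustration.

```python
from urllib.parse import urlsplit

# Default ports stated in the description above.
DEFAULT_PORTS = {"ldap": 389, "ldaps": 636}


def parse_ldap_url(url):
    """Return (host, port, uses_tls) for an ldap:// or ldaps:// URL."""
    parts = urlsplit(url)
    if parts.scheme not in DEFAULT_PORTS:
        raise ValueError("URL must start with ldap:// or ldaps://")
    if not parts.hostname:
        raise ValueError("URL has no host")
    # Fall back to the scheme's default port when none is given.
    port = parts.port or DEFAULT_PORTS[parts.scheme]
    return parts.hostname, port, parts.scheme == "ldaps"


print(parse_ldap_url("ldap://ad.example.com"))
print(parse_ldap_url("ldaps://ad.example.com:3269"))
```

A result whose third element is False indicates a plain ldap:// URL, the case in which the description warns that passwords travel in the clear.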
Hadoop User Group Mapping LDAP TLS/SSL Enabled
Description
Whether or not to use TLS/SSL when connecting to the LDAP server.
Related Name
hadoop.security.group.mapping.ldap.use.ssl
Default Value
false
API Name
hadoop_group_mapping_ldap_use_ssl
Required
false
Hadoop User Group Mapping LDAP User Search Filter
Description
An additional filter to use when searching for LDAP users. The default will usually be appropriate for Active Directory installations. If connecting to a generic LDAP server, 'sAMAccountName' will likely be replaced with 'uid'. {0} is a special string used to denote where the username fits into the filter.
Related Name
hadoop.security.group.mapping.ldap.search.filter.user
Default Value
(&(objectClass=user)(sAMAccountName={0}))
API Name
hadoop_group_mapping_ldap_user_filter
Required
false
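To illustrate how the {0} marker in the user search filter is filled in, the sketch below substitutes a username into the filter template. The escaping of special characters follows the LDAP filter string rules of RFC 4515 and is an addition of this sketch; whether Hadoop escapes the username in the same way is not stated here. The function names are hypothetical.

```python
# Characters that must be escaped inside an LDAP filter value (RFC 4515).
_ESCAPES = {"\\": r"\5c", "*": r"\2a", "(": r"\28", ")": r"\29", "\0": r"\00"}


def escape_filter_value(value):
    """Escape filter metacharacters so the value is matched literally."""
    return "".join(_ESCAPES.get(ch, ch) for ch in value)


def build_user_filter(template, username):
    """Substitute the username for the {0} marker in the filter template."""
    return template.replace("{0}", escape_filter_value(username))


AD_FILTER = "(&(objectClass=user)(sAMAccountName={0}))"
# For a generic LDAP server, the description suggests swapping in 'uid'.
GENERIC_FILTER = AD_FILTER.replace("sAMAccountName", "uid")

print(build_user_filter(AD_FILTER, "alice"))
print(build_user_filter(GENERIC_FILTER, "alice"))
```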
Hadoop HTTP Authentication Cookie Domain
Description
The domain to use for the HTTP cookie that stores the authentication token. For authentication to work correctly across the web consoles of all Hadoop nodes, the domain must be set correctly. Important: when using IP addresses, browsers ignore cookies with domain settings. For this setting to work properly, all nodes in the cluster must be configured to generate URLs with hostname.domain names.
Related Name
Default Value
API Name
hadoop_http_auth_cookie_domain
Required
false
Hadoop RPC Protection
Description
Quality of protection for secured RPC connections between NameNode and HDFS clients. For effective RPC protection, enable Kerberos authentication.
Related Name
hadoop.rpc.protection
Default Value
authentication
API Name
hadoop_rpc_protection
Required
false
Hadoop Secure Authentication
Description
Choose the authentication mechanism used by Hadoop
Related Name
hadoop.security.authentication
Default Value
simple
API Name
hadoop_security_authentication
Required
false
Hadoop Secure Authorization
Description
Enable authorization
Related Name
hadoop.security.authorization
Default Value
false
API Name
hadoop_security_authorization
Required
false
Hadoop User Group Mapping Implementation
Description
Class for user to group mapping (get groups for a given user).
Related Name
hadoop.security.group.mapping
Default Value
org.apache.hadoop.security.ShellBasedUnixGroupsMapping
API Name
hadoop_security_group_mapping
Required
false
Encryption Key Default Length
Description
The length (in bits) of the keys that the KeyProvider produces. Key length defines the upper bound on an algorithm's security; ideally, it would coincide with the lower bound on an algorithm's security.
Related Name
hadoop.security.key.default.bitlength
Default Value
128
API Name
hdfs_encryption_key_length
Required
false
Hadoop TLS/SSL Enabled
Description
Enable TLS/SSL encryption for HDFS, MapReduce, and YARN web UIs, as well as encrypted shuffle for MapReduce and YARN.
Related Name
hadoop.ssl.enabled
Default Value
false
API Name
hdfs_hadoop_ssl_enabled
Required
false
Kerberos Principal
Description
Kerberos principal short name used by all roles of this service.
Related Name
Default Value
hdfs
API Name
kerberos_princ_name
Required
true
Log and Query Redaction Policy
Description
Note: Do not edit this property in the classic layout. Switch to the new layout to use preconfigured redaction rules and test your rules inline.
Use this property to define a list of rules to be followed for redacting sensitive information from log files and query strings. Click + to add a new redaction rule. You can choose one of the preconfigured rules or add a custom rule. When specifying a custom rule, the Search field should contain a regular expression that will be matched against the data. If a match is found, it is replaced by the contents of the Replace field.
Trigger is an optional field. It can be used to specify a simple string to be searched in the data. If the string is found, the redactor attempts to find a match for the Search regex. If no trigger is specified, redaction occurs by matching the Search regular expression. Use the Trigger field to enhance performance: simple string matching is faster than regular expression matching.
Test your rules by entering sample text into the Test Redaction Rules text box and clicking Test Redaction. If no rules match, the text you entered is returned unchanged.
Related Name
redaction_policy
Default Value
{
  "version": 1,
  "rules": [
    {
      "description": "Redact passwords from json files",
      "trigger": "password",
      "search": "\"password\"[ ]*:[ ]*\"[^\"]+\"",
      "caseSensitive": false,
      "replace": "\"password\": \"LOG-REDACTED\""
    },
    {
      "description": "Redact password\u003d and password:",
      "trigger": "password",
      "search": "password[:\u003d][^ \"\\\\]+",
      "caseSensitive": false,
      "replace": "password\u003dLOG-REDACTED"
    },
    {
      "description": "Redact passwd\u003d and passwd:",
      "trigger": "passwd",
      "search": "passwd[:\u003d][^ \"\\\\]+",
      "caseSensitive": false,
      "replace": "passwd\u003dLOG-REDACTED"
    },
    {
      "description": "Redact pass\u003d and pass:",
      "trigger": "pass",
      "search": "pass[:\u003d][^ \"\\\\]+",
      "caseSensitive": false,
      "replace": "pass\u003dLOG-REDACTED"
    },
    {
      "description": "Redact PASSWORD, ",
      "trigger": "PASSWORD, ",
      "search": "PASSWORD, [^\"\\\\]+",
      "caseSensitive": false,
      "replace": "PASSWORD, LOG-REDACTED"
    },
    {
      "description": "Redact secret\u003d and secret:",
      "trigger": "secret",
      "search": "secret[:\u003d][^ \"\\\\]+",
      "caseSensitive": false,
      "replace": "secret\u003dLOG-REDACTED"
    },
    {
      "description": "Credit Card numbers (with separator)",
      "search": "\\d{4}[^\\w:]\\d{4}[^\\w:]\\d{4}[^\\w:]\\d{4}",
      "caseSensitive": true,
      "replace": "XXXX-XXXX-XXXX-XXXX"
    },
    {
      "description": "Social Security numbers (with separator)",
      "search": "\\d{3}[^\\w:]\\d{2}[^\\w:]\\d{4}",
      "caseSensitive": true,
      "replace": "XXX-XX-XXXX"
    }
  ]
}
API Name
redaction_policy
Required
false
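The Trigger, Search, and Replace semantics described for the redaction policy can be sketched in a few lines. The Python below is an illustration only: it uses Python's re module as a stand-in for the product's regular expression engine, the function name `redact` is hypothetical, and matching the trigger case-insensitively when caseSensitive is false is an assumption. The two rules are adapted from the default policy.

```python
import re


def redact(text, rules):
    """Apply each rule in order; skip a rule whose trigger string is absent."""
    for rule in rules:
        case_sensitive = rule.get("caseSensitive", True)
        trigger = rule.get("trigger")
        if trigger:
            # Cheap substring test before the more expensive regex search.
            haystack = text if case_sensitive else text.lower()
            needle = trigger if case_sensitive else trigger.lower()
            if needle not in haystack:
                continue
        flags = 0 if case_sensitive else re.IGNORECASE
        # A callable replacement keeps the Replace text literal.
        text = re.sub(rule["search"], lambda m, r=rule: r["replace"],
                      text, flags=flags)
    return text


RULES = [
    {"trigger": "password", "search": r'password[:=][^ "\\]+',
     "caseSensitive": False, "replace": "password=LOG-REDACTED"},
    {"search": r"\d{4}[^\w:]\d{4}[^\w:]\d{4}[^\w:]\d{4}",
     "caseSensitive": True, "replace": "XXXX-XXXX-XXXX-XXXX"},
]

print(redact("login password=hunter2 ok", RULES))
print(redact("card 1234-5678-9012-3456", RULES))
```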
Enable Log and Query Redaction
Description
Enable/Disable the Log and Query Redaction Policy for this cluster.
Related Name
redaction_policy_enabled
Default Value
true
API Name
redaction_policy_enabled
Required
false
Enable Security Audit Logger
Description
Enable security audit logger for HDFS and dependent services
Related Name
security_logger_enabled
Default Value
true
API Name
security_logger_enabled
Required
false
Cluster-Wide Default TLS/SSL Client Truststore Location
Description
Path to the TLS/SSL client truststore file. Defines a cluster-wide default that can be overridden by individual services. This truststore must be in JKS format. The truststore contains certificates of trusted servers, or of Certificate Authorities trusted to identify servers. The contents of the truststore can be modified without restarting any roles. By default, changes to its contents are picked up within ten seconds. If not set, the default Java truststore is used to verify certificates.
Related Name
ssl.client.truststore.location
Default Value
API Name
ssl_client_truststore_location
Required
false
Cluster-Wide Default TLS/SSL Client Truststore Password
Description
Password for the TLS/SSL client truststore. Defines a cluster-wide default that can be overridden by individual services.
Related Name
ssl.client.truststore.password
Default Value
API Name
ssl_client_truststore_password
Required
false
Trusted Kerberos Realms
Description
List of Kerberos realms that Hadoop services should trust. If empty, defaults to the default_realm property configured in the krb5.conf file. After changing this value and restarting the service, all services depending on this service must also be restarted. Adds mapping rules for each domain to the hadoop.security.auth_to_local property in core-site.xml.
Related Name
Default Value
API Name
trusted_realms
Required
false
Suppressions
Suppress Configuration Validator: CDH Version Validator
Description
Whether to suppress configuration warnings produced by the CDH Version Validator configuration validator.
Related Name
Default Value
false
API Name
role_config_suppression_cdh_version_validator
Required
true
Suppress Configuration Validator: Deploy Directory
Description
Whether to suppress configuration warnings produced by the Deploy Directory configuration validator.
Related Name
Default Value
false
API Name
role_config_suppression_client_config_root_dir
Required
true
Suppress Configuration Validator: Core Configuration Client Environment Advanced Configuration Snippet (Safety Valve) for hadoop-env.sh
Description
Whether to suppress configuration warnings produced by the Core Configuration Client Environment Advanced Configuration Snippet (Safety Valve) for hadoop-env.sh configuration validator.
Related Name
Default Value
false
API Name
role_config_suppression_core_client_env_safety_valve
Required
true
Suppress Configuration Validator: Client Java Configuration Options
Description
Whether to suppress configuration warnings produced by the Client Java Configuration Options configuration validator.
Related Name
Default Value
false
API Name
role_config_suppression_core_client_java_opts
Required
true
Suppress Configuration Validator: Gateway Logging Advanced Configuration Snippet (Safety Valve)
Description
Whether to suppress configuration warnings produced by the Gateway Logging Advanced Configuration Snippet (Safety Valve) configuration validator.
Related Name
Default Value
false
API Name
role_config_suppression_log4j_safety_valve
Required
true
Suppress Parameter Validation: Default Filesystem
Description
Whether to suppress configuration warnings produced by the built-in parameter validation for the Default Filesystem parameter.
Related Name
Default Value
false
API Name
service_config_suppression_core_defaultfs
Required
true
Suppress Parameter Validation: Core Configuration Service Environment Advanced Configuration Snippet (Safety Valve)
Description
Whether to suppress configuration warnings produced by the built-in parameter validation for the Core Configuration Service Environment Advanced Configuration Snippet (Safety Valve) parameter.
Related Name
Default Value
false
API Name
service_config_suppression_core_settings_service_env_safety_valve
Required
true
Suppress Parameter Validation: Cluster-wide Advanced Configuration Snippet (Safety Valve) for core-site.xml
Description
Whether to suppress configuration warnings produced by the built-in parameter validation for the Cluster-wide Advanced Configuration Snippet (Safety Valve) for core-site.xml parameter.
Related Name
Default Value
false
API Name
service_config_suppression_core_site_safety_valve
Required
true
Suppress Parameter Validation: Additional Rules to Map Kerberos Principals to Short Names
Description
Whether to suppress configuration warnings produced by the built-in parameter validation for the Additional Rules to Map Kerberos Principals to Short Names parameter.
Related Name
Default Value
false
API Name
service_config_suppression_extra_auth_to_local_rules
Required
true
Suppress Configuration Validator: Gateway Count Validator
Description
Whether to suppress configuration warnings produced by the Gateway Count Validator configuration validator.
Related Name
Default Value
false
API Name
service_config_suppression_gateway_count_validator
Required
true
Suppress Parameter Validation: Authorized Admin Groups
Description
Whether to suppress configuration warnings produced by the built-in parameter validation for the Authorized Admin Groups parameter.
Related Name
Default Value
false
API Name
service_config_suppression_hadoop_authorized_admin_groups
Required
true
Suppress Parameter Validation: Authorized Admin Users
Description
Whether to suppress configuration warnings produced by the built-in parameter validation for the Authorized Admin Users parameter.
Related Name
Default Value
false
API Name
service_config_suppression_hadoop_authorized_admin_users
Required
true
Suppress Parameter Validation: Authorized Groups
Description
Whether to suppress configuration warnings produced by the built-in parameter validation for the Authorized Groups parameter.
Related Name
Default Value
false
API Name
service_config_suppression_hadoop_authorized_groups
Required
true
Suppress Parameter Validation: Authorized Users
Description
Whether to suppress configuration warnings produced by the built-in parameter validation for the Authorized Users parameter.
Related Name
Default Value
false
API Name
service_config_suppression_hadoop_authorized_users
Required
true
Suppress Parameter Validation: Hadoop User Group Mapping Search Base
Description
Whether to suppress configuration warnings produced by the built-in parameter validation for the Hadoop User Group Mapping Search Base parameter.
Related Name
Default Value
false
API Name
service_config_suppression_hadoop_group_mapping_ldap_base
Required
true
Suppress Parameter Validation: Hadoop User Group Mapping LDAP Bind User Password
Description
Whether to suppress configuration warnings produced by the built-in parameter validation for the Hadoop User Group Mapping LDAP Bind User Password parameter.
Related Name
Default Value
false
API Name
service_config_suppression_hadoop_group_mapping_ldap_bind_passwd
Required
true
Suppress Parameter Validation: Hadoop User Group Mapping LDAP Bind User Distinguished Name
Description
Whether to suppress configuration warnings produced by the built-in parameter validation for the Hadoop User Group Mapping LDAP Bind User Distinguished Name parameter.
Related Name
Default Value
false
API Name
service_config_suppression_hadoop_group_mapping_ldap_bind_user
Required
true
Suppress Parameter Validation: Hadoop User Group Mapping LDAP Group Search Filter
Description
Whether to suppress configuration warnings produced by the built-in parameter validation for the Hadoop User Group Mapping LDAP Group Search Filter parameter.
Related Name
Default Value
false
API Name
service_config_suppression_hadoop_group_mapping_ldap_group_filter
Required
true
Suppress Parameter Validation: Hadoop User Group Mapping LDAP Group Name Attribute
Description
Whether to suppress configuration warnings produced by the built-in parameter validation for the Hadoop User Group Mapping LDAP Group Name Attribute parameter.
Related Name
Default Value
false
API Name
service_config_suppression_hadoop_group_mapping_ldap_group_name_attr
Required
true
Suppress Parameter Validation: Hadoop User Group Mapping LDAP TLS/SSL Truststore
Description
Whether to suppress configuration warnings produced by the built-in parameter validation for the Hadoop User Group Mapping LDAP TLS/SSL Truststore parameter.
Related Name
Default Value
false
API Name
service_config_suppression_hadoop_group_mapping_ldap_keystore
Required
true
Suppress Parameter Validation: Hadoop User Group Mapping LDAP TLS/SSL Truststore Password
Description
Whether to suppress configuration warnings produced by the built-in parameter validation for the Hadoop User Group Mapping LDAP TLS/SSL Truststore Password parameter.
Related Name
Default Value
false
API Name
service_config_suppression_hadoop_group_mapping_ldap_keystore_passwd
Required
true
Suppress Parameter Validation: Hadoop User Group Mapping LDAP Group Membership Attribute
Description
Whether to suppress configuration warnings produced by the built-in parameter validation for the Hadoop User Group Mapping LDAP Group Membership Attribute parameter.
Related Name
Default Value
false
API Name
service_config_suppression_hadoop_group_mapping_ldap_member_attr
Required
true
Suppress Parameter Validation: Hadoop User Group Mapping LDAP URL
Description
Whether to suppress configuration warnings produced by the built-in parameter validation for the Hadoop User Group Mapping LDAP URL parameter.
Related Name
Default Value
false
API Name
service_config_suppression_hadoop_group_mapping_ldap_url
Required
true
Suppress Parameter Validation: Hadoop User Group Mapping LDAP User Search Filter
Description
Whether to suppress configuration warnings produced by the built-in parameter validation for the Hadoop User Group Mapping LDAP User Search Filter parameter.
Related Name
Default Value
false
API Name
service_config_suppression_hadoop_group_mapping_ldap_user_filter
Required
true
Suppress Parameter Validation: Hadoop HTTP Authentication Cookie Domain
Description
Whether to suppress configuration warnings produced by the built-in parameter validation for the Hadoop HTTP Authentication Cookie Domain parameter.
Related Name
Default Value
false
API Name
service_config_suppression_hadoop_http_auth_cookie_domain
Required
true
Suppress Configuration Validator: HDFS Authentication And Authorization Validation
Description
Whether to suppress configuration warnings produced by the HDFS Authentication And Authorization Validation configuration validator.
Related Name
Default Value
false
API Name
service_config_suppression_hdfs_authentication_and_authorization_validator
Required
true
Suppress Parameter Validation: HDFS Proxy User Groups
Description
Whether to suppress configuration warnings produced by the built-in parameter validation for the HDFS Proxy User Groups parameter.
Related Name
Default Value
false
API Name
service_config_suppression_hdfs_proxy_user_groups_list
Required
true
Suppress Parameter Validation: HDFS Proxy User Hosts
Description
Whether to suppress configuration warnings produced by the built-in parameter validation for the HDFS Proxy User Hosts parameter.
Related Name
Default Value
false
API Name
service_config_suppression_hdfs_proxy_user_hosts_list
Required
true
Suppress Parameter Validation: HDFS Advanced Configuration Snippet (Safety Valve) for ssl-client.xml
Description
Whether to suppress configuration warnings produced by the built-in parameter validation for the HDFS Advanced Configuration Snippet (Safety Valve) for ssl-client.xml parameter.
Related Name
Default Value
false
API Name
service_config_suppression_hdfs_ssl_client_safety_valve
Required
true
Suppress Parameter Validation: Hive Proxy User Groups
Description
Whether to suppress configuration warnings produced by the built-in parameter validation for the Hive Proxy User Groups parameter.
Related Name
Default Value
false
API Name
service_config_suppression_hive_proxy_user_groups_list
Required
true
Suppress Parameter Validation: Hive Proxy User Hosts
Description
Whether to suppress configuration warnings produced by the built-in parameter validation for the Hive Proxy User Hosts parameter.
Related Name
Default Value
false
API Name
service_config_suppression_hive_proxy_user_hosts_list
Required
true
Suppress Parameter Validation: HTTP Proxy User Groups
Description
Whether to suppress configuration warnings produced by the built-in parameter validation for the HTTP Proxy User Groups parameter.
Related Name
Default Value
false
API Name
service_config_suppression_http_proxy_user_groups_list
Required
true
Suppress Parameter Validation: HTTP Proxy User Hosts
Description
Whether to suppress configuration warnings produced by the built-in parameter validation for the HTTP Proxy User Hosts parameter.
Related Name
Default Value
false
API Name
service_config_suppression_http_proxy_user_hosts_list
Required
true
Suppress Parameter Validation: HttpFS Proxy User Groups
Description
Whether to suppress configuration warnings produced by the built-in parameter validation for the HttpFS Proxy User Groups parameter.
Related Name
Default Value
false
API Name
service_config_suppression_httpfs_proxy_user_groups_list
Required
true
Suppress Parameter Validation: HttpFS Proxy User Hosts
Description
Whether to suppress configuration warnings produced by the built-in parameter validation for the HttpFS Proxy User Hosts parameter.
Related Name
Default Value
false
API Name
service_config_suppression_httpfs_proxy_user_hosts_list
Required
true
Suppress Parameter Validation: Hue Proxy User Groups
Description
Whether to suppress configuration warnings produced by the built-in parameter validation for the Hue Proxy User Groups parameter.
Related Name
Default Value
false
API Name
service_config_suppression_hue_proxy_user_groups_list
Required
true
Suppress Parameter Validation: Hue Proxy User Hosts
Description
Whether to suppress configuration warnings produced by the built-in parameter validation for the Hue Proxy User Hosts parameter.
Related Name
Default Value
false
API Name
service_config_suppression_hue_proxy_user_hosts_list
Required
true
Suppress Parameter Validation: Impala Proxy User Groups
Description
Whether to suppress configuration warnings produced by the built-in parameter validation for the Impala Proxy User Groups parameter.
Related Name
Default Value
false
API Name
service_config_suppression_impala_proxy_user_groups_list
Required
true
Suppress Parameter Validation: Impala Proxy User Hosts
Description
Whether to suppress configuration warnings produced by the built-in parameter validation for the Impala Proxy User Hosts parameter.
Related Name
Default Value
false
API Name
service_config_suppression_impala_proxy_user_hosts_list
Required
true
Suppress Parameter Validation: Kerberos Principal
Description
Whether to suppress configuration warnings produced by the built-in parameter validation for the Kerberos Principal parameter.
Related Name
Default Value
false
API Name
service_config_suppression_kerberos_princ_name
Required
true
Suppress Parameter Validation: Knox Proxy User Groups
Description
Whether to suppress configuration warnings produced by the built-in parameter validation for the Knox Proxy User Groups parameter.
Related Name
Default Value
false
API Name
service_config_suppression_knox_proxy_user_groups_list
Required
true
Suppress Parameter Validation: Knox Proxy User Hosts
Description
Whether to suppress configuration warnings produced by the built-in parameter validation for the Knox Proxy User Hosts parameter.
Related Name
Default Value
false
API Name
service_config_suppression_knox_proxy_user_hosts_list
Required
true
Suppress Parameter Validation: Livy Proxy User Groups
Description
Whether to suppress configuration warnings produced by the built-in parameter validation for the Livy Proxy User Groups parameter.
Related Name
Default Value
false
API Name
service_config_suppression_livy_proxy_user_groups_list
Required
true
Suppress Parameter Validation: Livy Proxy User Hosts
Description
Whether to suppress configuration warnings produced by the built-in parameter validation for the Livy Proxy User Hosts parameter.
Related Name
Default Value
false
API Name
service_config_suppression_livy_proxy_user_hosts_list
Required
true
Suppress Parameter Validation: Oozie Proxy User Groups
Description
Whether to suppress configuration warnings produced by the built-in parameter validation for the Oozie Proxy User Groups parameter.
Related Name
Default Value
false
API Name
service_config_suppression_oozie_proxy_user_groups_list
Required
true
Suppress Parameter Validation: Oozie Proxy User Hosts
Description
Whether to suppress configuration warnings produced by the built-in parameter validation for the Oozie Proxy User Hosts parameter.
Related Name
Default Value
false
API Name
service_config_suppression_oozie_proxy_user_hosts_list
Required
true
Suppress Parameter Validation: Phoenix Proxy User Groups
Description
Whether to suppress configuration warnings produced by the built-in parameter validation for the Phoenix Proxy User Groups parameter.
Related Name
Default Value
false
API Name
service_config_suppression_phoenix_proxy_user_groups_list
Required
true
Suppress Parameter Validation: Phoenix Proxy User Hosts
Description
Whether to suppress configuration warnings produced by the built-in parameter validation for the Phoenix Proxy User Hosts parameter.
Related Name
Default Value
false
API Name
service_config_suppression_phoenix_proxy_user_hosts_list
Required
true
Suppress Parameter Validation: System Group
Description
Whether to suppress configuration warnings produced by the built-in parameter validation for the System Group parameter.
Related Name
Default Value
false
API Name
service_config_suppression_process_groupname
Required
true
Suppress Parameter Validation: System User
Description
Whether to suppress configuration warnings produced by the built-in parameter validation for the System User parameter.
Related Name
Default Value
false
API Name
service_config_suppression_process_username
Required
true
Suppress Parameter Validation: Log and Query Redaction Policy
Description
Whether to suppress configuration warnings produced by the built-in parameter validation for the Log and Query Redaction Policy parameter.
Related Name
Default Value
false
API Name
service_config_suppression_redaction_policy
Required
true
Suppress Configuration Validator: Redaction Policy Validator
Description
Whether to suppress configuration warnings produced by the Redaction Policy Validator configuration validator.
Related Name
Default Value
false
API Name
service_config_suppression_redaction_policy_validator
Required
true
Suppress Configuration Validator: Hadoop RPC Protection validator
Description
Whether to suppress configuration warnings produced by the Hadoop RPC Protection validator configuration validator.
Related Name
Default Value
false
API Name
service_config_suppression_rpc_protection_validator
Required
true
Suppress Parameter Validation: Service Triggers
Description
Whether to suppress configuration warnings produced by the built-in parameter validation for the Service Triggers parameter.
Related Name
Default Value
false
API Name
service_config_suppression_service_triggers
Required
true
Suppress Parameter Validation: Service Monitor Derived Configs Advanced Configuration Snippet (Safety Valve)
Description
Whether to suppress configuration warnings produced by the built-in parameter validation for the Service Monitor Derived Configs Advanced Configuration Snippet (Safety Valve) parameter.
Related Name
Default Value
false
API Name
service_config_suppression_smon_derived_configs_safety_valve
Required
true
Suppress Parameter Validation: Service Monitor Proxy User Groups
Description
Whether to suppress configuration warnings produced by the built-in parameter validation for the Service Monitor Proxy User Groups parameter.
Related Name
Default Value
false
API Name
service_config_suppression_smon_proxy_user_groups_list
Required
true
Suppress Parameter Validation: Service Monitor Proxy User Hosts
Description
Whether to suppress configuration warnings produced by the built-in parameter validation for the Service Monitor Proxy User Hosts parameter.
Related Name
Default Value
false
API Name
service_config_suppression_smon_proxy_user_hosts_list
Required
true
Suppress Parameter Validation: Cluster-Wide Default TLS/SSL Client Truststore Location
Description
Whether to suppress configuration warnings produced by the built-in parameter validation for the Cluster-Wide Default TLS/SSL Client Truststore Location parameter.
Related Name
Default Value
false
API Name
service_config_suppression_ssl_client_truststore_location
Required
true
Suppress Parameter Validation: Cluster-Wide Default TLS/SSL Client Truststore Password
Description
Whether to suppress configuration warnings produced by the built-in parameter validation for the Cluster-Wide Default TLS/SSL Client Truststore Password parameter.
Related Name
Default Value
false
API Name
service_config_suppression_ssl_client_truststore_password
Required
true
Suppress Parameter Validation: Telemetry Publisher Proxy User Groups
Description
Whether to suppress configuration warnings produced by the built-in parameter validation for the Telemetry Publisher Proxy User Groups parameter.
Related Name
Default Value
false
API Name
service_config_suppression_telepub_proxy_user_groups_list
Required
true
Suppress Parameter Validation: Telemetry Publisher Proxy User Hosts
Description
Whether to suppress configuration warnings produced by the built-in parameter validation for the Telemetry Publisher Proxy User Hosts parameter.
Related Name
Default Value
false
API Name
service_config_suppression_telepub_proxy_user_hosts_list
Required
true
Suppress Parameter Validation: Trusted Kerberos Realms
Description
Whether to suppress configuration warnings produced by the built-in parameter validation for the Trusted Kerberos Realms parameter.
Related Name
Default Value
false
API Name
service_config_suppression_trusted_realms
Required
true
Suppress Parameter Validation: YARN Proxy User Groups
Description
Whether to suppress configuration warnings produced by the built-in parameter validation for the YARN Proxy User Groups parameter.
Related Name
Default Value
false
API Name
service_config_suppression_yarn_proxy_user_groups_list
Required
true
Suppress Parameter Validation: YARN Proxy User Hosts
Description
Whether to suppress configuration warnings produced by the built-in parameter validation for the YARN Proxy User Hosts parameter.
Related Name
Default Value
false
API Name
service_config_suppression_yarn_proxy_user_hosts_list
Required
true
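Several service-level suppression API names in this list appear to follow a simple pattern: the prefix `service_config_suppression_` followed by the API name of the parameter being validated (for example, `trusted_realms` becomes `service_config_suppression_trusted_realms`). The sketch below derives names under that assumption and builds a configuration body that sets them to true. The helper names and the `items` / `name` / `value` body shape are assumptions for illustration; role-level suppressions use the `role_config_suppression_` prefix and are not covered here.

```python
import json

# Prefix observed on the service-level suppression entries in this list.
SERVICE_PREFIX = "service_config_suppression_"


def suppression_name(parameter_api_name):
    """Derive a suppression API name from a parameter's API name (assumed pattern)."""
    return SERVICE_PREFIX + parameter_api_name


def build_suppression_update(parameter_api_names, suppress=True):
    """Build a config body that turns the given suppressions on or off.

    The body shape and the string-valued booleans are assumptions.
    """
    value = "true" if suppress else "false"
    return {
        "items": [
            {"name": suppression_name(name), "value": value}
            for name in parameter_api_names
        ]
    }


update = build_suppression_update(
    ["trusted_realms", "ssl_client_truststore_location"]
)
print(json.dumps(update, indent=2))
```

Checking a derived name against the API Name field of the matching entry is advisable, since not every suppression is guaranteed to follow the pattern.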