Homepage
/
Cloudera Private Cloud Base
7.1.7
(Private Cloud)
Search Documentation
▶︎
Cloudera
Reference Architectures
▶︎
Cloudera Public Cloud
Getting Started
Patterns
Preview Features
Data Catalog
Data Engineering
DataFlow
Data Hub
Data Warehouse
Data Warehouse Runtime
Cloudera AI
Management Console
Operational Database
Replication Manager
DataFlow for Data Hub
Runtime
▼
Cloudera Private Cloud
Data Services
Getting Started
Cloudera Manager
Management Console
Data Engineering
Data Warehouse
CDW Runtime
Machine Learning
Base
Getting Started
Runtime & Cloudera Manager
Upgrade
Flow Management
Streaming Analytics
▶︎
Cloudera Manager
Cloudera Manager
▶︎
Applications
Streaming Community Edition
Data Science Workbench
Data Visualization
Edge Management
Observability
Observability on premises
Workload XM On-Prem
▶︎
Legacy
Cloudera Enterprise
Flow Management
Stream Processing
HDP
HDF
Streams Messaging Manager
Streams Replication Manager
▶︎
Data Services
Getting Started
Cloudera Manager
Management Console
Data Engineering
Data Warehouse
CDW Runtime
Machine Learning
Base
Getting Started
Runtime & Cloudera Manager
Upgrade
Flow Management
Streaming Analytics
«
Filter topics
CDP Private Cloud Base
▼
Cloudera Runtime Release Notes
Overview
▼
7.1.7 SP3
What's new in Cloudera Runtime 7.1.7 SP3
Cloudera Runtime 7.1.7 SP3 component versions
▶︎
Using the Cloudera Runtime Maven repository 7.1.7 SP3
Runtime 7.1.7.3000-77
Runtime 7.1.7.3008-2
Runtime 7.1.7.3010-1
Runtime 7.1.7.3011-1
Runtime 7.1.7.3013-1
Runtime 7.1.7.3014-1
Runtime 7.1.7.3016-1
▶︎
Fixed issues in Cloudera Runtime 7.1.7 SP3
Fixed Issues in Apache Atlas
Fixed Issues in Apache Avro
Fixed issues in Apache Calcite
Fixed issues in Cloud Connectors
Fixed issues in Cruise Control
Fixed issues in Data Analytics Studio
Fixed Issues in Apache Hadoop
Fixed Issues in Apache HDFS
Fixed Issues in Apache HBase
Fixed Issues in Apache Hive
Fixed Issues in Hue
Fixed Issues in Apache Impala
Fixed Issues in Apache Kafka
Fixed Issues in Kerberos
Fixed Issues in Apache Kudu
Fixed Issues in Apache Knox
Fixed Issues in Livy
Fixed Issues in MapReduce
Fixed Issues in Navigator Encrypt
Fixed Issues in Apache Oozie
Fixed issues in Apache Ozone
Fixed Issues in Apache Parquet
Fixed Issues in Apache Phoenix
Fixed Issues in Apache Ranger
Fixed Issues in Schema Registry
Fixed Issues in Cloudera Search
Fixed Issues in Apache Solr
Fixed Issues in Apache Spark
Fixed Issues in Apache Sqoop
Fixed Issues in Streams Replication Manager
Fixed Issues in Streams Messaging Manager
Fixed Issues in Apache Tez
Fixed Issues in Apache YARN and YARN Queue Manager
Fixed Issues in Zeppelin
Fixed Issues in Apache Zookeeper
▼
Known Issues in Cloudera Runtime 7.1.7 SP3
Known Issues in Apache Atlas
Known Issues in Apache Avro
Known Issues in Cruise Control
Known Issues in Apache Calcite
Known Issues in Data Analytics Studio
Known Issues in Apache Hadoop
Known Issues in Apache HBase
Known Issues in HDFS
Known Issues in Apache Hive
Known Issues in Hue
Known Issues in Apache Impala
Known Issues in Apache Kafka
Known Issues in Apache Knox
Known Issues in Apache Kudu
Known Issues in Navigator Encrypt
Known Issues in Apache Oozie
Known Issues in Apache Ozone
Known Issues in Apache Parquet
Known Issues in Apache Phoenix
Known Issues in Apache Ranger
Known Issues in Schema Registry
Known Issues in Cloudera Search
Known Issues in Apache Spark
Known Issues in Streams Replication Manager
Known Issues for Apache Sqoop
Known issues in Streams Messaging Manager
Known Issues in MapReduce and YARN
Known Issues in Apache Zeppelin
Known Issues in Apache ZooKeeper
▶︎
Behavioral changes in Cloudera Runtime 7.1.7 SP3
Behavioral changes in Apache Hive
▶︎
CDP Private Cloud Base API Modifications and Removals
▶︎
CDP 7.1.7 SP2 and 7.1.7 SP3 Components with API differences
API Compatibility changes in 7.1.7 SP3 for Spark
API Compatibility changes in 7.1.7 SP3 for Zookeeper
▶︎
Deprecation notices in Cloudera Runtime 7.1.7 SP3
Platform and OS
Fixed Common Vulnerabilities and Exposures 7.1.7 SP3
Documentation Errata in Cloudera Runtime 7.1.7 SP3
▶︎
Cumulative hotfixes
▶︎
Cumulative hotfix CDP Private Cloud Base 7.1.7.3016-1 (SP3 Cumulative hotfix6)
Behavioral changes in Cloudera Runtime 7.1.7 SP3 CHF6
▶︎
Cumulative hotfix CDP Private Cloud Base 7.1.7.3014-1 (SP3 Cumulative hotfix5)
Behavioral changes in Cloudera Runtime 7.1.7 SP3 CHF5
Cumulative hotfix CDP Private Cloud Base 7.1.7.3013-1 (SP3 Cumulative hotfix4)
Cumulative hotfix CDP Private Cloud Base 7.1.7.3011-1 (SP3 Cumulative hotfix3)
Cumulative hotfix CDP Private Cloud Base 7.1.7.3010-1 (SP3 Cumulative hotfix2)
Cumulative hotfix CDP Private Cloud Base 7.1.7.3008-2 (SP3 Cumulative hotfix1)
▶︎
7.1.7 SP2
What's new in Cloudera Runtime 7.1.7 SP2
Cloudera Runtime 7.1.7 SP2 component versions
▶︎
Using the Cloudera Runtime Maven repository 7.1.7 SP2
Runtime 7.1.7.2000-305
Runtime 7.1.7.2002-1
Runtime 7.1.7.2009-1
Runtime 7.1.7.2010-1
Runtime 7.1.7.2011-1
Runtime 7.1.7.2013-1
Runtime 7.1.7.2016-1
Runtime 7.1.7.2021-1
Runtime 7.1.7.2023-1
Runtime 7.1.7.2024-1
Runtime 7.1.7.2025-2
Runtime 7.1.7.2026-3
Runtime 7.1.7.2030-1
Runtime 7.1.7.2032-1
Runtime 7.1.7.2035-2
Runtime 7.1.7.2038-1
Runtime 7.1.7.2040-4
Runtime 7.1.7.2046-1
Runtime 7.1.7.2047-1
Runtime 7.1.7.2050-1
▶︎
Fixed issues in Cloudera Runtime 7.1.7 SP2
Fixed Issues in Apache Atlas
Fixed Issues in Apache Avro
Fixed issues in Apache Calcite
Fixed issues in Cloud Connectors
Fixed issues in Cruise Control
Fixed issues in Data Analytics Studio
Fixed Issues in Apache Hadoop
Fixed Issues in Apache HDFS
Fixed Issues in Apache HBase
Fixed Issues in Apache Hive
Fixed Issues in Hue
Fixed Issues in Apache Impala
Fixed Issues in Apache Kafka
Fixed Issues in Kerberos
Fixed Issues in Apache Kudu
Fixed Issues in Apache Knox
Fixed Issues in Livy
Fixed Issues in MapReduce
Fixed Issues in Navigator Encrypt
Fixed Issues in Apache Oozie
Fixed issues in Apache Ozone
Fixed Issues in Apache Parquet
Fixed Issues in Apache Phoenix
Fixed Issues in Apache Ranger
Fixed Issues in Schema Registry
Fixed Issues in Cloudera Search
Fixed Issues in Apache Solr
Fixed Issues in Apache Spark
Fixed Issues in Apache Sqoop
Fixed Issues in Streams Replication Manager
Fixed Issues in Streams Messaging Manager
Fixed Issues in Apache Tez
Fixed Issues in Apache YARN and YARN Queue Manager
Fixed Issues in Zeppelin
Fixed Issues in Apache Zookeeper
Hotfixes in Cloudera Runtime 7.1.7 SP2
▶︎
Known issues in Cloudera Runtime 7.1.7 SP2
Known Issues in Apache Atlas
Known Issues in Apache Avro
Known issues in Cruise Control
Known issues in Apache Calcite
Known Issues in Data Analytics Studio
Known Issues in Apache Hadoop
Known Issues in Apache HBase
Known Issues in HDFS
Known Issues in Apache Hive
Known Issues in Hue
Known Issues in Apache Impala
Known Issues in Apache Kafka
Known Issues in Apache Knox
Known Issues in Apache Kudu
Known Issues in Navigator Encrypt
Known Issues in Apache Oozie
Known Issues in Apache Ozone
Known Issues in Apache Parquet
Known Issues in Apache Phoenix
Known Issues in Apache Ranger
Known Issues in Schema Registry
Known Issues in Cloudera Search
Known Issues in Apache Spark
Known Issues in Streams Replication Manager
Known Issues for Apache Sqoop
Known issues in Streams Messaging Manager
Known Issues in MapReduce and YARN
Known Issues in Apache Zeppelin
Known Issues in Apache ZooKeeper
▶︎
Behavioral changes in Cloudera Runtime 7.1.7 SP2
Behavioral changes in Apache Hive
Behavioral Changes in Cloudera Search
Behavioral changes in Apache Impala
Fixed Common Vulnerabilities and Exposures 7.1.7 SP2
Documentation Errata in Cloudera Runtime 7.1.7 SP2
▶︎
Cumulative hotfixes
Cumulative hotfix CDP PvC Base 7.1.7.2050-1 (SP2 cumulative hotfix19)
Cumulative hotfix CDP PvC Base 7.1.7.2047-1 (SP2 cumulative hotfix18)
Cumulative hotfix CDP PvC Base 7.1.7.2046-1 (SP2 cumulative hotfix17)
Cumulative hotfix CDP PvC Base 7.1.7.2040-4 (SP2 cumulative hotfix16)
Cumulative hotfix CDP PvC Base 7.1.7.2038-1 (SP2 cumulative hotfix15)
Cumulative hotfix CDP PvC Base 7.1.7.2035-2 (SP2 cumulative hotfix14)
Cumulative hotfix CDP PvC Base 7.1.7.2032-1 (SP2 cumulative hotfix13)
Cumulative hotfix CDP PvC Base 7.1.7.2030-1 (SP2 cumulative hotfix12)
Cumulative hotfix CDP PvC Base 7.1.7.2026-3 (SP2 cumulative hotfix11)
Cumulative hotfix CDP PvC Base 7.1.7.2025-2 (SP2 cumulative hotfix10)
Cumulative hotfix CDP PvC Base 7.1.7.2024-1 (SP2 cumulative hotfix9)
Cumulative hotfix CDP PvC Base 7.1.7.2023-1 (SP2 cumulative hotfix8)
Cumulative hotfix CDP PvC Base 7.1.7.2021-1 (SP2 cumulative hotfix7)
Cumulative hotfix CDP PvC Base 7.1.7.2016-1 (SP2 cumulative hotfix6)
Cumulative hotfix CDP PvC Base 7.1.7.2013-1 (SP2 cumulative hotfix5)
Cumulative hotfix CDP PvC Base 7.1.7.2011-1 (SP2 cumulative hotfix4)
Cumulative hotfix CDP PvC Base 7.1.7.2010-1 (SP2 cumulative hotfix3)
Cumulative hotfix CDP PvC Base 7.1.7.2009-1 (SP2 cumulative hotfix2)
Cumulative hotfix CDP PvC Base 7.1.7.2002-1 (SP2 cumulative hotfix1)
▶︎
7.1.7 SP1
What's new in Cloudera Runtime 7.1.7 SP1
Cloudera Runtime 7.1.7 SP1 component versions
▶︎
Using the Cloudera Runtime Maven repository 7.1.7 SP1
Maven Artifacts for Cloudera Runtime 7.1.7 SP1
▶︎
Fixed issues in Cloudera Runtime 7.1.7 SP1
Fixed Issues in Apache Atlas
Fixed Issues in Apache Avro
Fixed issues in Cruise Control
Fixed issues in Data Analytics Studio
Fixed Issues in Apache Hadoop
Fixed Issues in Apache HDFS
Fixed Issues in Apache HBase
Fixed Issues in Apache Hive
Fixed Issues in Hue
Fixed Issues in Apache Impala
Fixed Issues in Apache Kafka
Fixed Issues in Apache Kudu
Fixed Issues in Apache Knox
Fixed Issues in Navigator Encrypt
Fixed Issues in Apache Oozie
Fixed issues in Apache Ozone
Fixed Issues in Apache Parquet
Fixed Issues in Phoenix
Fixed Issues in Apache Ranger
Fixed Issues in Schema Registry
Fixed Issues in Cloudera Search
Fixed Issues in Apache Spark
Fixed Issues in Apache Sqoop
Fixed Issues in Streams Replication Manager
Fixed Issues in Streams Messaging Manager
Fixed Issues in Apache Tez
Fixed Issues in Apache YARN
Fixed Issues in Zeppelin
Fixed Issues in Apache Zookeeper
Hotfixes in Cloudera Runtime 7.1.7 SP1
▶︎
Known issues in Cloudera Runtime 7.1.7 SP1
Known Issues in Apache Atlas
Known Issues in Apache Avro
Known issues in Cruise Control
Known Issues in Data Analytics Studio
Known Issues in Apache Hadoop
Known Issues in Apache HBase
Known Issues in HDFS
Known Issues in Apache Hive
Known Issues in Hue
Known Issues in Apache Impala
Known Issues in Apache Kafka
Known Issues in Kerberos
Known Issues in Apache Knox
Known Issues in Apache Kudu
Known Issues in Navigator Encrypt
Known Issues in Apache Oozie
Known Issues in Apache Ozone
Known Issues in Apache Parquet
Known Issues in Apache Phoenix
Known Issues in Apache Ranger
Known Issues in Schema Registry
Known Issues in Cloudera Search
Known Issues in Apache Spark
Known Issues in Streams Replication Manager
Known Issues for Apache Sqoop
Known issues in Streams Messaging Manager
Known Issues in MapReduce and YARN
Known Issues in Apache Zeppelin
Known Issues in Apache ZooKeeper
▶︎
Behavioral changes in Cloudera Runtime 7.1.7 SP1
Behavioral changes in Apache Hive
Behavioral Changes in Cloudera Search
Behavioral changes in Apache HBase
Fixed Common Vulnerabilities and Exposures 7.1.7 SP1
Documentation Errata in Cloudera Runtime 7.1.7 SP1
Cloudera Logging is now available in CDP Private Cloud Base 7.1.7 SP1
Cumulative hotfixes
▶︎
7.1.7
▶︎
What's new in 7.1.7
Atlas
Cruise Control
Hive
Hue
Impala
Kafka
Kerberos
Kudu
Ozone
Ranger
Schema Registry
Search
Spark
Sqoop
Streams Replication Manager
Streams Messaging Manager
YARN
Unaffected Components in this release
Cloudera Runtime component versions
▶︎
Using the Cloudera Runtime Maven repository 7.1.7
Maven Artifacts for Cloudera Runtime 7.1.7.0
▶︎
Fixed issues in Cloudera Runtime 7.1.7
Atlas
Avro
Cruise Control
DAS
Hadoop
HDFS
HBase
Hive
Hue
Impala
Kafka
Kudu
Knox
Navigator Encrypt
Oozie
Ozone
Parquet
Phoenix
Ranger
Schema Registry
Search
Spark
Sqoop
Streams Replication Manager
Streams Messaging Manager
Tez
YARN
Zeppelin
Zookeeper
Hotfixes in Cloudera Runtime 7.1.7
▶︎
Known issues in Cloudera Runtime 7.1.7
Atlas
Avro
Cruise Control
DAS
Hadoop
HBase
HDFS
Hive
Hue
Impala
Kafka
Kerberos
Knox
Kudu
Navigator Encrypt
Oozie
Ozone
Parquet
Phoenix
Ranger
Schema Registry
Search
Spark
Streams Replication Manager
Sqoop
Streams Messaging Manager
YARN
Zeppelin
ZooKeeper
▶︎
Behavioral changes in Cloudera Runtime 7.1.7
Cruise Control
Hive
Kafka
Navigator Encrypt
Phoenix
Search
Impala
Streams Replication Manager
YARN
▶︎
Deprecation notices in Cloudera Runtime 7.1.7
Kudu
Kafka
HBase
HDFS
▶︎
CDP Private Cloud Base service groups and component reference
CDP PVC Base - Data Warehouse
CDP PVC Base - Data Engineering
CDP PVC Base - Operational Database
CDP PVC Base - Enterprise Essentials
▶︎
Cloudera Manager Release Notes
▶︎
Cloudera Manager 7.11.3 Cumulative hotfix 4 (CDP Private Cloud Base 7.1.7 SP3)
What's New in Cloudera Manager 7.11.3 Cumulative hotfix 4 (CDP Private Cloud Base 7.1.7 SP3)
Fixed Issues in Cloudera Manager 7.11.3 Cumulative hotfix 4 (CDP Private Cloud Base 7.1.7 SP3)
Known Issues in Cloudera Manager 7.11.3 Cumulative hotfix 4 (CDP Private Cloud Base 7.1.7 SP3)
Documentation Errata in Cloudera Manager 7.11.3 Cumulative hotfix 4 (CDP Private Cloud Base 7.1.7 SP3)
▶︎
Deprecation notices in Cloudera Manager 7.11.3 CHF4
Platform and OS
▶︎
Cloudera Manager 7.6.7 Release Notes (CDP Private Cloud Base 7.1.7 SP2)
What's New in Cloudera Manager 7.6.7 (CDP Private Cloud Base 7.1.7 SP2)
Fixed Issues in Cloudera Manager 7.6.7 (CDP Private Cloud Base 7.1.7 SP2)
Known Issues in Cloudera Manager 7.6.7 (CDP Private Cloud Base 7.1.7 SP2)
Fixed Common Vulnerabilities and Exposures in Cloudera Manager 7.6.7 (CDP Private Cloud Base 7.1.7 SP2)
▶︎
Cumulative hotfixes
Cloudera Manager 7.6.7 Cumulative hotfix 13
Cloudera Manager 7.6.7 Cumulative hotfix 12
Cloudera Manager 7.6.7 Cumulative hotfix 11
Cloudera Manager 7.6.7 Cumulative hotfix 10
Cloudera Manager 7.6.7 Cumulative hotfix 9
Cloudera Manager 7.6.7 Cumulative hotfix 8
Cloudera Manager 7.6.7 Cumulative hotfix 7
Cloudera Manager 7.6.7 Cumulative hotfix 6
Cloudera Manager 7.6.7 Cumulative hotfix 5
Cloudera Manager 7.6.7 Cumulative hotfix 4
Cloudera Manager 7.6.7 Cumulative hotfix 3
Cloudera Manager 7.6.7 Cumulative hotfix 2
Cloudera Manager 7.6.7 Cumulative hotfix 1
▶︎
Cloudera Manager 7.6.1 Release Notes (CDP Private Cloud Base 7.1.7 SP1)
What's New in Cloudera Manager 7.6.1 (CDP Private Cloud Base 7.1.7 SP1)
Fixed Issues in Cloudera Manager 7.6.1 (CDP Private Cloud Base 7.1.7 SP1)
Known Issues in Cloudera Manager 7.6.1 (CDP Private Cloud Base 7.1.7 SP1)
Documentation Errata in Cloudera Manager 7.6.1 (CDP Private Cloud Base 7.1.7 SP1)
▶︎
Cumulative hotfixes
Cloudera Manager 7.6.1 Cumulative hotfix 9
Cloudera Manager 7.6.1 Cumulative hotfix 8
Cloudera Manager 7.6.1 Cumulative hotfix 7
Cloudera Manager 7.6.1 Cumulative hotfix 6
Cloudera Manager 7.6.1 Cumulative hotfix 5
Cloudera Manager 7.6.1 Cumulative hotfix 4
Cloudera Manager 7.6.1 Cumulative hotfix 3
Cloudera Manager 7.6.1 Cumulative hotfix 2
Cloudera Manager 7.6.1 Cumulative hotfix 1
▶︎
Cloudera Manager 7.4.4 Release Notes
What's New in Cloudera Manager 7.4.4
Fixed Issues in Cloudera Manager 7.4.4
Known Issues in Cloudera Manager 7.4.4
Known Issues for IBM PowerPC
▶︎
Concepts
▶︎
Cloudera Manager
▶︎
Cloudera Manager Overview
Overview
Terminology
Architecture
State Management
▶︎
Cloudera Manager Admin Console
Home Page
Automatic Logout
Software Distribution Management
Process Management
Host Management
Cloudera Manager Agents
Resource Management
User Management
Security Management
Monitoring a Cluster Using Cloudera Manager
Cloudera Management Service
Cluster Configuration Overview
Server and Client Configuration
Cloudera Manager API
▶︎
Virtual Private Clusters and Cloudera SDX
Advantages of Separating Compute and Data Resources
Architecture
Performance Trade Offs
Compatibility Considerations for Virtual Private Clusters
Networking Considerations for Virtual Private Clusters
▶︎
Storage
▶︎
Apache Hadoop HDFS Overview
▶︎
Introduction
Overview of HDFS
▶︎
NameNodes
▶︎
Moving NameNode roles
Moving highly available NameNode, failover controller, and JournalNode roles using the Migrate Roles wizard
Moving a NameNode to a different host using Cloudera Manager
▶︎
Sizing NameNode heap memory
Environment variables for sizing NameNode heap memory
Monitoring heap memory usage
Files and directories
Disk space versus namespace
Replication
Examples of estimating NameNode heap memory
Remove or add storage directories for NameNode data directories
▶︎
DataNodes
How NameNode manages blocks on a failed DataNode
Replace a disk on a DataNode host
Remove a DataNode
Fixing block inconsistencies
Add storage directories using Cloudera Manager
Remove storage directories using Cloudera Manager
▶︎
Configuring storage balancing for DataNodes
Configure storage balancing for DataNodes using Cloudera Manager
Perform a disk hot swap for DataNodes using Cloudera Manager
▶︎
JournalNodes
Moving the JournalNode edits directory for a role group using Cloudera Manager
Moving the JournalNode edits directory for a role instance using Cloudera Manager
Synchronizing the contents of JournalNodes
▶︎
Apache Ozone Overview
▶︎
Introduction to Ozone
Ozone architecture
Ozone security architecture
How Ozone manages read operations
How Ozone manages write operations
▶︎
Apache HBase Overview
Introduction
▶︎
Apache Kudu Overview
Kudu introduction
Kudu architecture in a CDP private cloud base deployment
Kudu network architecture
Kudu-Impala integration
Example use cases
Kudu concepts
▶︎
Apache Kudu usage limitations
Schema design limitations
Partitioning limitations
Scaling recommendations and limitations
Server management limitations
Cluster management limitations
Impala integration limitations
Spark integration limitations
Kudu security limitations
Other known issues
More Resources
▶︎
Apache Kudu Background Operations
Maintenance manager
Flushing data to disk
Compacting on-disk data
Write-ahead log garbage collection
Tablet history garbage collection and the ancient history mark
▶︎
Apache Hadoop YARN Overview
Introduction
YARN Features
Understanding YARN architecture
▶︎
Data Access
▶︎
Data Analytics Studio Overview
Data Analytics Studio overview
DAS architecture
▶︎
Apache Hive Metastore Overview
Introduction to Hive metastore
▶︎
Apache Hive Overview
Apache Hive features
Hive on Tez introduction
Hive unsupported interfaces and features
Apache Hive 3 architectural overview
▶︎
Installing Hive on Tez and adding a HiveServer role
Adding a HiveServer role
Changing the Hive warehouse location
Apache Hive content roadmap
▶︎
Apache Impala Overview
Introduction
Components
▶︎
Hue Overview
Hue overview
▶︎
Cloudera Search Overview
What is Cloudera Search
How Cloudera Search works
Cloudera Search and CDP
Search and other Runtime components
Cloudera Search architecture
Local file system support
▶︎
Cloudera Search tasks and processes
Ingestion
Indexing
Querying
ETL with Cloudera Morphlines
Backing up and restoring data
▶︎
Operational Database
▶︎
Operational Database Overview
▶︎
Operational Database overview
Introduction to Apache HBase
▶︎
Introduction to Apache Phoenix
Apache Phoenix and SQL
▶︎
Operational Database powered by Apache Accumulo Overview
Release notes
OpDB overview
CLI tool support
System requirements
▶︎
Introduction to HBase Multi-cluster Client
▶︎
Introduction to HBase Multi-cluster Client
HBase MCC Usage with Kerberos
HBase MCC Usage in Spark with Scala
HBase MCC Usage in Spark with Java
Zookeeper Configurations
HBase MCC Configurations
HBase MCC Restrictions
▶︎
Data Science
▶︎
Apache Spark Overview
Apache Spark Overview
Unsupported Apache Spark Features
▶︎
Apache Zeppelin Overview
Overview
▶︎
CDP Security Overview
▶︎
Introduction
What is CDP Private Cloud?
Importance of a Secure Cluster
Secure by Design
▶︎
Pillars of Security
Authentication
Authorization
Encryption
Identity Management
Security Management Model
▶︎
Security Levels
Choosing the Sufficient Security Level for Your Environment
Logical Architecture
SDX
Security Terms
▶︎
Governance
▶︎
Governance Overview
Using metadata for cluster governance
Data Stewardship with Apache Atlas
Apache Atlas dashboard tour
Apache Atlas metadata collection overview
Atlas metadata model overview
▶︎
Controlling Data Access with Tags
Atlas classifications drive Ranger policies
When to use Atlas classifications for access control
▶︎
How tag-based access control works
Propagation of tags as deferred actions
Examples of controlling data access using classifications
▶︎
Extending Atlas to Manage Metadata from Additional Sources
Top-down process for adding a new metadata source
▶︎
Streams Messaging
▶︎
Apache Kafka Overview
Kafka Introduction
▶︎
Kafka Architecture
Brokers
Topics
Records
Partitions
Record order and assignment
Logs and log segments
Kafka brokers and Zookeeper
Leader positions and in-sync replicas
▶︎
Kafka FAQ
Basics
Use cases
▶︎
Cruise Control Overview
Kafka cluster load balancing using Cruise Control
▶︎
Streams Messaging Manager Overview
Introduction to Streams Messaging Manager
▶︎
Streams Replication Manager Overview
Overview
Key Features
Main Use Cases
▶︎
Use Case Architectures
▶︎
Highly Available Kafka Architectures
Active / Stand-by Architecture
Active / Active Architecture
Cross Data Center Replication
▶︎
Cluster Migration Architectures
On-premise to Cloud and Kafka Version Upgrade
Aggregation for Analytics
▶︎
Streams Replication Manager Architecture
▶︎
Streams Replication Manager Driver
Connect workers
Connectors
Task architecture and load-balancing
Driver inter-node coordination
Streams Replication Manager Service
▶︎
Understanding Replication Flows
Replication Flows Overview
Remote Topics
Bidirectional Replication Flows
Fan-in and Fan-out Replication Flows
Understanding co-located and external clusters
Understanding SRM properties, their configuration and hierarchy
▶︎
Schema Registry Overview
▶︎
Schema Registry Overview
Examples of interacting with Schema Registry
▶︎
Schema Registry Use Cases
Use Case 1: Registering and Querying a Schema for a Kafka Topic
Use Case 2: Reading/Deserializing and Writing/Serializing Data from and to a Kafka Topic
Use Case 3: Dataflow Management with Schema-based Routing
Schema Registry Component Architecture
▶︎
Schema Registry Concepts
Schema Entities
Compatibility policies
▶︎
Planning
▶︎
Deployment Planning for Cloudera Search
Planning overview
Dimensioning guidelines
Schemaless mode overview and best practices
Advantages of defining a schema for production use
▶︎
Planning for Infra Solr
Calculating Infra Solr resource needs
▶︎
Planning for Apache Impala
Guidelines for Schema Design
User Account Requirements
▶︎
Planning for Apache Kudu
▶︎
Kudu schema design
The perfect schema
▶︎
Column design
Decimal type
Varchar type
Column encoding
Column compression
▶︎
Primary key design
Primary key index
Considerations for backfill inserts
▶︎
Partitioning
▶︎
Range partitioning
Adding and Removing Range Partitions
Hash partitioning
Multilevel partitioning
Partition pruning
▶︎
Partitioning examples
Range partitioning
Hash partitioning
Hash and range partitioning
Hash and hash partitioning
Schema alterations
Schema design limitations
Partitioning limitations
▶︎
Kudu transaction semantics
Single tablet write operations
Writing to multiple tablets
Read operations (scans)
▶︎
Known issues and limitations
Writes
Reads (scans)
▶︎
Scaling Kudu
Terms
Example workload
▶︎
Memory
Verifying if a memory limit is sufficient
File descriptors
Threads
Scaling recommendations and limitations
▶︎
Planning for Streams Replication Manager
Streams Replication Manager requirements
Recommended deployment architecture
▶︎
Installation & Upgrade
▶︎
Installing CDP Private Cloud Base
CDP Private Cloud Base Installation Guide
▶︎
Version and Download Information
Cloudera Manager Version Information
Cloudera Manager Download Information
Cloudera Runtime Version Information
Cloudera Runtime Download Information
Cloudera Manager support for Cloudera Runtime and CDH
CDP Private Cloud Base Trial Download Information
▶︎
CDP Private Cloud Base Requirements and Supported Versions
▶︎
Hardware Requirements
▶︎
Cloudera Manager
Cloudera Manager Server
Service Monitor Requirements
Host Monitor
Reports Manager
Agent Hosts
Event Server
Alert Publisher
▶︎
Cloudera Runtime
Atlas
Data Analytics Studio (DAS)
HDFS
HBase
Hive
Hue
Impala
Kafka
Key Trustee Server
Ranger KMS
Kudu
Oozie
Ozone
Phoenix
Ranger
Search
Spark
YARN
ZooKeeper
Operating System Requirements
Database Requirements
Java Requirements
Networking and Security Requirements
Data at Rest Encryption Requirements
▶︎
Third-party filesystems
IBM Spectrum Scale
Dell EMC PowerScale
▶︎
Trial Installation
▶︎
Installing a Trial Cluster
Before You Begin a Trial Installation
Download the Trial version of CDP Private Cloud Base
Run the Cloudera Manager Server Installer
Install Cloudera Runtime
Set Up a Cluster Using the Wizard
Stopping the Embedded PostgreSQL Database
Starting the Embedded PostgreSQL Database
Changing Embedded PostgreSQL Database Passwords
▶︎
Migrating from the Cloudera Manager Embedded PostgreSQL Database Server to an External PostgreSQL Database
Prerequisites
Identify Roles that Use the Embedded Database Server
Migrate Databases from the Embedded Database Server to the External PostgreSQL Database Server
▶︎
Installing and Configuring CDP with FIPS
Overview
Prerequisites
Configure Cloudera Manager for FIPS
Install and configure additional required components
▶︎
Production Installation
▶︎
Before You Install
▶︎
Storage Space Planning for Cloudera Manager
Cluster Lifecycle Management with Cloudera Manager
Configure Network Names
Setting SELinux Mode
Disabling the Firewall
Enable an NTP Service
Impala Requirements
Runtime Cluster Hosts and Role Assignments
Allocating Hosts for Key Trustee Server and Key Trustee KMS
▶︎
Configuring Local Package and Parcel Repositories
▶︎
Understanding Package Management
Repository Configuration Files
Listing Repositories
▶︎
Configuring a Local Package Repository
▶︎
Creating a Permanent Internal Repository
Setting Up a Web Server
Downloading and Publishing the Package Repository
Creating a Temporary Internal Repository
Configuring Hosts to Use the Internal Repository
▶︎
Configuring a Local Parcel Repository
▶︎
Using an Internally Hosted Remote Parcel Repository
Setting Up a Web Server
Downloading and Publishing the Parcel Repository
Configuring Cloudera Manager to Use an Internal Remote Parcel Repository
Using a Local Parcel Repository
Configuring /tmp directory for cluster hosts
Installing Cloudera Manager, Cloudera Runtime, and Managed Services
Step 1: Configure a Repository for Cloudera Manager
▶︎
Step 2: Install Java Development Kit
Installing OpenJDK on Cloudera Manager
Installing OpenJDK for CDP Runtime
Installing Oracle JDK for CDP Runtime
▶︎
Step 3: Install Cloudera Manager Server
Step 3: Deploy Cloudera Manager Server and Cloudera Manager Agents
▶︎
Step 4. Install and Configure Databases
Required Databases
▶︎
Install and Configure PostgreSQL for CDP
Installing Postgres JDBC Driver
Installing PostgreSQL Server
Installing the psycopg2 Python package for PostgreSQL-backed Hue
Configuring and Starting the PostgreSQL Server
Install and Configure MySQL for CDP
Install and Configure MariaDB for CDP
▶︎
Install and Configure Oracle Database
Configuring the Hue Server to Store Data in the Oracle database
▶︎
Configuring a database for Ranger or Ranger KMS
Configuring a Ranger or Ranger KMS Database: MySQL/MariaDB
Configuring a Ranger or Ranger KMS Database: Oracle
Configuring a Ranger or Ranger KMS Database: Oracle using /ServiceName format
Configuring a PostgreSQL Database for Ranger or Ranger KMS
Configure Ranger with SSL/TLS enabled PostgreSQL Database
▶︎
Configuring the Database for Streaming Components
Configure PostgreSQL for Streaming Components
Configuring MySQL for Streaming Components
Configuring Oracle for Streaming Components
▶︎
Step 5: Set up and configure the Cloudera Manager database
Syntax for scm_prepare_database.sh
▶︎
Step 6: Start the Cloudera Manager Server and Agents
Installation Wizard
▶︎
Step 7: Set Up a Cluster Using the Wizard
Select Services
Assign Roles
Setup Database
Enter Required Parameters
Review Changes
Command Details
Summary
Tuning JVM Garbage Collection
(Recommended) Enable Auto-TLS
Additional Steps for Apache Ranger
▶︎
Installing Apache Knox
Apache Knox Install Role Parameters
▶︎
Setting Up Data at Rest Encryption for HDFS
Installing Ranger KMS backed by a Database and HA
Installing Ranger KMS backed with a Key Trustee Server and HA
Installing a Java Keystore KMS
Installing Cloudera Navigator Encrypt
Installing Cloudera Navigator Key HSM
Installing Ranger RMS
▶︎
Custom Installation Solutions
▶︎
Privileged commands for Cloudera Manager installation
Prerequisites and exceptions for the example configuration
Example configuration to add to the sudoers file
▶︎
Creating Virtual Images of Cluster Hosts
Creating a Pre-Deployed Cloudera Manager Host
Instantiating a Cloudera Manager Image
Creating a Pre-Deployed Worker Host
Instantiating a worker host
▶︎
Manually Install Cloudera Software Packages
Install Cloudera Manager Packages
Manually Install Cloudera Manager Agent Packages
▶︎
Installation Reference
▶︎
Ports
Ports Used by Cloudera Manager
Ports Used by Cloudera Navigator Key Trustee Server
Ports Used by Cloudera Runtime Components
Ports Used by DistCp
Ports Used by Third-Party Components
Service Dependencies in Cloudera Manager
Cloudera Manager sudo command options
Introduction to Parcels
▶︎
After You Install
Deploying Clients
Testing the Installation
Checking Host Heartbeats
Running a MapReduce Job
Testing with Hue
Deploying Atlas service
Secure Your Cluster
Installing the GPL Extras Parcel
Configuring HDFS properties to optimize log collection
Troubleshooting Installation Problems
▶︎
Uninstalling Cloudera Manager and Managed Software
Record User Data Paths
Stop all Services
Deactivate and Remove Parcels
Delete the Cluster
Uninstall the Cloudera Manager Server
Uninstall Cloudera Manager Agent and Managed Software
Remove Cloudera Manager, User Data, and Databases
Uninstalling a Runtime Component From a Single Host
▶︎
Custom Installation Scenarios
Installing a Kafka-centric cluster
▶︎
Quick Start Deployment for a Streams Cluster
Create a Streams Cluster on CDP Private Cloud Base
▶︎
Before You Install
System Requirements for POC Streams Cluster
Disable the Firewall
Enable an NTP Service
▶︎
Installing a Trial Streaming Cluster
Download the Trial version of CDP Private Cloud Base
Run the Cloudera Manager Server Installer
Install Cloudera Runtime
Set Up a Streaming Cluster
▶︎
Getting Started on your Streams Cluster
Create a Kafka Topic to Store your Events
Write a few Events into the Topic
Read the Events
Monitor your Cluster from the SMM UI
After Evaluating Trial Software
▶︎
Installing Operational Database powered by Apache Accumulo
▶︎
Installing Accumulo Parcel 1.0.0
▶︎
Install OpDB
Install OpDB CSD file
Install CDP
▶︎
Install OpDB parcel
Install OpDB parcel using Local Parcel Repository
Install OpDB parcel using Remote Parcel Repository
▶︎
Add Accumulo on CDP service
Add unsecure Accumulo on CDP service to your cluster
Add secure Accumulo on CDP service to your cluster
Creating trace user in unsecure OpDB deployment
Check trace table
Provide user permissions
Verify your OpDB installation
▶︎
Installing Accumulo Parcel 1.1.0
▶︎
Install OpDB
Install OpDB CSD file
Install CDP
▶︎
Install OpDB parcel
Install OpDB parcel using Local Parcel Repository
Install OpDB parcel using Remote Parcel Repository
▶︎
Add Accumulo on CDP service
Add unsecure Accumulo on CDP service to your cluster
Add secure Accumulo on CDP service to your cluster
Verify your OpDB installation
▶︎
Installing Accumulo Parcel 1.10
▶︎
Install Accumulo
Install Accumulo CSD file
Install CDP
▶︎
Install Accumulo 1.10 parcel
Install Accumulo parcel using Local Parcel Repository
Install Accumulo using Remote Parcel Repository
▶︎
Add Accumulo on CDP service
Add unsecure Accumulo on CDP service to your cluster
Add secure Accumulo on CDP service to your cluster
Creating a trace user in unsecure Accumulo deployment
Check trace table
Provide user permissions
Verify your Accumulo installation
Getting Started with CDP Upgrade and Migration
In-Place Upgrade CDH 6 to CDP Private Cloud Base
In-Place Upgrade CDH 5 to CDP Private Cloud Base
In-Place Upgrade HDP3 to CDP Private Cloud Base
In-Place Upgrade HDP2 to CDP Private Cloud Base
In-Place Upgrade CDP Private Cloud Base
▶︎
Managing Clusters
Accessing the Cloudera Manager Admin Console
▶︎
Adding and Deleting Clusters
Adding a Compute Cluster and Data Context
▶︎
Adding a Cluster Using New Hosts
Step 1: Welcome (Add Cluster - Installation)
Step 2: Cluster Basics
Step 3: Setup Auto-TLS
Step 4: Specify Hosts
Step 5: Select Repository
Step 6: Select JDK
Step 7: Enter Login Credentials
Step 8: Install Agents
Step 9: Install Parcels
Step 11: Inspect Cluster
▶︎
Adding a Cluster Using Currently Managed Hosts
Step 1: Welcome (Add Cluster - Installation)
Step 2: Cluster Basics
Step 3: Setup Auto-TLS
Step 4: Specify Hosts
Step 5: Select Repository
Step 6: Install Parcels
Step 8: Inspect Cluster
Deleting a Cluster
▶︎
Tutorial: Using Impala, Hive and Hue with Virtual Private Clusters
Set Up an Environment
Using impala-shell and Hive
View HDFS directory structure of Compute clusters
Insert data in test_table through Spark
Hue in a Virtual Private Cluster Environment
Starting, Stopping, Refreshing, and Restarting a Cluster
▶︎
Pausing a Cluster in AWS
Shutting Down and Starting Up the Cluster
Renaming a Cluster
▶︎
Managing Hosts
Viewing Host Status
Adding a Host to a Cluster
Parcels
Configuring Hosts
Viewing Host Role Assignments
▶︎
Host Templates
Creating a Host Template
Editing a Host Template
Deleting a Host Template
Applying a Host Template to a Host
Hosts Disks Overview
▶︎
Deleting Hosts
Deleting a Host from Cloudera Manager
Removing a Host From a Cluster
Stopping All the Roles on a Host
Starting All the Roles on a Host
Changing Hostnames
Moving a Host Between Clusters
▶︎
Configuring Upgrade Domains
Configuring Upgrade Domains
Changing the Upgrade Domain for hosts
Putting all Hosts in an Upgrade Domain group into Maintenance Mode
Specifying Racks for Hosts
▶︎
Performing Maintenance on a Cluster Host
Decommissioning Hosts
Recommissioning Hosts
▶︎
Tuning and Troubleshooting Host Decommissioning
Tuning HDFS Prior to Decommissioning DataNodes
Tuning HBase Prior to Decommissioning DataNodes
Performance Considerations
Troubleshooting Performance of Decommissioning
Maintenance Mode
Viewing the Maintenance Mode Status of a Cluster
▶︎
Managing Roles
▶︎
Role Instances
Adding a Role Instance
Starting, Stopping, and Restarting Role Instances
Decommissioning Role Instances
Recommissioning Role Instances
Deleting Role Instances
Configuring Roles to Use a Custom Garbage Collection Parameter
▶︎
Role Groups
Creating a Role Group
Managing Role Groups
Default User Roles
Backing up Cloudera Manager databases
▶︎
Managing Cloudera Runtime Services
▶︎
Adding a Service
Prerequisites for installing Atlas
Installing Atlas using Add Service
Installing Ranger using Add Service
Comparing Configurations for a Service Between Clusters
Starting a Cloudera Runtime Service on All Hosts
Stopping a Cloudera Runtime Service on All Hosts
Restarting a Cloudera Runtime Service
Rolling Restart
Aborting a Pending Command
Deleting Services
Renaming a Service
Configuring Maximum File Descriptors
▶︎
Extending Cloudera Manager
Add-on Services
Configuring Services to Use LZO Compression
▶︎
Core Settings Service
Configuration parameters migrated to Core Settings Service
▶︎
Performance Management
▶︎
Optimizing Performance in Cloudera Runtime
Disabling Transparent Hugepages (THP)
▶︎
Setting the vm.swappiness Linux Kernel Parameter
File system partitioning recommendations
Improving Performance in Shuffle Handler and IFile Reader
Tips and Best Practices for Jobs
Decrease Reserve Space
Choosing and Configuring Data Compression
▶︎
Managing Cloudera Manager
Automatic Logout
Starting, Stopping, and Restarting the Cloudera Manager Server
▶︎
Configuring Cloudera Manager
Configuring Cloudera Manager Server Ports
Configuring Network Settings for a Proxy Server
Moving the Cloudera Manager Server to a New Host
▶︎
Migrating Embedded PostgreSQL Database to External PostgreSQL Database
Step 1: Identify Roles that Use the Embedded Database Server
Step 2: Migrate Databases from the Embedded Database Server to the External PostgreSQL Database Server
▶︎
Migrating from PostgreSQL Database Server to MySQL/Oracle Database Server
Migrate from the Cloudera Manager External PostgreSQL Database Server to a MySQL/Oracle Database Server
Managing Cloudera Manager Server Logs
▶︎
Cloudera Manager Agents
Starting, Stopping, and Restarting Cloudera Manager Agents
Configuring Cloudera Manager Agents
Managing the Cloudera Manager Agent Logs
▶︎
Overview of Parcels
Advantages of Parcels
Parcel Life Cycle
Parcel Locations
Managing Parcels
Viewing Parcel Usage
Parcel Configuration Settings
▶︎
Managing Licenses
Accessing the License Page
Ending a CDP Private Cloud Base Trial
Upgrading from a CDP Private Cloud Base Trial to CDP Private Cloud Base
Renewing a License
Cloudera Manager User Roles
Other Tasks and Settings
▶︎
Cloudera Management Service
Starting the Cloudera Management Service
Stopping the Cloudera Management Service
Restarting the Cloudera Management Service
Starting and Stopping Cloudera Management Service Roles
Configuring Management Service Database Limits
▶︎
Securing sensitive information using a Secure Credential Storage Provider (Technical Preview)
Configuring a Secure Credential Storage Provider for Cloudera Manager (Technical Preview)
Disabling or changing the Credential Storage Provider (Technical Preview)
▶︎
Resource Management
▶︎
Static Service Pools
Enabling and Configuring Static Service Pools
Disabling Static Service Pools
▶︎
Linux Control Groups (cgroups)
Enabling Resource Management with Control Groups
Configuring Resource Parameters
Configuring Custom Cgroups
▶︎
Data Storage for Monitoring Data
Configuring Service Monitor Data Storage
Configuring Host Monitor Data Storage
Viewing Host and Service Monitor Data Storage
Data Granularity and Time-Series Metric Data
Moving Monitoring Data on an Active Cluster
▶︎
Host Monitor and Service Monitor Memory Configuration
Configuring Memory Allocations
▶︎
Accessing Storage Using Amazon S3
Referencing S3 Credentials for YARN, MapReduce, or Spark Clients
Referencing Amazon S3 in URIs
Using Fast Upload with Amazon S3
Enabling Fast Upload using Cloudera Manager
▶︎
Configuring and Managing S3Guard
Configuring S3Guard for Cluster Access to S3
Editing the S3Guard Configuration
Running the Prune Command Using Cloudera Manager Admin Console
Running the Prune Command Using the Cloudera Manager API
How to Configure a MapReduce Job to Access S3 with an HDFS Credstore
▶︎
Importing Data into Amazon S3 Using Sqoop
▶︎
Authentication
Using a Credential Provider to Secure S3 Credentials
▶︎
Sqoop Import into Amazon S3
Import Data from RDBMS into an S3 Bucket
Import Data into S3 Bucket in Incremental Mode
Import Data into an External Hive Table Backed by S3
S3Guard with Sqoop
▶︎
Accessing Storage Using Microsoft ADLS
Configuring OAuth in Data Hub
Configuring OAuth with core-site.xml
Configuring OAuth with the Hadoop CredentialProvider
Configuring Built-in TLS Acceleration
▶︎
Importing Data into Microsoft Azure Data Lake Store (Gen1 and Gen2) Using Sqoop
Prerequisites
Authentication
Sqoop Import into ADLS
▶︎
Configuring Clusters
Accessing the Cloudera Manager Admin Console
▶︎
Modifying Configuration Properties Using Cloudera Manager
▶︎
Changing the Configuration of a Service or Role Instance
Searching for Properties
Validation of Configuration Properties
Overriding Configuration Properties
Viewing and Editing Overridden Configuration Properties
Resetting Configuration Properties to the Default Value
Viewing and Editing Host Overrides
Restarting Services and Instances after Configuration Changes
▶︎
Suppressing Configuration and Parameter Validation Warnings
Suppressing a Configuration Validation in Cloudera Manager
Managing Suppressed Validations
Suppressing Configuration Validations Before They Trigger Warnings
Viewing a List of All Suppressed Validations
Cluster-Wide Configuration
Custom Configuration
Setting an Advanced Configuration Snippet for a Cloudera Runtime Service
Setting an Advanced Configuration Snippet for a Cluster
Stale Configurations
▶︎
Client Configuration Files
How Client Configurations are Deployed
Downloading Client Configuration Files
Manually Redeploying Client Configuration Files
▶︎
Viewing and Reverting Configuration Changes
Changes for a service, role, or host
Changes for a cluster
Autoconfiguration
▶︎
Using the Cloudera Manager API
▶︎
Using the Cloudera Manager API to backup and restore clusters
Backing up the Cloudera Manager configuration
Restoring the Cloudera Manager configuration
▶︎
Using the Cloudera Manager API to Manage and Configure Clusters
Using the Cloudera Manager API for Cluster Automation
Using the Cloudera Manager API to Obtain Configuration Files
Using the Cloudera Manager API to Set Advanced Configuration Snippets (Safety Valves)
Using Tags in Cloudera Manager
Initiating HDFS failover using the Cloudera Manager API
▶︎
Creating a Runtime Cluster Using a Cloudera Manager Template
Exporting the Cluster Configuration
Preparing a New Cluster
Creating the Template
Importing the Template to a New Cluster
Sample Python Code
Disabling Redaction of sensitive information when using the Cloudera Manager API
▶︎
Monitoring and Diagnostics
Accessing the Cloudera Manager Admin Console
Monitoring and Diagnostics
▶︎
Time Line
Selecting a Point In Time or a Time Range
▶︎
Health Tests
Viewing Health Test Results
Suppressing Health Test Results
Suppressing a Health Test
Configuring Suppression of Health Tests Before Tests Run
Viewing a List of Suppressed Health Tests
Unsuppressing Health Tests
▶︎
Viewing Charts for Cluster, Service, Role, and Host Instances
Exporting Data from Charts
Adding and Removing Charts from a Dashboard
Creating Triggers from Charts
▶︎
Configuring Monitoring Settings
Configuring Health Monitoring
Configuring Service Monitoring
Configuring Host Monitoring
Configuring Directory Monitoring
Configuring YARN Application Monitoring
Configuring Impala Query Monitoring
Configuring Impala Query Data Store Maximum Size
Enabling Configuration Change Alerts
Filtering Metrics
▶︎
Configuring Log Events
Configuring Logs
Configuring Logging Thresholds
Configuring Log Directories
Enabling and Disabling Log Event Capture
Configuring Which Log Messages Become Events
Configuring Log Alerts
Monitoring Clusters
▶︎
Cluster Utilization Report overview
Enable the Cluster Utilization Report
Configure the Cluster Utilization Report
▶︎
Use the Cluster Utilization Report to manage resources
Overview Tab
YARN Tab
Impala Tab
Download the Cluster Utilization Report
▶︎
Creating a Custom Cluster Utilization Report
Metrics and queries
Impala query counter metrics
Calculations for reports
Retrieving metric data
Querying metric data
Inspecting Network Performance
▶︎
Monitoring Services
▶︎
Monitoring Service Status
Viewing the URLs of the Client Configuration Files
Viewing the Status of a Service Instance
Viewing the Health and Status of a Role Instance
Viewing the Maintenance Mode Status of a Cluster
▶︎
Viewing Service Status
Viewing Past Status
Status Summary
Service Summary
Health Tests and Health History
▶︎
Viewing Service Instance Details
Role Instance Reference
▶︎
Viewing Role Instance Status
The Actions Menu
Viewing Past Status
Summary
Health Tests and Health History
Status Summary
Charts
The Processes Tab
Running Diagnostic Commands for Roles
▶︎
Periodic Stacks Collection
Configuring Periodic Stacks Collection
Viewing and Downloading Stacks Logs
▶︎
Viewing Running and Recent Commands
Viewing Running and Recent Commands For a Cluster
Viewing Running and Recent Commands for a Service or Role
Command Details
▶︎
Monitoring Hosts
Viewing All Hosts
Role Assignments
Viewing the Disks Overview
Viewing the Hosts in a Cluster
Viewing Individual Hosts
▶︎
Host Details
Viewing Host Details
Status
Processes
Resources
Commands
Configuration
Components
Audits
Charts Library
▶︎
Host Inspector
Running the Host Inspector
Viewing Past Host Inspector Results
▶︎
Monitoring Activities
Selecting Columns to Show in the Activities List
Sorting the Activities List
Filtering the Activities List
Activity Charts
Viewing the Jobs in a Pig, Oozie, or Hive Activity
▶︎
Task Attempts
Viewing a Job's Task Attempts
Selecting Columns to Show in the Tasks List
Sorting the Tasks List
Filtering the Tasks List
Viewing Activity Details in a Report Format
Comparing Similar Activities
▶︎
Viewing the Distribution of Task Attempts
The Task Distribution Chart
TaskTracker Hosts
▶︎
Monitoring Impala Queries
Viewing Queries
Configuring Impala Query Monitoring
Impala Best Practices
Results Tab
Filtering Queries
Filter Expressions
Filter Attributes
Choosing and Running a Filter
Query Details
▶︎
Monitoring YARN Applications
Viewing Jobs
Configuring YARN Application Monitoring
Results Tab
Filtering Jobs
Filter Expressions
Choosing and Running a Filter
Filter Attributes
Sending Diagnostic Data to Cloudera for YARN Applications
▶︎
Monitoring Spark Applications
Viewing and Debugging Spark Applications Using Logs
Managing Spark Driver Logs
Visualizing Spark Applications Using the Web Application UI
Accessing the Web UI of a Running Spark Application
Accessing the Web UI of a Completed Spark Application
▶︎
Events
Viewing Events
▶︎
Filtering Events
Adding an Event Filter
Removing an Event Filter
▶︎
Alerts
Managing Alerts
Configuring Alert Email Delivery
Configuring Alert SNMP Delivery
▶︎
Configuring Custom Alert Scripts
Sample Custom Alert Script
Enabling Configuration Change Alerts
Enabling HBase Alerts
Enabling Health Alerts
Modifying the Health Threshold
Configuring Alerts Transitioning Out of Alerting Health Threshold
Configuring Log Alerts
Configuring Alert Delivery
▶︎
Triggers
Creating a Trigger Using the Expression Editor
Editing, Deleting, Suppressing, or Deleting a Trigger
▶︎
Cloudera Manager Trigger Use Cases
Creating a Trigger for Memory Capacity
Creating a Trigger for CPU Capacity
▶︎
Lifecycle and Security Auditing
Viewing Audit Events
▶︎
Filtering Audit Events
Adding a Filter
Removing a Filter
Downloading Audit Events
▶︎
Charting Time-Series Data
Terminology
Building a Chart with Time-Series Data
Configuring Time-Series Query Results
Using Context-Sensitive Variables in Charts
▶︎
Chart Properties
Changing the Chart Type
Grouping (Faceting) Time Series
Displaying Chart Details
Editing a Chart
Saving a Chart
Obtaining Time-Series Data Using the API
▶︎
Dashboards
Dashboard Types
Creating a Dashboard
Managing Dashboards
Configuring Dashboards
Saving Charts to Dashboards
Saving Charts to a New Dashboard
Saving Charts to an Existing Dashboard
Adding a New Chart to the Custom Dashboard
Removing a Chart from a Custom Dashboard
Moving and Resizing Charts
▶︎
tsquery Language
tsquery Syntax
Metric Expressions
Metric Expression Functions
Predicates
Discovering possible predicates
Filtering by Day of Week or Hour of Day
Time Series Attributes
Time Series Entities and their Attributes
FAQ
▶︎
Metric Aggregation
Presentation of Aggregate Data
Accessing Aggregate Statistics Through tsquery
Filtering Metrics
▶︎
Logs
Viewing Logs
Logs List
Filtering Logs
Log Details
▶︎
Viewing the Cloudera Manager Server Log
Viewing Cloudera Manager Server Logs in the Logs Page
Viewing the Cloudera Manager Server Log
▶︎
Viewing the Cloudera Manager Agent Logs
Viewing Cloudera Manager Agent Logs in the Logs Page
Viewing the Cloudera Manager Agent Log
Managing Disk Space for Log Files
▶︎
Reports
▶︎
Directory Usage Report
Accessing the Directory Usage Report
▶︎
Using the Directory Usage Report
Filters
Disk Usage Reports
▶︎
Disk Usage Reports
Viewing Current Disk Usage by User, Group, or Directory
Viewing Historical Disk Usage by User, Group, or Directory
Downloading Reports as CSV and XLS Files
Activity, Application, and Query Reports
▶︎
The File Browser
Searching Within the File System
Enabling Snapshots
Setting Quotas
Designating Directories to Include in Disk Usage Reports
Downloading HDFS Directory Access Permission Reports
Cluster Support Tokens using Cloudera Manager
▶︎
Sending Usage and Diagnostic Data to Cloudera
Configuring a Proxy Server
Managing Anonymous Usage Data Collection
Diagnostic Data Collection
Log support in Cloudera Manager for ECS cluster
Configuring the Frequency of Diagnostic Data Collection
Configuring collection of Cloudera Manager table data
Specifying the Diagnostic Data Directory
Redaction of Sensitive Information from Diagnostic Bundles
Disabling the Automatic Sending of Diagnostic Data from a Manually Triggered Collection
Manually Triggering Collection and Transfer of Diagnostic Data to Cloudera
▶︎
Troubleshooting Cluster Configuration and Operation
Solutions to Common Problems
Logs and Events
▶︎
Replication Manager
Replication Manager in CDP Private Cloud Base
Support matrix for Replication Manager on CDP Private Cloud Base
Port and network requirements for Replication Manager on CDP Private Cloud Base
▶︎
Prepare to replicate using replication policies
Cloudera license requirements for Replication Manager
Configuring SSL/TLS certificate exchange between two Cloudera Manager instances
Add source cluster as peer to use in replication policies
▶︎
Enabling replication between clusters with Kerberos authentication
Required ports in Kerberos authentication-enabled clusters for replication
Considerations for realm names to use for replication
Prepare Kerberos authentication-enabled clusters for replication
Kerberos connectivity test
Replicating from unsecure to secure clusters
▶︎
Replication of encrypted data
Encrypting data in transit between clusters
Security considerations for encrypted data during replication
Configuring heap size to replicate large directories using replication policies
Retaining logs for Replication Manager
▶︎
HDFS replication policies
▶︎
HDFS replication policy considerations
Guidelines to add or delete source data during replication job run
Improve network latency during replication job run
Performance and scalability limitations to consider for replication policies
Guidelines to use snapshot diff-based replication
HDFS replication in Sentry-enabled clusters
Specifying hosts to improve HDFS replication policy performance
Creating HDFS replication policy to replicate HDFS data
View HDFS replication policy details
View historical details for an HDFS replication policy
Monitoring the performance of HDFS replication policies
▶︎
Hive external table replication policies
▶︎
Hive replication policy considerations
Specifying hosts to improve Hive replication policy performance
Hive tables and DDL commands
Disabling replication of parameters during Hive replication
Accommodate HMS changes for Hive replication policies
Creating a Hive external table replication policy
Sentry to Ranger replication for Hive external tables
Importing Sentry privileges into Ranger policies
Replicating data to Impala clusters
Replication of Impala and Hive User Defined Functions (UDFs)
Monitoring the performance of Hive/Impala replication policies
Managing replication policies
Troubleshooting replication policies between on-premises clusters
▶︎
Snapshots
▶︎
Using snapshots with replication
Snapshot policies in Replication Manager
Creating and managing snapshot policies
Snapshots history
Hive/Impala replication using snapshots
Orphaned snapshots
▶︎
Managing HDFS snapshots in Cloudera Manager
Browse HDFS directories
Enabling and disabling HDFS snapshots
Taking and deleting HDFS snapshots
Restoring HDFS snapshots
▶︎
Use DistCp to migrate HDFS data from HDP to CDP
▶︎
Using DistCp to migrate data from secure HDP to unsecure CDP
Step 1: Enabling hdfs user to run YARN jobs
Step 2: Configuration changes on the CDP cluster
Step 3: Running the DistCp job on the HDP cluster
▶︎
Using DistCp to migrate data from secure HDP to secure CDP using DistCp
Step 1: Configuration changes on HDP and CDP clusters
Step 2: Configuring user to run YARN jobs on both the clusters
Step 3: Running DistCp job on CDP cluster
▶︎
How to: Next-Gen Storage
▶︎
Storing Data Using Ozone
▶︎
Managing storage elements by using the command-line interface
▶︎
Commands for managing volumes
Assigning administrator privileges to users
Commands for managing buckets
Commands for managing keys
▶︎
Using Ozone S3 Gateway to work with storage elements
Configuration to expose buckets under non-default volumes
REST endpoints supported on Ozone S3 Gateway
Configuring Ozone to work as a pure object store
▶︎
Access Ozone S3 Gateway using the S3A filesystem
Examples of using the S3A filesystem with Ozone S3 Gateway
Configuring Spark access for S3A
Configuring Hive access for S3A
Configuring Impala access for S3A
▶︎
Using the AWS CLI with Ozone S3 Gateway
Configuring https endpoints in Ozone S3 Gateway to work with AWS CLI
Examples of using the AWS CLI for Ozone S3 Gateway
▶︎
Accessing Ozone object store with Amazon Boto3 client
Obtaining resources to Ozone
Obtaining client to Ozone through session
▶︎
List of APIs verified
Create a bucket
List buckets
Head a bucket
Delete a bucket
Upload a file
Download a file
Head an object
Delete Objects
Multipart upload
▶︎
Working with Ozone File System (o3fs)
Setting up o3fs
▶︎
Working with ofs
Volume and bucket management using ofs
Key management using ofs
▶︎
Ozone configuration options to work with CDP components
Configuration options for Spark to work with o3fs
Configuration options to store Hive managed tables on Ozone
▶︎
Overview of the Ozone Manager in High Availability
Considerations for configuring High Availability on the Ozone Manager
▶︎
Ozone Manager nodes in High Availability
Read and write requests with Ozone Manager in High Availability
▶︎
Overview of Storage Container Manager in High Availability
Considerations for configuring High Availability on Storage Container Manager
Storage Container Manager operations in High Availability
Offloading Application Logs to Ozone
▶︎
Removing Ozone DataNodes from the cluster
Decommissioning Ozone DataNodes
Placing Ozone DataNodes in offline mode
Configuring the number of storage container copies for a DataNode
Recommissioning an Ozone DataNode
Multi-Raft configuration for efficient write performances
▶︎
Working with the Recon web user interface
Access the Recon web user interface
▶︎
Elements of the Recon web user interface
Overview page
DataNodes page
Pipelines page
Missing Containers page
Configuring Ozone to work with Prometheus
Ozone trash overview
Configuring the Ozone trash checkpoint values
▶︎
Configuring Ozone Security
Using Ranger with Ozone
▶︎
Kerberos configuration for Ozone
Security tokens in Ozone
Kerberos principal and keytab properties for Ozone service daemons
Securing DataNodes
Configure S3 credentials for working with Ozone
Configuring custom Kerberos principal for Ozone
Configuring Transparent Data Encryption for Ozone
Configuring TLS/SSL encryption manually for Ozone
Configuration for enabling mTLS in Ozone
▶︎
Configuring security for Storage Container Managers in High Availability
Considerations for enabling SCM HA security
▶︎
Configuring Ozone
Performance tuning for Ozone
▶︎
How to: Storage
▶︎
Managing Data Storage
▶︎
Optimizing data storage
▶︎
Balancing data across disks of a DataNode
▶︎
Plan the data movement across disks
Parameters to configure the Disk Balancer
Run the Disk Balancer plan
Disk Balancer commands
▶︎
Erasure coding overview
Understanding erasure coding policies
Comparing replication and erasure coding
Best practices for rack and node setup for EC
Prerequisites for enabling erasure coding
Limitations of erasure coding
Using erasure coding for existing data
Using erasure coding for new data
Advanced erasure coding configuration
Erasure coding CLI command
Erasure coding examples
▶︎
Increasing storage capacity with HDFS compression
Enable GZipCodec as the default compression codec
Use GZipCodec with a one-time job
▶︎
Set HDFS quotas
Setting HDFS quotas in Cloudera Manager
▶︎
Configuring heterogeneous storage in HDFS
HDFS storage types
HDFS storage policies
Commands for configuring storage policies
Set up a storage policy for HDFS
Set up SSD storage using Cloudera Manager
Configure archival storage
The HDFS mover command
▶︎
Balancing data across an HDFS cluster
Why HDFS data becomes unbalanced
▶︎
Configurations and CLI options for the HDFS Balancer
Properties for configuring the Balancer
Balancer commands
Recommended configurations for the Balancer
▶︎
Configuring and running the HDFS balancer using Cloudera Manager
Configuring the balancer threshold
Configuring concurrent moves
Recommended configurations for the balancer
Running the balancer
Configuring block size
▶︎
Cluster balancing algorithm
Storage group classification
Storage group pairing
Block move scheduling
Block move execution
Exit statuses for the HDFS Balancer
HDFS
▶︎
Optimizing performance
▶︎
Improving performance with centralized cache management
Benefits of centralized cache management in HDFS
Use cases for centralized cache management
Centralized cache management architecture
Caching terminology
Properties for configuring centralized caching
Commands for using cache pools and directives
▶︎
Specifying racks for hosts
Viewing racks assigned to cluster hosts
Editing rack assignments for hosts
▶︎
Customizing HDFS
Customize the HDFS home directory
Properties to set the size of the NameNode edits directory
▶︎
Optimizing NameNode disk space with Hadoop archives
Overview of Hadoop archives
Hadoop archive components
Create a Hadoop archive
List files in Hadoop archives
Format for using Hadoop archives with MapReduce
▶︎
Detecting slow DataNodes
Enable disk IO statistics
Enable detection of slow DataNodes
▶︎
Allocating DataNode memory as storage
HDFS storage types
LAZY_PERSIST memory storage policy
Configure DataNode memory as storage
▶︎
Improving performance with short-circuit local reads
Prerequisites for configuring short-ciruit local reads
Properties for configuring short-circuit local reads on HDFS
▶︎
Configure mountable HDFS
Add HDFS system mount
Optimize mountable HDFS
Configuring Proxy Users to Access HDFS
▶︎
Using DistCp to copy files
Using DistCp
Distcp syntax and examples
Using DistCp with Highly Available remote clusters
▶︎
Using DistCp with Amazon S3
Using a credential provider to secure S3 credentials
Examples of DistCp commands using the S3 protocol and hidden credentials
Kerberos setup guidelines for Distcp between secure clusters
▶︎
Distcp between secure clusters in different Kerberos realms
Configure source and destination realms in krb5.conf
Configure HDFS RPC protection
Specify truststore properties
Set HADOOP_CONF to the destination cluster
Launch distcp
Copying data between a secure and an insecure cluster using DistCp and WebHDFS
Post-migration verification
Using DistCp between HA clusters using Cloudera Manager
▶︎
Using the NFS Gateway for accessing HDFS
Install the NFS Gateway
Configure the NFS Gateway
▶︎
Start and stop the NFS Gateway services
Start the NFS Gateway services
Stop the NFS Gateway services
Verify validity of the NFS services
▶︎
Access HDFS from the NFS Gateway
How NFS Gateway authenticates and maps users
▶︎
APIs for accessing HDFS
Set up WebHDFS on a secure cluster
▶︎
Using HttpFS to provide access to HDFS
Add the HttpFS role
Using Load Balancer with HttpFS
▶︎
HttpFS authentication
Use curl to access a URL protected by Kerberos HTTP SPNEGO
▶︎
Data storage metrics
Using JMX for accessing HDFS metrics
HDFS Metrics
▶︎
Using HdfsFindTool to find files
Downloading Hdfsfindtool from the CDH archives
▶︎
Configuring Data Protection
▶︎
Data protection
▶︎
Backing up HDFS metadata
▶︎
Introduction to HDFS metadata files and directories
▶︎
Files and directories
NameNodes
JournalNodes
DataNodes
▶︎
HDFS commands for metadata files and directories
Configuration properties
▶︎
Back up HDFS metadata
Prepare to back up the HDFS metadata
Backing up NameNode metadata
Back up HDFS metadata using Cloudera Manager
Restoring NameNode metadata
Restore HDFS metadata from a backup using Cloudera Manager
Perform a backup of the HDFS metadata
▶︎
Configuring HDFS trash
Trash behavior with HDFS Transparent Encryption enabled
Enabling and disabling trash
Setting the trash interval
▶︎
Using HDFS snapshots for data protection
Considerations for working with HDFS snapshots
Enable snapshot creation on a directory
Create snapshots on a directory
Recover data from a snapshot
Options to determine differences between contents of snapshots
CLI commands to perform snapshot operations
▶︎
Managing snapshot policies using Cloudera Manager
Create a snapshot policy
Edit or delete a snapshot policy
Enable and disable snapshot creation using Cloudera Manager
Create snapshots using Cloudera Manager
Delete snapshots using Cloudera Manager
Preventing inadvertent deletion of directories
▶︎
Accessing Cloud Data
Cloud storage connectors overview
The Cloud Storage Connectors
▶︎
Working with Amazon S3
Limitations of Amazon S3
▶︎
Configuring Access to S3
Configuring Access to S3 on CDP Public Cloud
▶︎
Configuring Access to S3 on Cloudera Private Cloud Base
Using Configuration Properties to Authenticate
Using Per-Bucket Credentials to Authenticate
Using Environment Variables to Authenticate
Using EC2 Instance Metadata to Authenticate
Referencing S3 Data in Applications
▶︎
Configuring Per-Bucket Settings
Customizing Per-Bucket Secrets Held in Credential Files
Configuring Per-Bucket Settings to Access Data Around the World
▶︎
Encrypting Data on S3
▶︎
SSE-S3: Amazon S3-Managed Encryption Keys
Enabling SSE-S3
▶︎
SSE-KMS: Amazon S3-KMS Managed Encryption Keys
Enabling SSE-KMS
IAM Role permissions for working with SSE-KMS
▶︎
SSE-C: Server-Side Encryption with Customer-Provided Encryption Keys
Enabling SSE-C
Configuring Encryption for Specific Buckets
Encrypting an S3 Bucket with Amazon S3 Default Encryption
Performance Impact of Encryption
▶︎
Safely Writing to S3 Through the S3A Committers
Introducing the S3A Committers
Configuring Directories for Intermediate Data
Using the Directory Committer in MapReduce
Verifying That an S3A Committer Was Used
Cleaning up after failed jobs
Using the S3Guard Command to List and Delete Uploads
▶︎
Advanced Committer Configuration
Enabling Speculative Execution
Using Unique Filenames to Avoid File Update Inconsistency
Speeding up Job Commits by Increasing the Number of Threads
Securing the S3A Committers
The S3A Committers and Third-Party Object Stores
Limitations of the S3A Committers
Troubleshooting the S3A Committers
Security Model and Operations on S3
S3A and Checksums (Advanced Feature)
A List of S3A Configuration Properties
Working with versioned S3 buckets
Working with Third-party S3-compatible Object Stores
▶︎
Improving Performance for S3A
Working with S3 buckets in the same AWS region
▶︎
Configuring and tuning S3A block upload
Tuning S3A Uploads
Thread Tuning for S3A Data Upload
Optimizing S3A read performance for different file types
S3 Performance Checklist
Troubleshooting S3
▶︎
Working with Google Cloud Storage
▶︎
Configuring Access to Google Cloud Storage
Create a GCP Service Account
Create a Custom Role
Modify GCS Bucket Permissions
Configure Access to GCS from Your Cluster
Additional Configuration Options for GCS
▶︎
Working with the ABFS Connector
▶︎
Introduction to Azure Storage and the ABFS Connector
Feature Comparisons
Setting up and configuring the ABFS connector
▶︎
Configuring the ABFS Connector
▶︎
Authenticating with ADLS Gen2
Configuring Access to Azure on CDP Public Cloud
Configuring Access to Azure on Cloudera Private Cloud Base
ADLS Proxy Setup
▶︎
Performance and Scalability
Hierarchical namespaces vs. non-namespaces
Flush options
▶︎
Using ABFS using CLI
Hadoop File System commands
Create a table in Hive
Accessing Azure Storage account container from spark-shell
Copying data with Hadoop DistCp
DistCp and Proxy Settings
ADLS Trash Folder Behavior
Troubleshooting ABFS
▶︎
Configuring HDFS ACLs
HDFS ACLs
Configuring ACLs on HDFS
Using CLI commands to create and list ACLs
ACL examples
ACLS on HDFS features
Use cases for ACLs on HDFS
▶︎
Enable authorization for HDFS web UIs
Enable authorization for additional HDFS web UIs
Configuring HSTS for HDFS Web UIs
▶︎
Configuring Fault Tolerance
▶︎
High Availability on HDFS clusters
▶︎
Configuring HDFS High Availability
NameNode architecture
Preparing the hardware resources for HDFS High Availability
▶︎
Using Cloudera Manager to manage HDFS HA
Enabling HDFS HA
Prerequisites for enabling HDFS HA using Cloudera Manager
Enabling High Availability and automatic failover
Disabling and redeploying HDFS HA
▶︎
Configuring other CDP components to use HDFS HA
Configuring HBase to use HDFS HA
Configuring the Hive Metastore to use HDFS HA
Configuring Impala to work with HDFS HA
Configuring oozie to use HDFS HA
Changing a nameservice name for Highly Available HDFS using Cloudera Manager
Manually failing over to the standby NameNode
Additional HDFS haadmin commands to administer the cluster
Turning safe mode on HA NameNodes
Converting from an NFS-mounted shared edits directory to Quorum-Based Storage
Administrative commands
▶︎
Configuring Apache Kudu
▶︎
Configure Kudu processes
Experimental flags
Configuring the Kudu master
Configuring tablet servers
Rack awareness (Location awareness)
▶︎
Directory configurations
Changing directory configuration
▶︎
Managing Apache Kudu
▶︎
Limitations
Server management limitations
Cluster management limitations
Start and stop Kudu processes
▶︎
Orchestrate a rolling restart with no downtime
Minimize cluster distruption during planned downtime
▶︎
Kudu web interfaces
Kudu master web interface
Kudu tablet server web interface
Common web interface pages
Best practices when adding new tablet servers
Decommission or remove a tablet server
Use cluster names in the kudu command line tool
Migrate data on the same host
▶︎
Migrate to multiple Kudu masters
Prepare for the migration
Perform the migration
▶︎
Change master hostnames
Prepare for master hostname changes
Perform master hostname changes
▶︎
Remove Kudu masters
Prepare for removal
Perform the removal
▶︎
Run the tablet rebalancing tool
Run a tablet rebalancing tool on a rack-aware cluster
Run a tablet rebalancing tool in Cloudera Manager
Run a tablet rebalancing tool in command line
▶︎
Managing Apache Kudu Security
Kudu security considerations
Kudu security limitations
▶︎
Kudu authentication
Kudu authentication with Kerberos
Kudu authentication tokens
Client authentication to secure Kudu clusters
Kudu coarse-grained authorization
▶︎
Kudu fine-grained authorization
Kudu and Apache Ranger integration
Kudu authorization tokens
Specifying trusted users
Kudu authorization policies
Ranger policies for Kudu
Disabling redaction
▶︎
Configuring a secure Kudu cluster using Cloudera Manager
Enabling Kerberos authentication and RPC encryption
Configuring custom Kerberos principal for Kudu
Configuring coarse-grained authorization with ACLs
Configuring TLS/SSL encryption for Kudu using Cloudera Manager
Enabling Ranger authorization
Configuring HTTPS encryption
▶︎
Backing up and Recovering Apache Kudu
▶︎
Kudu backup
Back up tables
Backup tools
Generate a table list
Backup directory structure
Physical backups of an entire node
▶︎
Kudu recovery
Restore tables from backups
Recover from disk failure
Recover from full disks
Bring a tablet that has lost a majority of replicas back online
Rebuild a Kudu filesystem layout
▶︎
Developing Applications with Apache Kudu
View the API documentation
Kudu example applications
Kudu Python client
▶︎
Kudu integration with Spark
Spark integration known issues and limitations
Spark integration best practices
Upsert option in Kudu Spark
Use Spark with a secure Kudu cluster
Spark tuning
▶︎
Using Hive Metastore with Apache Kudu
Integrating the Hive Metastore with Apache Kudu
Databases and Table Names
Administrative tools for Hive Metastore integration
Upgrading existing Kudu tables for Hive Metastore integration
Enabling the Hive Metastore integration
▶︎
Using Apache Impala with Apache Kudu
▶︎
Understanding Impala integration with Kudu
Impala database containment model
Internal and external Impala tables
Verifying the Impala dependency on Kudu
Impala integration limitations
▶︎
Using Impala to query Kudu tables
Query an existing Kudu table from Impala
Create a new Kudu table from Impala
Use CREATE TABLE AS SELECT
▶︎
Partitioning tables
Basic partitioning
Advanced partitioning
Non-covering range partitions
Partitioning guidelines
Optimize performance for evaluating SQL predicates
Insert data
INSERT and primary key uniqueness violations
Update data
Upsert a row
Alter a table
Delete data
Failures during INSERT, UPDATE, UPSERT, and DELETE operations
Drop a Kudu table
▶︎
Monitoring Apache Kudu
▶︎
Kudu metrics
Listing available metrics
Collecting metrics through HTTP
Diagnostics logging
Monitor cluster health with ksck
Report craches using breakpad
Enable core dump
Use the Charts Library
▶︎
How to: Compute
▶︎
Using YARN Web UI and CLI
Access the YARN Web User Interface
View Cluster Overview
View Nodes and Node Details
View Queues and Queue Details
▶︎
View All Applications
Search applications
View application details
UI Tools
Use the YARN CLI to View Logs for Applications
▶︎
Configuring Apache Hadoop YARN Security
Linux Container Executor
▶︎
Managing Access Control Lists
YARN ACL rules
YARN ACL syntax
▶︎
YARN ACL types
Admin ACLs
Queue ACLs
▶︎
Application ACLs
Application ACL evaluation
MapReduce Job ACLs
Spark Job ACLs
Application logs' ACLs
▶︎
Configure TLS/SSL for Core Hadoop Services
Configure TLS/SSL for HDFS
Configure TLS/SSL for YARN
Enable HTTPS communication
Configure Cross-Origin Support for YARN UIs and REST APIs
Configure YARN Security for Long-Running Applications
Enabling custom Kerberos principal support in YARN
Enabling custom Kerberos principal support in a Queue Manager cluster
▶︎
Configuring Apache Hadoop YARN High Availability
▶︎
YARN ResourceManager High Availability
YARN ResourceManager high availability architecture
Configure YARN ResourceManager high availability
Use the yarn rmadmin tool to administer ResourceManager high availability
Migrate ResourceManager to another host
▶︎
Work Preserving Recovery for YARN components
Configure work preserving recovery on ResourceManager
Configure work preserving recovery on NodeManager
Example: Configuration for work preserving recovery
▶︎
Managing and Allocating Cluster Resources using Capacity Scheduler
▶︎
Resource Scheduling and Management
YARN resource allocation of multiple resource-types
Hierarchical queue characteristics
Scheduling among queues
Application reservations
Resource distribution workflow
Resource allocation overview
▶︎
Use CPU scheduling
Configure CPU scheduling and isolation
Use CPU scheduling with distributed shell
▶︎
Use GPU scheduling
Configure GPU scheduling and isolation
Use GPU scheduling with distributed shell
▶︎
Use FPGA scheduling
Configure FPGA scheduling and isolation
Use FPGA with distributed shell
▶︎
Limit CPU usage with Cgroups
Use Cgroups
Enable Cgroups
▶︎
Manage Queues
Prerequisite
Add queues using YARN Queue Manager UI
Configure cluster capacity with queues
Configuring the resource capacity of root queue
Change resource allocation mode
Start and stop queues
Delete queues
▶︎
Configure Scheduler Properties at the Global Level
Setting global maximum application priority
Configure preemption
Enabling Intra-Queue preemption
Enabling LazyPreemption
Set global application limits
Set default Application Master resource limit
Enable asynchronous scheduler
Configuring queue mapping to use the user name from the application tag using Cloudera Manager
Configure NodeManager heartbeat
Configure data locality
▶︎
Configure Per Queue Properties
Set user limits within a queue
Set Maximum Application limit for a specific queue
Set Application-Master resource-limit for a specific queue
Control access to queues using ACLs
Enable preemption for a specific queue
Enable Intra-Queue Preemption for a specific queue
Configure dynamic queue properties
▶︎
Set Ordering policies within a specific queue
Configure queue ordering policies
▶︎
Dynamic Queue Scheduling [Technical Preview]
Enabling the Dynamic Queue Scheduling feature
Creating a new Dynamic Configuration
Managing Dynamic Configurations
How to read the Schedule table
▶︎
Manage placement rules
Placement rule policies
How to read the Placement Rules table
▶︎
Create placement rules
Example - Placement rules creation
Reorder placement rules
Delete placement rules
Enable override of default queue mappings
▶︎
Manage dynamic queues
Managed Parent Queues
Converting a queue to a Managed Parent Queue
Enabling dynamic child creation in weight mode
Managing dynamic child creation enabled parent queues
Managing dynamically created child queues
Disabling auto queue deletion
Deleting dynamically created child queues
▶︎
Configure Partitions
Enable node label on a cluster to configure partition
Create partitions
Assign or unassign a node to a partition
View partitions
Associate partitions with queues
Disassociate partitions from queues
Deleting partitions
Use partitions when submitting a job
Provide Read-only access to Queue Manager UI
▶︎
Managing Apache Hadoop YARN Services
Configure YARN Services API to Manage Long-running Applications
Configure YARN Services using Cloudera Manager
Migrating database configuration to a new location
▶︎
Running YARN Services
Deploy and manage services on YARN
Launch a YARN service
Save a YARN service definition
▶︎
Create new YARN services using UI
Create a standard YARN service
Create a custom YARN service
Manage the YARN service life cycle through the REST API
YARN services API examples
▶︎
Managing YARN Docker Containers
▶︎
Configuring YARN Docker Containers Support
Prerequisites for installing Docker
Recommendations for managing Docker containers on YARN
Install Docker
Configure Docker
Configure YARN for managing Docker containers
Docker on YARN configuration properties
▶︎
Running Dockerized Applications on YARN
Docker on YARN example: MapReduce job
Docker on YARN example: DistributedShell
Docker on YARN example: Spark-on-Docker-on-YARN
▶︎
Configuring Apache Hadoop YARN Log Aggregation
YARN Log Aggregation Overview
Log Aggregation File Controllers
Configure Log Aggregation
Log Aggregation Properties
Configure Debug Delay
▶︎
Managing Apache ZooKeeper
Add a ZooKeeper service
Use multiple ZooKeeper services
Replace a ZooKeeper disk
Replace a ZooKeeper role with ZooKeeper service downtime
Replace a ZooKeeper role without ZooKeeper service downtime
Replace a ZooKeeper role on an unmanaged cluster
Confirm the election status of a ZooKeeper service
▶︎
Configuring Apache ZooKeeper
Enable the AdminServer
Configure four-letter-word commands in ZooKeeper
▶︎
Managing Apache ZooKeeper Security
▶︎
ZooKeeper Authentication
Configure ZooKeeper server for Kerberos authentication
Configure ZooKeeper client shell for Kerberos authentication
Verify the ZooKeeper authentication
Enable server-server mutual authentication
Use Digest Authentication Provider
Configure ZooKeeper TLS/SSL using Cloudera Manager
▶︎
ZooKeeper ACLs Best Practices
ZooKeeper ACLs Best Practices: Atlas
ZooKeeper ACLs Best Practices: Cruise Control
ZooKeeper ACLs Best Practices: HBase
ZooKeeper ACLs Best Practices: HDFS
ZooKeeper ACLs Best Practices: Kafka
ZooKeeper ACLs Best Practices: Oozie
ZooKeeper ACLs Best Practices: Ranger
ZooKeeper ACLs best practices: Search
ZooKeeper ACLs Best Practices: YARN
ZooKeeper ACLs Best Practices: ZooKeeper
▶︎
How to: Data Access
▶︎
Using Data Analytics Studio
Compose queries
▶︎
Manage queries
Searching queries
Refining query search using filters
Saving the search results
Compare queries
▶︎
View query details
Viewing the query recommendations
Viewing the query details
Viewing the visual explain for a query
Viewing the Hive configurations for a query
Viewing the query timeline
Viewing the task-level DAG information
Viewing the DAG flow
Viewing the DAG counters
Viewing the Tez configurations for a query
▶︎
Manage databases and tables
Using the Database Explorer
Searching tables
Managing tables
Creating tables
Uploading tables
Editing tables
Deleting tables
Managing columns
Managing partitions
Viewing storage information
Viewing detailed information
Viewing table and column statistics
Previewing tables using Data Preview
▶︎
Manage reports
Viewing the Read and Write report
Viewing the Join report
▶︎
DAS administration using Cloudera Manager in CDP
Running a query on a different Hive instance
Modifying the session cookie timeout value
▶︎
Configuring user authentication
Configuring user authentication using SPNEGO
Configuring user authentication using LDAP
Configuring TLS/SSL encryption manually for DAS using Cloudera Manager
Cleaning up old queries, DAG information, and reports data
Disabling the reporting feature
▶︎
DAS administration using Ambari in CDP
Running a query on a different Hive instance
Cleaning up old queries, DAG information, and reports data using Ambari
Creating system tables to run query on Hive and Tez DAG events
Changing the retention period of DAS event logs
▶︎
Working with Apache Hive Metastore
HMS table storage
Configuring HMS for high availability
HWC authorization
Authorizing external tables
Configure HMS properties for authorization
Filter HMS results
▶︎
Setting up the metastore database
▶︎
Setting up the backend Hive metastore database
Set up MariaDB or MySQL database
Set up a PostgreSQL database
Set up an Oracle database
Configuring metastore database properties
Configuring metastore location
Setting up a JDBC URL connection override
Tuning the metastore
Creating a view from Spark
▶︎
Starting Apache Hive
Starting Hive on an insecure cluster
Starting Hive using a password
Running a Hive command
Converting Hive CLI scripts to Beeline
▶︎
Using Apache Hive
▶︎
Apache Hive 3 tables
Locating Hive tables and changing the location
Refer to a table using dot notation
Creating a CRUD transactional table
Creating an insert-only transactional table
Creating, using, and dropping an external table
Creating an Ozone-based external table
Accessing Hive files in Ozone
Recommended Hive configurations when using Ozone
Dropping an external table along with data
Converting a managed non-transactional table to external
Using constraints
Determining the table type
Apache Hive 3 ACID transactions
▶︎
Apache Hive query basics
Querying the information_schema database
Inserting data into a table
Updating data in a table
Merging data in tables
Deleting data from a table
▶︎
Creating a temporary table
Configuring temporary table storage
▶︎
Using a subquery
Subquery restrictions
Use wildcards with SHOW DATABASES
Aggregating and grouping data
Querying correlated data
▶︎
Using common table expressions
Use a CTE in a query
Comparing tables using ANY/SOME/ALL
Escaping an invalid identifier
CHAR data type support
ORC vs Parquet formats
Creating a default directory for managed tables
Generating surrogate keys
▶︎
Partitions and performance
Creating partitions dynamically
▶︎
Partition refresh and configuration
Automating partition discovery and repair
Repairing partitions manually using MSCK repair
Managing partition retention time
▶︎
Query scheduling
Enabling scheduled queries
Enabling all scheduled queries
Periodically rebuilding a materialized view
Getting scheduled query information and monitor the query
▶︎
Materialized views
▶︎
Creating and using a materialized view
Creating the tables and view
Verifing use of a query rewrite
Using optimizations from a subquery
Dropping a materialized view
Showing materialized views
Describing a materialized view
Managing query rewrites
Purposely using a stale materialized view
Creating and using a partitioned materialized view
Using JdbcStorageHandler to query RDBMS
▶︎
Using functions
Reloading, viewing, and filtering functions
▶︎
Create a user-defined function
Setting up the development environment
Creating the UDF class
Building the project and upload the JAR
Registering the UDF
Calling the UDF in a query
▶︎
Managing Apache Hive
▶︎
ACID operations
Configuring partitions for transactions
Viewing transactions
Viewing transaction locks
▶︎
Data compaction
Compaction prerequisites
Compaction tasks
Initiating automatic compaction in Cloudera Manager
Starting compaction manually
Viewing compaction progress
Disabling automatic compaction
Configuring compaction using table properties
Configuring compaction in Cloudera Manager
Configuring the compaction check interval
Compactor properties
▶︎
Query vectorization
Query vectorization properties
Checking query execution
Tracking Hive on Tez query execution
Tracking an Apache Hive query in YARN
Application not running message
▶︎
Configuring Apache Hive
Configuring legacy CREATE TABLE behavior
Limiting concurrent connections
Hive on Tez configurations
Configuring HiveServer high availability using Dynamic Service Discovery
▶︎
Configuring HiveServer high availability using a load balancer
Configuring the Hive Delegation Token Store
Adding a HiveServer role
Configuring the HiveServer load balancer
Achieving cross-cluster availability through Hive Load Balancer failover
▶︎
Generating statistics
Setting up the cost-based optimizer and statistics
Generating and viewing Apache Hive statistics
Statistics generation and viewing commands
Removing scratch directories
▶︎
Securing Apache Hive
Hive access authorization
Transactional table access
External table access
Accessing Hive files in Ozone
▶︎
Configuring access to Hive on YARN
Configuring HiveServer for ETL using YARN queues
Managing YARN queue users
Configuring queue mapping to use the user name from the application tag using Cloudera Manager
Disabling impersonation (doas)
Connecting to an Apache Hive endpoint through Apache Knox
HWC authorization
▶︎
Hive authentication
Securing HiveServer using LDAP
Client connections to HiveServer
Pluggable authentication modules in HiveServer
JDBC connection string syntax
▶︎
Communication encryption
Enabling TLS/SSL for HiveServer
Enabling SASL in HiveServer
▶︎
Securing an endpoint under AutoTLS
Securing Hive metastore
Activating the Hive web UI
▶︎
Integrating Apache Hive with Apache Spark and BI
▶︎
Hive Warehouse Connector for accessing Apache Spark data
Set up
HWC limitations
▶︎
Reading data through HWC
Direct Reader mode introduction
Using Direct Reader mode
Direct Reader configuration properties
Direct Reader limitations
Secure access mode introduction
Setting up secure access mode
Using secure access mode
JDBC read mode introduction
Using JDBC read mode
JDBC mode configuration properties
JDBC mode limitations
Kerberos configurations for HWC
Writing data through HWC
Apache Spark executor task statistics
▶︎
HWC and DataFrame APIs
HWC and DataFrame API limitations
HWC supported types mapping
Catalog operations
Read and write operations
Committing a transaction for Direct Reader
Closing HiveWarehouseSession operations
Using HWC for streaming
HWC API Examples
Hive Warehouse Connector Interfaces
Submitting a Scala or Java application
Examples of writing data in various file formats
▶︎
HWC integration with pyspark, sparklyr, and Zeppelin
Submitting a Python app
Reading and writing Hive tables in R
Livy interpreter configuration
Reading and writing Hive tables in Zeppelin
▶︎
Apache Hive-Kafka integration
Creating a table for a Kafka stream
▶︎
Querying Kafka data
Querying live data from Kafka
Perform ETL by ingesting data from Kafka into Hive
▶︎
Writing data to Kafka
Writing transformed Hive data to Kafka
Setting consumer and producer table properties
Kafka storage handler and table properties
▶︎
Connecting Hive to BI tools using a JDBC/ODBC driver
Getting the JDBC driver
Getting the ODBC driver
Integrating Hive and a BI tool
Specify the JDBC connection string
JDBC connection string syntax
Using JdbcStorageHandler to query RDBMS
Setting up JDBCStorageHandler for Postgres
▶︎
Apache Hive Performance Tuning
Query results cache
Best practices for performance tuning
▶︎
ORC file format
Advanced ORC properties
Performance improvement using partitions
Bucketed tables in Hive
▶︎
Migrating Data Using Sqoop
Data migration to Apache Hive
Setting Up Sqoop
Atlas Hook for Sqoop
▶︎
Imports into Hive
Creating a Sqoop import command
Importing RDBMS data into Hive
▶︎
HDFS to Apache Hive data migration
Importing RDBMS data to HDFS
Converting an HDFS file to ORC
Incrementally updating an imported table
Import command options
▶︎
Starting and Stopping Apache Impala
Modifying Impala Startup Options
▶︎
Configuring Client Access to Impala
▶︎
Impala Shell Tool
Impala Shell Configuration Options
Impala Shell Configuration File
Connecting to Impala Daemon in Impala Shell
Running Commands and SQL Statements in Impala Shell
Impala Shell Command Reference
Configuring ODBC for Impala
Configuring JDBC for Impala
Configuring Impyla for Impala
Configuring Delegation for Clients
Spooling Query Results
Shut Down Impala
▶︎
Setting Timeouts in Impala
Setting Timeout and Retries for Thrift Connections to Backend Client
Increasing StateStore Timeout
Setting the Idle Query and Idle Session Timeouts
▶︎
Securing Apache Impala
▶︎
Securing Impala
Configuring Impala TLS/SSL
▶︎
Impala Authentication
Configuring Kerberos Authentication
▶︎
Configuring LDAP Authentication
Enabling LDAP for in Hue
Enabling LDAP Authentication for impala-shell
▶︎
Impala Authorization
Configuring Authorization
Row-level filtering in Impala with Ranger policies
▶︎
Configuring Apache Impala
Configuring Impala
Configuring Load Balancer for Impala
▶︎
Tuning Apache Impala
Setting Up HDFS Caching
Setting up Data Cache for Remote Reads
Configuring Dedicated Coordinators and Executors
▶︎
Managing Apache Impala
▶︎
Managing Resources in Impala
Admission Control and Query Queuing
Enabling Admission Control
Creating Static Pools
Configuring Dynamic Resource Pool
Dynamic Resource Pool Settings
Admission Control Sample Scenario
Cancelling a Query
▶︎
Managing Metadata in Impala
On-demand Metadata
Automatic Invalidation of Metadata Cache
▶︎
Automatic Invalidation/Refresh of Metadata
Configuring Event Based Automatic Metadata Sync
▶︎
Monitoring Apache Impala
▶︎
Impala Logs
Managing Logs
Impala lineage
▶︎
Web User Interface for Debugging
Debug Web UI for Impala Daemon
Debug Web UI for StateStore
Debug Web UI for Catalog Server
Configuring Impala Web UI
▶︎
Using Hue
Using Hue
Enabling the SQL editor autocompleter
▶︎
Using governance-based data discovery
Searching metadata tags
List of supported non-alphanumeric characters for file and directory names in Hue
Options to rerun Oozie workflows in Hue
▶︎
Administering Hue
Reference architecture
Hue configuration files
Hue configurations in CDP Runtime
Hue Advanced Configuration Snippet
▶︎
Hue logs
Standard stream logs
Hue service Django logs
Enabling DEBUG
Enabling httpd log rotation for Hue
Hue supported browsers
Adding a Hue service with Cloudera Manager
Adding a Hue role instance with Cloudera Manager
▶︎
Customizing the Hue web UI
Adding a custom banner in Hue
Changing the page logo in Hue
Adding a splash screen in Hue
Setting the cache timeout
Enabling or disabling anonymous usage date collection
Configuring the number of objects displayed in Hue
▶︎
Using Oracle database with Hue
Installing and configuring the Oracle server
(Optional) Configuring the character set
Creating Hue Schema in Oracle database
Downloading, staging, and activating the Oracle Instant Client parcel
(Optional) Upgrading cx_Oracle to 6.4.1
Configuring Oracle as backend database for Hue
▶︎
Using MySQL database with Hue
Downloading and installing MySQL database
Configuring MySQL server
Installing and configuring MySQL on RHEL 8
Creating the Hue database
Configuring MySQL as the backend database for Hue
Configuring TLSv1.2-enforced MySQL server
▶︎
Using MariaDB database with Hue
Downloading and installing MariaDB database
Configuring MariaDB server
Installing and configuring MariaDB on RHEL 8
Creating the Hue database
Configuring MariaDB as the backend database for Hue
▶︎
Using PostgreSQL database with Hue
Download and install PostgreSQL
Configure the PostgreSQL server
Configure PostgreSQL as the backend database for Hue
Disabling the share option in Hue
Enabling Hue applications with Cloudera Manager
Running shell commands
Downloading and exporting data from Hue
Backing up the Hue database
Enabling a multi-threaded environment for Hue
▶︎
Moving the Hue service to a different host
Migrating Hue service using Add Service wizard
Migrating Hue service by adding new role instances
Configuring timezone for Hue
▶︎
Securing Hue
▶︎
User management in Hue
Understanding Hue users and groups
Finding the list of Hue superusers
Creating a Hue user
Restricting user login
Creating a group in Hue
Managing Hue permissions
Resetting Hue user password
Assigning superuser status to an LDAP user
Configuring file and directory permissions for Hue
▶︎
User authentication in Hue
Authentication using Kerberos
▶︎
Authentication using LDAP
Import and sync LDAP users and groups
Configuring authentication with LDAP and Search Bind
Configuring authentication with LDAP and Direct Bind
Multi-server LDAP/AD autentication
Testing the LDAP configuration
Configuring group permissions
Enabling LDAP authentication with HiveServer2 and Impala
LDAP properties
Configuring LDAP on unmanaged clusters
▶︎
Authentication using SAML
Configuring SAML authentication on managed clusters
Manually configuring SAML authentication
Integrating your identity provider's SAML server with Hue
SAML properties
Troubleshooting SAML authentication
Authentication using Knox SSO
Applications and permissions reference
Securing Hue passwords with scripts
▶︎
Configuring TLS/SSL for Hue
Creating a truststore file in PEM format
Configuring Hue as a TLS/SSL client
Enabling Hue as a TLS/SSL client
Configuring Hue as a TLS/SSL server
Enabling Hue as a TLS/SSL server using Cloudera Manager
Enabling TLS/SSL for Hue Load Balancer
Enabling TLS/SSL communication with HiveServer2
Enabling TLS/SSL communication with Impala
Securing database connections with TLS/SSL
Enforcing TLS version 1.2 for Hue
Securing sessions
Specifying HTTP request methods
Restricting supported ciphers for Hue
Specifying domains or pages to which Hue can redirect users
Setting Oozie permissions
Configuring secure access between Solr and Hue
▶︎
Tuning Hue
Adding a load balancer
▶︎
Configuring high availability for Hue
Configuring Hive and Impala for high availability with Hue
Configuring for HDFS high availability
Configuring dedicated Impala coordinator
▶︎
Search Tutorial
Tutorial
▶︎
Validating the Cloudera Search deployment
Create a test collection
Index sample data
Query sample data
▶︎
Indexing sample Tweets with Cloudera Search
Create a collection for tweets
Copy sample tweets to HDFS
▶︎
Using MapReduce batch indexing to index sample Tweets
Batch indexing into online Solr servers using GoLive
Batch indexing into offline Solr shards
▶︎
Securing Cloudera Search
Cloudera Search security aspects
Configure TLS/SSL encryption for Solr
Using a load balancer
Cloudera Search authentication
▶︎
Set proxy server authentication for clusters using Kerberos
Configure Kerberos authentication for Solr
Enable Kerberos authentication in Solr
Overview of proxy usage and load balancing for Search
Configuring custom Kerberos principals and custom system users for Solr
Enable LDAP authentication in Solr
Enabling Solr clients to authenticate with a secure Solr
Creating a JAAS configuration file
Enable Ranger authorization in Solr
Configuring Ranger authorization
Enable document-level authorization
▶︎
Tuning Cloudera Search
Solr server tuning categories
Setting Java system properties for Solr
Enable multi-threaded faceting
Tuning garbage collection
Enable garbage collector logging
Solr and HDFS - the block cache
▶︎
Tuning replication
Adjust the Solr replication factor for index files stored in HDFS
▶︎
Managing Cloudera Search
▶︎
Managing collection configuration
Cloudera Search config templates
Generating collection configuration using configs
Securing configs with ZooKeeper ACLs and Ranger
Generating Solr collection configuration using instance directories
Modifying a collection configuration generated using an instance directory
Converting instance directories to configs
Cloudera Search configuration files
Using custom JAR files with Search
Retrieving the clusterstate.json file
▶︎
Managing collections
Creating a Solr collection
Viewing existing collections
Deleting all documents in a collection
Deleting a collection
Updating the schema in a collection
Creating a replica of an existing shard
Migrating Solr replicas
Backing up a collection from HDFS
Backing up a collection from local file system
Restoring a collection
Defining a backup target in solr.xml
▶︎
Cloudera Search ETL
Using Morphlines to index Avro
Using Morphlines with Syslog
▶︎
Indexing Data Using Morphlines
Indexing Data
▶︎
Near Real Time Indexing
▶︎
Lily HBase Near Real Time Indexing for Cloudera Search
Adding the Lily HBase Indexer Service
Starting the Lily HBase NRT Indexer Service
▶︎
Using the Lily HBase NRT Indexer Service
Enable Replication on HBase Column Families
Create a Collection in Cloudera Search
Creating a Lily HBase Indexer Configuration File
Creating a Morphline Configuration File
Understanding the extractHBaseCells Morphline Command
Registering a Lily HBase Indexer Configuration with the Lily HBase Indexer Service
Verifying that Indexing Works
Using the Indexer HTTP Interface
▶︎
Configuring Lily HBase Indexer Security
Configure Lily HBase Indexer to use TLS/SSL
Configure Lily HBase Indexer Service to Use Kerberos Authentication
▶︎
Batch Indexing
Spark indexing using morphlines
▶︎
MapReduce indexing
▶︎
MapReduceIndexerTool
MapReduceIndexerTool input splits
MapReduceIndexerTool metadata
MapReduceIndexerTool usage syntax
Indexing data with MapReduceIndexerTool in Solr backup format
▶︎
Lily HBase batch indexing for Cloudera Search
Populating an HBase Table
Create a Collection in Cloudera Search
Creating a Lily HBase Indexer Configuration File
Creating a Morphline Configuration File
Understanding the extractHBaseCells Morphline Command
Running HBaseMapReduceIndexerTool
HBaseMapReduceIndexerTool command line reference
Using --go-live with SSL or Kerberos
Understanding --go-live and HDFS ACLs
▶︎
Indexing Data Using Spark-Solr Connector
▶︎
Batch indexing to Solr using SparkApp framework
Create indexer Maven project
Run the spark-submit job
▶︎
How to: Operational Database
▶︎
Configuring Apache HBase
Using DNS with HBase
Use the Network Time Protocol (NTP) with HBase
Configure the graceful shutdown timeout property
▶︎
Setting user limits for HBase
Configure ulimit for HBase using Cloudera Manager
Configuring ulimit for HBase
Configure ulimit using Pluggable Authentication Modules using the Command Line
Using dfs.datanode.max.transfer.threads with HBase
Configure encryption in HBase
▶︎
Using hedged reads
Enable hedged reads for HBase
▶︎
Understanding HBase garbage collection
Configure HBase garbage collection
Disable the BoundedByteBufferPool
Configure the HBase canary
Configuring auto split policy in an HBase table
▶︎
Using HBase blocksize
Configure the blocksize for a column family
▶︎
Configuring HBase BlockCache
Contents of the BlockCache
Size the BlockCache
Decide to use the BucketCache
▶︎
About the Off-heap BucketCache
Off-heap BucketCache
BucketCache IO engine
Configure BucketCache IO engine
Configure the off-heap BucketCache using Cloudera Manager
Configure the off-heap BucketCache using the command line
Cache eviction priorities
Bypass the BlockCache
Monitor the BlockCache
▶︎
Using quota management
Configuring quotas
General Quota Syntax
▶︎
Throttle quotas
Throttle quota examples
Space quotas
Quota enforcement
Quota violation policies
▶︎
Impact of quota violation policy
Live write access
Bulk Write Access
Read access
Metrics and Insight
Examples of overlapping quota policies
Number-of-Tables Quotas
Number-of-Regions Quotas
▶︎
Using HBase scanner heartbeat
Configure the scanner heartbeat using Cloudera Manager
▶︎
Storing medium objects (MOBs)
Prerequisites
Configure columns to store MOBs
Configure the MOB cache using Cloudera Manager
Test MOB storage and retrieval performance
MOB cache properties
▶︎
Limiting the speed of compactions
Configure the compaction speed using Cloudera Manager
Enable HBase indexing
▶︎
Using HBase coprocessors
Add a custom coprocessor
Disable loading of coprocessors
▶︎
Configuring HBase MultiWAL
Configuring MultiWAL support using Cloudera Manager
▶︎
Configuring the storage policy for the Write-Ahead Log (WAL)
Configure the storage policy for WALs using Cloudera Manager
Configure the storage policy for WALs using the Command Line
▶︎
Using RegionServer grouping
Enable RegionServer grouping using Cloudera Manager
Configure RegionServer grouping
Monitor RegionServer grouping
Remove a RegionServer from RegionServer grouping
Enabling ACL for RegionServer grouping
Best practices when using RegionServer grouping
Disable RegionServer grouping
▶︎
Optimizing HBase I/O
HBase I/O components
Advanced configuration for write-heavy workloads
▶︎
Managing Apache HBase Security
▶︎
HBase authentication
Configure HBase servers to authenticate with a secure HDFS cluster
Configure secure HBase replication
Configure the HBase client TGT renewal period
HBase authorization
▶︎
Configuring TLS/SSL for HBase
Prerequisites to configure TLS/SSL for HBase
Configure TLS/SSL for HBase Web UIs
Configure TLS/SSL for HBase REST Server
Configure TLS/SSL for HBase Thrift Server
Configure HSTS for HBase Web UIs
▶︎
Accessing Apache HBase
▶︎
Use the HBase shell
Virtual machine options for HBase Shell
Script with HBase Shell
Use the HBase command-line utilities
Use the HBase APIs for Java
▶︎
Use the HBase REST server
Installing the REST Server using Cloudera Manager
Using the REST API
Using the REST proxy API
▶︎
Using the Apache Thrift Proxy API
Preparing a thrift server and client
List of Thrift API and HBase configurations
Example for using THttpClient API in secure cluster
Example for using THttpClient API in unsecure cluster
Example for using TSaslClientTransport API in secure cluster without HTTP
▶︎
Using the Apache HBase Hive integration
Configuring Hive to use with HBase
Configuring HBase Hive integration
▶︎
Configure HBase-Spark connector using Cloudera Manager
Configuring HBase-Spark connector when both are on same cluster
Configuring HBase-Spark connector when HBase is on remote cluster
Example: Using the HBase-Spark connector
▶︎
Use the Hue HBase app
Configure the HBase thrift server role
▶︎
Managing Apache HBase
▶︎
Starting and stopping HBase using Cloudera Manager
Start HBase
Stop HBase
▶︎
Graceful HBase shutdown
Gracefully shut down an HBase RegionServer
Gracefully shut down the HBase service
▶︎
Importing data into HBase
Choose the right import method
Use snapshots
Use CopyTable
▶︎
Use BulkLoad
Use cases for BulkLoad
Use cluster replication
Use Sqoop
Use Spark
Use a custom MapReduce job
▶︎
Use HashTable and SyncTable Tool
HashTable/SyncTable tool configuration
Synchronize table data using HashTable/SyncTable tool
▶︎
Writing data to HBase
Variations on Put
Versions
Deletion
Examples
▶︎
Reading data from HBase
Perform scans using HBase Shell
▶︎
HBase filtering
Dynamically loading a custom filter
Logical operators, comparison operators and comparators
Compound operators
Filter types
HBase Shell example
Java API example
HBase online merge
Move HBase Master Role to another host
Expose HBase metrics to a Ganglia server
▶︎
Configuring Apache HBase High Availability
Enable HBase high availability using Cloudera Manager
HBase read replicas
Timeline consistency
Keep replicas current
Read replica properties
Configure read replicas using Cloudera Manager
▶︎
Using rack awareness for read replicas
Create a topology map
Create a topology script
Activate read replicas on a table
Request a timeline-consistent read
▶︎
Using Apache HBase Backup and Disaster Recovery
HBase backup and disaster recovery strategies
▶︎
Configuring HBase snapshots
About HBase snapshots
Configure snapshots
▶︎
Manage HBase snapshots using Cloudera Manager
Browse HBase tables
Take HBase snapshots
▶︎
Store HBase snapshots on Amazon S3
Configure HBase in Cloudera Manager to store snapshots in Amazon S3
Configure the dynamic resource pool used for exporting and importing snapshots in Amazon S3
HBase snapshots on Amazon S3 with Kerberos enabled
Manage HBase snapshots on Amazon S3 in Cloudera Manager
Delete HBase snapshots from Amazon S3
Restore an HBase snapshot from Amazon S3
Restore an HBase snapshot from Amazon S3 with a new name
Manage Policies for HBase snapshots in Amazon S3
▶︎
Manage HBase snapshots using the HBase shell
Shell commands
Take a snapshot using a shell script
Export a snapshot to another cluster
▶︎
Snapshot failures
Information and debugging
▶︎
Using HBase replication
Common replication topologies
Notes about replication
Replication requirements
▶︎
Deploy HBase replication
Replication across three or more clusters
Enable replication on a specific table
Configure secure replication
▶︎
Configure bulk load replication
Enable bulk load replication using Cloudera Manager
Create empty table on the destination cluster
Disable replication at the peer level
Stop replication in an emergency
▶︎
Initiate replication when data already exist
Replicate pre-exist data in an active-active deployment
Effects of WAL rolling on replication
Configure secure HBase replication
Restore data from a replica
Verify that replication works
Replication caveats
▶︎
Configuring Apache HBase for Apache Phoenix
Configure HBase for use with Phoenix
▶︎
Using Apache Phoenix to Store and Access Data
▶︎
Mapping Apache Phoenix schemas to Apache HBase namespaces
Enable namespace mapping
▶︎
Associating tables of a schema to a namespace
Associate table in a customized Kerberos environment
Associate a table in a non-customized environment without Kerberos
▶︎
Using secondary indexing
Use strongly consistent indexing
Migrate to strongly consistent indexing
▶︎
Using transactions
Configure transaction support
Use transactions with tables
▶︎
Using JDBC API
Connecting to PQS using JDBC
Connect to Phoenix Query Server
Connect to Phoenix Query Server through Apache Knox
Launching Apache Phoenix Thin Client
Using non-JDBC drivers
▶︎
Using Apache Phoenix-Spark connector
Configuring Phoenix-Spark connector when both are on same cluster
Configuring Phoenix-Spark connector when Phoenix is on remote cluster
Phoenix-Spark connector usage examples
▶︎
Using Apache Phoenix-Hive connector
Configure Phoenix-Hive connector
Apache Phoenix-Hive usage examples
Limitations of Phoenix-Hive connector
▶︎
Managing Apache Phoenix Security
Managing Apache Phoenix security
Enable Phoenix ACLs
Configure TLS encryption manually for Phoenix Query Server
▶︎
Managing Operational Database powered by Apache Accumulo
Change root user password
Find latest OpDB keytab
Relax WAL durability
▶︎
How to: Data Science
▶︎
Configuring Apache Spark
▶︎
Configuring dynamic resource allocation
Customize dynamic resource allocation settings
Configure a Spark job for dynamic resource allocation
Dynamic resource allocation properties
▶︎
Spark security
Enabling Spark authentication
Enabling Spark Encryption
Running Spark applications on secure clusters
Configuring HSTS for Spark
Accessing compressed files in Spark
Sample script to connect Spark to Ozone
▶︎
Developing Apache Spark Applications
Introduction
Spark application model
Spark execution model
Developing and running an Apache Spark WordCount application
Using the Spark DataFrame API
▶︎
Building Spark Applications
Best practices for building Apache Spark applications
Building reusable modules in Apache Spark applications
Packaging different versions of libraries with an Apache Spark application
▶︎
Using Spark SQL
SQLContext and HiveContext
Querying files into a DataFrame
Spark SQL example
Interacting with Hive views
Performance and storage considerations for Spark SQL DROP TABLE PURGE
TIMESTAMP compatibility for Parquet files
Accessing Spark SQL through the Spark shell
Calling Hive user-defined functions (UDFs)
▶︎
Using Spark Streaming
Spark Streaming and Dynamic Allocation
Spark Streaming Example
Enabling fault-tolerant processing in Spark Streaming
Configuring authentication for long-running Spark Streaming jobs
Building and running a Spark Streaming application
Sample pom.xml file for Spark Streaming with Kafka
▶︎
Accessing external storage from Spark
▶︎
Accessing data stored in Amazon S3 through Spark
Examples of accessing Amazon S3 data from Spark
Accessing Hive from Spark
Accessing HDFS Files from Spark
▶︎
Accessing ORC Data in Hive Tables
Accessing ORC files from Spark
Predicate push-down optimization
Loading ORC data into DataFrames using predicate push-down
Optimizing queries using partition pruning
Enabling vectorized query execution
Reading Hive ORC tables
Accessing Avro data files from Spark SQL applications
Accessing Parquet files from Spark SQL applications
▶︎
Using Spark MLlib
Running a Spark MLlib example
Enabling Native Acceleration For MLlib
Using custom libraries with Spark
▶︎
Running Apache Spark Applications
Introduction
Running your first Spark application
Running Spark 3 Applications
Updating Spark 2 apps for Spark 3.x
Running sample Spark applications
▶︎
Configuring Spark Applications
Configuring Spark application properties in spark-defaults.conf
Configuring Spark application logging properties
▶︎
Submitting Spark applications
spark-submit command options
Spark cluster execution overview
Canary test for pyspark command
Fetching Spark Maven dependencies
Accessing the Spark History Server
▶︎
Running Spark applications on YARN
Spark on YARN deployment modes
Submitting Spark Applications to YARN
Monitoring and Debugging Spark Applications
Example: Running SparkPi on YARN
Configuring Spark on YARN Applications
Dynamic allocation
▶︎
Submitting Spark applications using Livy
Configuring the Livy Thrift Server
Connecting to the Apache Livy Thrift Server
Using Livy with Spark
Using Livy with interactive notebooks
Using the Livy API to run Spark jobs
▶︎
Running an interactive session with the Livy API
Livy objects for interactive sessions
Setting Python path variables for Livy
Livy API reference for interactive sessions
▶︎
Submitting batch applications using the Livy API
Livy batch object
Livy API reference for batch jobs
▶︎
Using PySpark
Running PySpark in a virtual environment
Running Spark Python applications
Automating Spark Jobs with Oozie Spark Action
▶︎
Tuning Apache Spark
Introduction
Check Job Status
Check Job History
Improving Software Performance
▶︎
Tuning Apache Spark Applications
Tuning Spark Shuffle Operations
Choosing Transformations to Minimize Shuffles
When Shuffles Do Not Occur
When to Add a Shuffle Transformation
Secondary Sort
Tuning Resource Allocation
Resource Tuning Example
Tuning the Number of Partitions
Reducing the Size of Data Structures
Choosing Data Formats
▶︎
CDS 3 Powered by Apache Spark
CDS 3.2.3 Overview
CDS 3.2.3 Requirements
Installing CDS 3.2.3
Enabling Spark rolling event log files in CDP
Enabling CDS 3.2.3 with GPU Support
Updating Spark 2 apps for Spark 3
Running Spark 3 Applications with CDS 3.2.3
Running applications with CDS 3.2.3 with GPU Support
CDS 3.2.3 Packaging, and Download
Using the CDS 3.2.3 Maven Repo
CDS 3.2.3 Maven Artifacts
▶︎
Cumulative hotfixes for CDS
Cumulative hotfix CDS 3.2.7172000.3-3
Cumulative hotfix CDS 3.2.7172000.6-1
Cumulative hotfix CDS 3.2.7172000.8-1
Cumulative hotfix CDS 3.2.7172000.9-1
Cumulative hotfix CDS 3.2.7172000.10-1
Cumulative hotfix CDS 3.2.7172000.12-1
Cumulative hotfix CDS 3.2.7172000.13-4
Cumulative hotfix CDS 3.2.7172000.14-1
Cumulative hotfix CDS 3.2.7172000.15-1
Cumulative hotfix CDS 3.2.7172000.16-1
Cumulative hotfix CDS 3.2.7173000.2-1
Cumulative hotfix CDS 3.2.7173000.3-1
▶︎
Configuring Apache Zeppelin
Introduction
Configuring Zeppelin caching
Configuring Livy
Configure User Impersonation for Access to Hive
Configure User Impersonation for Access to Phoenix
▶︎
Enabling Access Control for Zeppelin Elements
Enable Access Control for Interpreter, Configuration, and Credential Settings
Enable Access Control for Notebooks
Enable Access Control for Data
▶︎
Shiro Settings: Reference
Active Directory Settings
LDAP Settings
General Settings
shiro.ini Example
▶︎
Using Apache Zeppelin
Introduction
Launch Zeppelin
▶︎
Working with Zeppelin Notes
Create and Run a Note
Import a Note
Export a Note
Using the Note Toolbar
Import External Packages
▶︎
Configuring and Using Zeppelin Interpreters
Modify interpreter settings
Using Zeppelin Interpreters
Customize interpreter settings in a note
Use the JDBC interpreter to access Hive
Use the Livy interpreter to access Spark
Using Spark Hive Warehouse and HBase Connector Client .jar files with Livy
▶︎
How to: Security
▶︎
Configuring Authentication in Cloudera Manager
Overview
Kerberos Security Artifacts Overview
Kerberos Configuration Strategies for CDP
▶︎
Configuring Authentication in Cloudera Manager
Cloudera Manager user accounts
▶︎
Configuring external authentication and authorization for Cloudera Manager
Configuring PAM authentication with LDAP and SSSD
Configuring PAM authentication with Linux users
Configuring PAM authentication using Apache Knox
Configure authentication using Active Directory
Configure authentication using an LDAP-compliant identity service
Configure authentication using Kerberos (SPNEGO)
Configure authentication using an external program
Configure authentication using SAML
▶︎
Enabling Kerberos Authentication for CDP
Step 1: Install Cloudera Manager and CDP
Step 2: Install JCE policy files for AES-256 encryption
Step 3: Create the Kerberos Principal for Cloudera Manager Server
Step 4: Enable Kerberos using the wizard
Step 5: Create the HDFS superuser
Step 6: Get or create a Kerberos principal for each user account
Step 7: Prepare the cluster for each user
Step 8: Verify that Kerberos security is working
Step 9: (Optional) Enable authentication for HTTP web consoles for Hadoop roles
Kerberos authentication for non-default users
▶︎
Customizing Kerberos principals
Configuring custom Kerberos principal for Atlas
Configuring custom Kerberos principal for Cruise Control
Configuring custom Kerberos principal for Apache Flink
Configuring custom Kerberos principal for HBase
Configuring custom Kerberos principal for HDFS
Configuring custom Kerberos principal for Hive and Hive-on-Tez
Configuring custom Kerberos principal for HttpFS
Configuring custom Kerberos principal for Hue
Configuring Kerberos Authentication
Configuring custom Kerberos principal for Kafka
Configuring custom Kerberos principal for Knox
Configuring custom Kerberos principal for Kudu
Configuring custom Kerberos principal for Livy
Configuring custom Kerberos principal for NiFi and NiFi Registry
Configuring custom Kerberos principal for Omid
Configuring custom Kerberos principal for Oozie
Configuring custom Kerberos principal for Ozone
Configuring custom Kerberos principal for Phoenix
Configuring custom Kerberos principal for Schema Registry
Configuring custom Kerberos principals and custom system users for Solr
Configuring custom Kerberos principal for Spark
Configuring custom Kerberos principal for Streams Messaging Manager
Configuring custom Kerberos principal for SQL Stream Builder
Configuring custom Kerberos principal for Streams Replication Manager
Enabling custom Kerberos principal support in YARN
Enabling custom Kerberos principal support in a Queue Manager cluster
Configuring custom Kerberos principal for Zeppelin
Configuring custom Kerberos principal for ZooKeeper
Managing Kerberos credentials using Cloudera Manager
Using a custom Kerberos keytab retrieval script
Adding trusted realms to the cluster
Using auth-to-local rules to isolate cluster users
Configuring a dedicated MIT KDC for cross-realm trust
Integrating MIT Kerberos and Active Directory
Hadoop Users (user:group) and Kerberos Principals
Mapping Kerberos Principals to Short Names
▶︎
Cloudera Authorization
Overview
Configuring LDAP Group Mappings
Using Ranger to Provide Authorization in CDP
▶︎
Encrypting Data in Transit
Encrypting Data in Transit
Understanding Keystores and Truststores
Disabling TLS protocols on JMX ports
Choosing manual TLS or Auto-TLS
SAN Certificates
▶︎
Configuring TLS Encryption for Cloudera Manager Using Auto-TLS
Use case 1: Use Cloudera Manager to generate internal CA and corresponding certificates
▶︎
Use case 2: Enabling Auto-TLS with an intermediate CA signed by an existing Root CA
Certmanager Options - Using CM's GenerateCMCA API
Use case 3: Enabling Auto-TLS with Existing Certificates
Manually Configuring TLS Encryption for Cloudera Manager
▶︎
Configuring TLS/SSL encryption manually for CDP Services
Configuring TLS encryption manually for Apache Atlas
Enable security for Cruise Control
Configuring TLS/SSL encryption manually for DAS using Cloudera Manager
Enabling security for Apache Flink
▶︎
Configuring TLS/SSL for HBase
Prerequisites to configure TLS/SSL for HBase
Configuring TLS/SSL for HBase Web UIs
Configuring TLS/SSL for HBase REST Server
Configuring TLS/SSL for HBase Thrift Server
Enabling TLS/SSL for HiveServer
▶︎
Configuring TLS/SSL for Hue
Creating a truststore file in PEM format
Configuring Hue as a TLS/SSL client
Enabling Hue as a TLS/SSL client
Configuring Hue as a TLS/SSL server
Enabling Hue as a TLS/SSL server using Cloudera Manager
Enabling TLS/SSL for Hue Load Balancer
Enabling TLS/SSL communication with HiveServer2
Enabling TLS/SSL communication with Impala
Securing database connections with TLS/SSL
Configuring Impala TLS/SSL
▶︎
Channel encryption
Configure Kafka brokers
Configure Kafka MirrorMaker
Configuring TLS/SSL encryption
Configure Kafka clients
Configure Zookeeper TLS/SSL support for Kafka
▶︎
Authentication
▶︎
TLS/SSL client authentication
Configure Kafka brokers
Configure Kafka clients
Principal name mapping
Inter-broker security
Configuring multiple listeners
▶︎
Configuring TLS/SSL encryption manually for Key Trustee Server
Key Trustee Server Properties for TLS
▶︎
Configuring TLS/SSL encryption manually for Apache Knox
Knox Properties for TLS
Configuring TLS/SSL encryption for Kudu using Cloudera Manager
Configure Lily HBase Indexer to use TLS/SSL
Configuring TLS/SSL encryption manually for Livy
▶︎
Configuring TLS/SSL manually
TLS/SSL certificate requirements and recommendations
Configuring TLS/SSL encryption manually for NiFi and NiFi Registry
NiFi TLS/SSL properties
NiFi Registry TLS/SSL Properties
Configure TLS/SSL for Oozie
Configure TLS encryption manually for Phoenix Query Server
Configure TLS/SSL encryption manually for Apache Ranger
▶︎
Configure TLS/SSL encryption manually for Ranger KMS
Overriding custom keystore alias on a Ranger KMS Server
Configure TLS/SSL encryption manually for Ranger RMS
Configuring TLS encryption manually for Schema Registry
▶︎
Configure TLS/SSL encryption for Solr
Using a load balancer
Configuring TLS/SSL encryption manually for Spark
Encryption in SSB
Enabling TLS/SSL for the SRM service
▶︎
Enabling TLS Encryption for SMM on CDP Private Cloud
TLS/SSL settings for Streams Messaging Manager
▶︎
Configuring TLS/SSL for Core Hadoop Services
Configuring TLS/SSL for HDFS
Configuring TLS/SSL for YARN
Configuring TLS/SSL encryption manually for Zeppelin
Configure ZooKeeper TLS/SSL using Cloudera Manager
Manually Configuring TLS Encryption on the Agent Listening Port
▶︎
Encrypting Data at Rest
Encrypting Data at Rest
Data at Rest Encryption Reference Architecture
Data at Rest Encryption Requirements
Resource Planning for Data at Rest Encryption
▶︎
HDFS Transparent Encryption
▶︎
Key Concepts and Architecture
Keystores and the Key Management Server
Data Encryption Components and Solutions
Encryption Zones and Keys
Accessing Files Within an Encryption Zone
Optimizing Performance for HDFS Transparent Encryption
▶︎
Managing Encryption Keys and Zones
Validating Hadoop Key Operations
Creating Encryption Zones
Adding Files to an Encryption Zone
Deleting Encryption Zones
Backing Up Encryption Keys
Rolling Encryption Keys
Deleting Encryption Zone Keys
▶︎
Re-encrypting Encrypted Data Encryption Keys (EDEKs)
Benefits and Capabilities
Prerequisites and Assumptions
Limitations
Re-encrypting an EDEK
Managing Re-encryption Operations
▶︎
Securing the Key Management System (KMS)
Enabling Kerberos Authentication for the KMS
Configuring TLS/SSL for the KMS
Migrating Keys from a Java KeyStore to Cloudera Navigator Key Trustee Server
▶︎
Migrating Ranger Key Management Server Role Instances to a New Host
Migrate the Ranger Admin role instance to a new host
Migrate the Ranger KMS db role instance to a new host
Migrate the Ranger KMS KTS role instance to a new host
▶︎
Migrating ACLs from Key Trustee KMS to Ranger KMS
Key Trustee KMS operations not supported by Ranger KMS
ACLs supported by Ranger KMS and Ranger KMS Mapping
▶︎
Configuring CDP Services for HDFS Encryption
Transparent Encryption Recommendations for HBase
▶︎
Transparent Encryption Recommendations for Hive
Changed Behavior after HDFS Encryption is Enabled
KMS ACL Configuration for Hive
Transparent Encryption Recommendations for Hue
Transparent Encryption Recommendations for Impala
Transparent Encryption Recommendations for MapReduce and YARN
Transparent Encryption Recommendations for Search
Transparent Encryption Recommendations for Spark
Transparent Encryption Recommendations for Sqoop
▶︎
Integrating Components for Encrypting Data at Rest
Set up Luna 7 HSM for Ranger KMS w/database
Set up Luna 6 HSM for Ranger KMS, KTS, and KeyHSM
Set up Luna 7 HSM for Ranger KMS, KTS, and KeyHSM
Set up GCP Cloud HSM for Ranger KMS, KTS, and KeyHSM
Setting up CipherTrust HSM for Ranger KMS, KTS, and KeyHSM
Integrating Ranger KMS DB with Google Cloud HSM
Integrating Ranger KMS DB with CipherTrust Manager HSM
Integrating Ranger KMS DB with SafeNet Keysecure HSM
Connecting KeySecure HSM to CipherTrust Manager after migration from Key Secure HSM
▶︎
Using the Ranger Key Management Service
Accessing the Ranger KMS Web UI
List and Create Keys
Roll Over an Existing Key
Delete a Key
▶︎
Navigator Key Trustee Server
▶︎
Cloudera Navigator Key Trustee Server Overview
Key Trustee Server System Requirements
Cloudera Navigator Key Trustee Server
▶︎
Backing up Key Trustee Server and clients
Back up Key Trustee Server using Cloudera Manager
Back up Key Trustee Server using the ktbackup.sh script
Back up Key Trustee Server manually
Back up Key Trustee Server clients
▶︎
Restoring Navigator Key Trustee Server
Restore Key Trustee Server in parcel-based installations
Restore Key Trustee Server in package-based installations
Restore Key Trustee Server from ktbackup.sh backups
▶︎
Initializing Standalone Key Trustee Server
Initializing Standalone Key Trustee Server Using Cloudera Manager
Specifying TLS/SSL Minimum Allowed Version and Ciphers
Configuring a Mail Transfer Agent for Key Trustee Server
Verifying Cloudera Navigator Key Trustee Server Operations
Managing Key Trustee Server Organizations
▶︎
Managing Key Trustee Server Certificates
Generating a New Certificate
Replacing Key Trustee Server Certificates
▶︎
Setting Up Key Trustee Server High Availability
Configuring Key Trustee Server High Availability Using Cloudera Manager
Recovering a Key Trustee Server
▶︎
Navigator Encrypt
Navigator Encrypt Overview
Registering Cloudera Navigator Encrypt with Key Trustee Server
Preparing for Encryption Using Cloudera Navigator Encrypt
Encrypting and Decrypting Data Using Cloudera Navigator Encrypt
Converting from Device Names to UUIDs for Encrypted Devices
Navigator Encrypt Access Control List
Maintaining Cloudera Navigator Encrypt
▶︎
Navigator Key HSM
Cloudera Navigator Key HSM Overview
Initializing Navigator Key HSM
HSM-Specific Setup for Cloudera Navigator Key HSM
Validating Key HSM Settings
Managing the Navigator Key HSM Service
Integrating Key HSM with Key Trustee Server
▶︎
Apache Ranger Access Control and Auditing
▶︎
Apache Ranger Auditing
Audit Overview
▶︎
Managing Auditing with Ranger
View audit details
Create a read-only Admin user (Auditor)
Configuring Ranger audit properties for Solr
Configuring Ranger audit properties for HDFS
▶︎
Ranger Audit Filters
Default Ranger audit filters
Configuring a Ranger audit filter policy
How to set audit filters in Ranger Admin Web UI
Filter service access logs from Ranger UI
Excluding audits for specific users, groups, and roles
Changing Ranger audit storage location and migrating data
Configuring Ranger audits to show actual client IP address
▶︎
Apache Ranger Authorization
Using Ranger to Provide Authorization in CDP
Ranger special entities
▶︎
Ranger Policies Overview
Ranger tag-based policies
Tags and policy evaluation
Ranger access conditions
▶︎
Using the Ranger Console
Accessing the Ranger console
Ranger console navigation
▶︎
Resource-based Services and Policies
▶︎
Configuring resource-based services
Configure a resource-based service: Atlas
Configure a resource-based service: HBase
Configure a resource-based service: HDFS
Configure a resource-based service: HadoopSQL
Configure a resource-based service: Kafka
Configure a resource-based service: Knox
Configure a resource-based service: NiFi
Configure a resource-based service: NiFi Registry
Configure a resource-based service: Solr
Configure a resource-based service: YARN
▶︎
Configuring resource-based policies
Configure a resource-based policy: Atlas
Configure a resource-based policy: HBase
Configure a resource-based policy: HDFS
Configure a resource-based policy: HadoopSQL
Configure a resource-based storage handler policy: HadoopSQL
Configure a resource-based policy: Kafka
Configure a resource-based policy: Knox
Configure a resource-based policy: NiFi
Configure a resource-based policy: NiFi Registry
Configure a resource-based policy: Solr
Configure a resource-based policy: YARN
Wildcards and variables in resource-based policies
Adding a policy label to a resource-based policy
Preloaded resource-based services and policies
▶︎
Importing and exporting resource-based policies
Import resource-based policies for a specific service
Import resource-based policies for all services
Export resource-based policies for a specific service
Export all resource-based policies for all services
▶︎
Row-level filtering and column masking in Hive
Row-level filtering in Hive with Ranger policies
Dynamic resource-based column masking in Hive with Ranger policies
Dynamic tag-based column masking in Hive with Ranger policies
▶︎
Tag-based Services and Policies
Adding a tag-based service
▶︎
Adding tag-based policies
Using tag attributes and values in Ranger tag-based policy conditions
Adding a tag-based PII policy
Default EXPIRES ON tag policy
▶︎
Importing and exporting tag-based policies
Import tag-based policies
Export tag-based policies
Create a time-bound policy
Create a Hive authorizer URL policy
▶︎
Ranger Security Zones
Security Zones Administration
Security Zones Example Use Cases
Adding a Ranger security zone
▶︎
Administering Ranger Users, Groups, Roles, and Permissions
Add a user
Edit a user
Delete a user
Add a group
Edit a group
Delete a group
Add a role through Ranger
Add a role through Hive
Edit a role
Delete a role
Add or edit permissions
▶︎
Administering Ranger Reports
View Ranger reports
Search Ranger reports
Export Ranger reports
Using Ranger client libraries
Using session cookies to validate Ranger policies
▶︎
Apache Ranger User Management
Ranger Usersync
Configure Usersync assignment of Admin users
Configure Ranger Usersync for Deleted Users and Groups
Configure Ranger Usersync for invalid usernames
Adding default service users and roles for Ranger
Set credentials for Ranger Usersync
Ranger user management
▶︎
Configuring Ranger Authentication with UNIX, LDAP, or AD
▶︎
Configuring Ranger Authentication with UNIX, LDAP, AD, or PAM
Configure Ranger authentication for UNIX
Configure Ranger authentication for AD
Configure Ranger authentication for LDAP
Configure Ranger authentication for PAM
▶︎
Ranger AD Integration
Ranger UI authentication
Ranger UI authorization
▶︎
Configuring Advanced Security Options for Apache Ranger
Configuring the server work directory path for a Ranger service
Configure session timeout for Ranger Admin Web UI
Configure Kerberos authentication for Apache Ranger
Configure TLS/SSL encryption manually for Apache Ranger
▶︎
Configure TLS/SSL encryption manually for Ranger KMS
Overriding custom keystore alias on a Ranger KMS Server
Configure TLS/SSL encryption manually for Ranger RMS
▶︎
Configuring Apache Ranger High Availability
Configure Ranger Admin High Availability
Configure Ranger Admin High Availability with a Load Balancer
Migrating Ranger Usersync and Tagsync role groups
Configuring JVM options and system properties for Ranger services
How to pass JVM options to Ranger KMS services
How to clear Ranger Admin access logs
Enable Ranger Admin login using kerberos authentication
How to configure Ranger HDFS plugin configs per (NameNode) Role Group
How to add a coarse URI check for Hive agent
How to suppress database connection notifications
How to change the password for Ranger users
▶︎
Configuring and Using Hive-HDFS ACL Sync
Ranger RMS - HIVE-HDFS ACL Sync Overview
Analyzing Ranger RMS resources
How to full sync the Ranger RMS database
Configure High Availability for Hive-HDFS ACL Sync
Configure Hive-HDFS ACL Sync
Hive-HDFS ACL Sync Use Cases
Hive-HDFS ACL Sync Reference
▶︎
Configuring and Using Ranger KMS
▶︎
Configuring Ranger KMS High Availability
Configure High Availability for Ranger KMS with DB
Configure High Availability for Ranger KMS with KTS
▶︎
Apache Knox Authentication
▶︎
Apache Knox Overview
Securing Access to Hadoop Cluster: Apache Knox
Apache Knox Gateway Overview
Knox Supported Services Matrix
Knox Topology Management in Cloudera Manager
Considerations for Knox
Proxy Cloudera Manager through Apache Knox
▶︎
Installing Apache Knox
Apache Knox Install Role Parameters
▶︎
Management of Knox shared providers in Cloudera Manager
Configure Apache Knox authentication for PAM
Configure Apache Knox authentication for AD/LDAP
Configure Apache Knox authentication for SAML
Add a new shared provider configuration
TLS Mutual Authentication
▶︎
Management of existing Apache Knox shared providers
Add a new provider in an existing provider configuration
Modify a provider in an existing provider configuration
Disable a provider in an existing provider configuration
Saving aliases
Configuring Kerberos authentication in Apache Knox shared providers
▶︎
Management of services for Apache Knox via Cloudera Manager
Enable proxy for a known service in Apache Knox
Disable proxy for a known service in Apache Knox
Add custom service to existing descriptor in Apache Knox Proxy
Add a custom descriptor to Apache Knox
▶︎
Management of Service Parameters for Apache Knox via Cloudera Manager
Add custom service parameter to descriptor
Modify custom service parameter in descriptor
Remove custom service parameter from descriptor
▶︎
Additional Security Topics
How to Add Root and Intermediate CAs to Truststore for TLS/SSL
Amazon S3 Security
How to Authenticate Kerberos Principals Using Java
Check Cluster Security Settings
Configure Antivirus Software on CDP Hosts
Configure Browser-based Interfaces to Require Authentication (SPNEGO)
Configure Browsers for Kerberos Authentication (SPNEGO)
Configure Cluster to Use Kerberos Authentication
Convert DER, JKS, PEM Files for TLS/SSL Artifacts
Configure Authentication for Amazon S3
Configure Encryption for Amazon S3
Configure AWS Credentials
Enable Sensitive Data Redaction
Log a Security Support Case
Obtain and Deploy Keys and Certificates for TLS/SSL
Renew and Redistribute Certificates
Set Up a Gateway Host to Restrict Access to the Cluster
Set Up Access to Cloudera EDH (Microsoft Azure Marketplace)
Use Self-Signed Certificates for TLS
▶︎
Configuring Infra Solr
Configure Ranger authorization for Infra Solr
Configuring custom Kerberos principals and custom system users for Solr
▶︎
How to: Governance
▶︎
Searching with Metadata
Searching overview
Using Basic Search
Using Search filters
Using Free-text Search
Saving searches
Using advanced search
Atlas index repair configuration
▶︎
Working with Classifications and Labels
Working with Atlas classifications and labels
Creating classifications
Creating labels
Adding attributes to classifications
Associating classifications with entities
Propagating classifications through lineage
Searching for entities using classifications
▶︎
Exploring using Lineage
Lineage overview
Viewing lineage
Lineage lifecycle
▶︎
Leveraging Business Metadata
Business Metadata overview
Creating Business Metadata
Adding attributes to Business Metadata
Associating Business Metadata attributes with entities
Importing Business Metadata associations in bulk
Searching for entities using Business Metadata attributes
▶︎
Managing Business Terms with Atlas Glossaries
Glossaries overview
Creating glossaries
Creating terms
Associating terms with entities
Defining related terms
Creating categories
Assigning terms to categories
Searching using terms
▶︎
Importing Glossary terms in bulk
Enhancements related to bulk glossary terms import
▶︎
Setting up Atlas High Availability
About Atlas High Availability
Prerequisites for setting up Atlas HA
Installing Atlas in HA using CDP Private Cloud Base cluster
▶︎
Auditing Atlas Entities
▶︎
Audit Operations
Atlas Type Definitions
Atlas Export and Import Operations
Atlas Server Operations
Audit enhancements
Examples of Audit Operations
▶︎
Securing Atlas
Securing Atlas
Configuring TLS encryption manually for Apache Atlas
▶︎
Configuring Atlas Authentication
Configure Kerberos authentication for Apache Atlas
Configure Atlas authentication for AD
Configure Atlas authentication for LDAP
Configure Atlas PAM authentication
Configure Atlas file-based authentication
▶︎
Configuring Atlas Authorization
Restricting classifications based on user permission
Configuring Ranger Authorization for Atlas
Configuring Atlas Authorization using Ranger
Configuring Simple Authorization in Atlas
▶︎
Configuring Atlas using Cloudera Manager
▶︎
Configuring and Monitoring Atlas
Showing Atlas Server status
Accessing Atlas logs
▶︎
Integrating Atlas with Ozone
About Apache Ozone integration with Apache Atlas
How Integration works
▶︎
Using import utility tools with Atlas
▶︎
Importing Hive Metadata using Command-Line (CLI) utility
Using Atlas-Hive import utility with Ozone entities
Setting up Atlas Kafka import tool
▶︎
How to: Jobs Management
Overview of Oozie
Adding the Oozie service using Cloudera Manager
Considerations for Oozie to work with AWS
User authorization configuration for Oozie
▶︎
Redeploying the Oozie ShareLib
Redeploying the Oozie sharelib using Cloudera Manager
▶︎
Oozie configurations with CDP services
▶︎
Using Sqoop actions with Oozie
Deploying and configuring Oozie Sqoop1 Action JDBC drivers
Configuring Oozie Sqoop1 Action workflow JDBC drivers
Configuring Oozie to enable MapReduce jobs to read or write from Amazon S3
Configuring Oozie to use HDFS HA
Using Hive Warehouse Connector with Oozie Spark Action
▶︎
Oozie High Availability
Requirements for Oozie High Availability
▶︎
Configuring Oozie High Availability using Cloudera Manager
Oozie Load Balancer configuration
Enabling Oozie High Availability
Disabling Oozie High Availability
▶︎
Scheduling in Oozie using cron-like syntax
Oozie scheduling examples
▶︎
Configuring an external database for Oozie
Configuring PostgreSQL for Oozie
Configuring MariaDB for Oozie
Configuring MySQL 5 for Oozie
Configuring MySQL 8 for Oozie
Configuring Oracle for Oozie
▶︎
Working with the Oozie server
Starting the Oozie server
Stopping the Oozie server
Accessing the Oozie server with the Oozie Client
Accessing the Oozie server with a browser
Adding schema to Oozie using Cloudera Manager
Enabling the Oozie web console on managed clusters
Enabling Oozie SLA with Cloudera Manager
Disabling Oozie UI using Cloudera Manager
Moving the Oozie service to a different host
▶︎
Oozie database configurations
Configuring Oozie data purge settings using Cloudera Manager
Loading the Oozie database
Dumping the Oozie database
Setting the Oozie database timezone
Prerequisites for configuring TLS/SSL for Oozie
Configure TLS/SSL for Oozie
Oozie security enhancements
Additional considerations when configuring TLS/SSL for Oozie HA
Configure Oozie client when TLS/SSL is enabled
Configuring custom Kerberos principal for Oozie
▶︎
How to: Streams Messaging
▶︎
Configuring Apache Kafka
Operating system requirements
Performance considerations
Quotas
▶︎
JBOD
JBOD setup
JBOD Disk migration
Setting user limits for Kafka
Configuring Kafka ZooKeeper chroot
Rack awareness
▶︎
Securing Apache Kafka
▶︎
Channel encryption
Configure Kafka brokers
Configure Kafka clients
Configure Kafka MirrorMaker
Configure Zookeeper TLS/SSL support for Kafka
▶︎
Authentication
▶︎
TLS/SSL client authentication
Configure Kafka brokers
Configure Kafka clients
Principal name mapping
▶︎
Kerberos authentication
Enable Kerberos authentication
Configuring custom Kerberos principal for Kafka
▶︎
Delegation token based authentication
Enable or disable authentication with delegation tokens
Manage individual delegation tokens
Rotate the master key/secret
▶︎
Client authentication using delegation tokens
Configure clients on a producer or consumer level
Configure clients on an application level
▶︎
LDAP authentication
Configure Kafka brokers
Configure Kafka clients
▶︎
PAM authentication
Configure Kafka brokers
Configure Kafka clients
▶︎
Authorization
▶︎
Ranger
Enable authorization in Kafka with Ranger
Configure the resource-based Ranger service used for authorization
▶︎
Governance
Configuring the Atlas hook in Kafka
Inter-broker security
Configuring multiple listeners
▶︎
Kafka security hardening with Zookeeper ACLs
Restricting access to Kafka metadata in Zookeeper
Unlocking access to Kafka metadata in Zookeeper
▶︎
Tuning Apache Kafka Performance
Handling large messages
▶︎
Cluster sizing
Sizing estimation based on network and disk message throughput
Choosing the number of partitions for a topic
▶︎
Broker Tuning
JVM and garbage collection
Network and I/O threads
ISR management
Log cleaner
▶︎
System Level Broker Tuning
File descriptor limits
Filesystems
Virtual memory handling
Networking parameters
Configure JMX ephemeral ports
Kafka-ZooKeeper performance tuning
▶︎
Managing Apache Kafka
▶︎
Management basics
Broker log management
Record management
Broker garbage log collection and log rotation
Client and broker compatibility across Kafka versions
▶︎
Managing topics across multiple Kafka clusters
Set up MirrorMaker in Cloudera Manager
Settings to avoid data loss
▶︎
Broker migration
Migrate brokers by modifying broker IDs in meta.properties
Use rsync to copy files from one broker to another
▶︎
Disk management
Monitoring
▶︎
Handling disk failures
Disk Replacement
Disk Removal
Reassigning replicas between log directories
Retrieving log directory replica assignment information
▶︎
Metrics
Building Cloudera Manager charts with Kafka metrics
Essential metrics to monitor
▶︎
Command Line Tools
Unsupported command line tools
kafka-topics
kafka-configs
kafka-console-producer
kafka-console-consumer
kafka-consumer-groups
▶︎
kafka-reassign-partitions
Tool usage
Reassignment examples
kafka-log-dirs
zookeeper-security-migration
kafka-delegation-tokens
kafka-*-perf-test
Configuring log levels for command line tools
Understanding the kafka-run-class Bash Script
▶︎
Developing Apache Kafka Applications
Kafka producers
▶︎
Kafka consumers
Subscribing to a topic
Groups and fetching
Protocol between consumer and broker
Rebalancing partitions
Retries
Kafka clients and ZooKeeper
▶︎
Java client
▶︎
Client examples
Simple Java consumer
Simple Java producer
Security examples
▶︎
.NET client
▶︎
Client examples
Simple .NET consumer
Simple .NET producer
Performant .NET producer
Security examples
Kafka Streams
Kafka public APIs
Recommendations for client development
▶︎
Kafka Connect
Kafka Connect Overview
▶︎
Kafka Connect Setup
Installing the Kafka Connect Role
Configuring Streams Messaging Manager for Kafka Connect
▶︎
Using Kafka Connect
Configuring the Kafka Connect Role
Managing, Deploying and Monitoring Connectors
▶︎
Writing Kafka data to Ozone with Kafka Connect
Writing data in an unsecured cluster
Writing data in a Kerberos and TLS/SSL enabled cluster
▶︎
Securing Kafka Connect
Configure TLS/SSL Encryption for the Kafka Connect Role
Configure Kerberos Authentication for the Kafka Connect role
Kafka Connect API Security
▶︎
Connectors
Installing Connectors
▶︎
HDFS Sink Connector
Configuration example for writing data to HDFS
Configuration example for writing data to Ozone FS
▶︎
Amazon S3 Sink Connector
Configuration Example
▶︎
Configuring Cruise Control
Adding Cruise Control as a service
▶︎
Setting capacity estimations and goals
Configuring capacity estimations
Configuring goals
Example of Cruise Control goal configuration
▶︎
Enabling self-healing in Cruise Control
Changing the Anomaly Notifier Class value to self-healing
Enabling self-healing for all or individual anomaly types
Adding self-healing goals to Cruise Control in Cloudera Manager
▶︎
Securing Cruise Control
▶︎
Enable security for Cruise Control
Configuring custom Kerberos principal for Cruise Control
▶︎
Managing Cruise Control
▶︎
Rebalancing with Cruise Control
Cruise Control REST API endpoints
Rebalance after adding Kafka broker
Rebalance after demoting Kafka broker
Rebalance after removing Kafka broker
▶︎
Securing Streams Messaging Manager
Securing Streams Messaging Manager
Verifying the setup
▶︎
Getting Metrics for Streams Messaging Manager
Cloudera Manager metrics overview
Prometheus metrics overview
▶︎
Prometheus configuration for SMM
Prerequisites for Prometheus configuration
Prometheus properties configuration
SMM property configuration in Cloudera Manager for Prometheus
Kafka property configuration in Cloudera Manager for Prometheus
Kafka Connect property configuration in Cloudera Manager for Prometheus
Start Prometheus
▶︎
Secure Prometheus for SMM
▶︎
Nginx proxy configuration over Prometheus
Nginx installtion
Nginx configuration for Prometheus
▶︎
Setting up TLS for Prometheus
Configuring SMM to recognize Prometheus's TLS certificate
▶︎
Setting up basic authentication with TLS for Prometheus
Configuring Nginx for basic authentication
Configuring SMM for basic authentication
Setting up mTLS for Prometheus
Prometheus for SMM limitations
Troubleshooting Prometheus for SMM
Performance comparison between Cloudera Manager and Prometheus
▶︎
Monitoring Kafka Clusters using Streams Messaging Manager
Monitoring Kafka clusters
Monitoring Kafka producers
Monitoring Kafka topics
Monitoring Kafka brokers
Monitoring Kafka consumers
▶︎
Managing Alert Policies using Streams Messaging Manager
Introduction to alert policies in Streams Messaging Manager
Component types and metrics for alert policies
Notifiers
▶︎
Managing alert policies and notifiers in SMM
Creating a notifier
Updating a notifier
Deleting a notifier
Creating an alert policy
Updating an alert policy
Enabling an alert policy
Disabling an alert policy
Deleting an alert policy
▶︎
Managing Kafka Topics using Streams Messaging Manager
Creating a Kafka topic
Modifying a Kafka topic
Deleting a Kafka topic
▶︎
Monitoring End-to-End Latency using Streams Messaging Manager
End to end latency overview
Granularity of metrics for end-to-end latency
Enabling interceptors
Monitoring end to end latency for Kafka topic
End to end latency use case
▶︎
Monitoring Kafka Cluster Replications using Streams Messaging Manager
Introduction to monitoring Kafka cluster replications in SMM
Configuring SMM for monitoring Kafka cluster replications
▶︎
Viewing Kafka cluster replication details
Searching Kafka cluster replications by source
Monitoring Kafka cluster replications by quick ranges
Monitoring status of the clusters to be replicated
▶︎
Monitoring topics to be replicated
Searching by topic name
Monitoring throughput for cluster replication
Monitoring replication latency for cluster replication
Monitoring checkpoint latency for cluster replication
Monitoring replication throughput and latency by values
▶︎
Monitoring Kafka Connect using Streams Messaging Manager
Introduction to Kafka Connect
Default view of Kafka Connect in the SMM UI
Creating a connector using Kafka Connect in SMM
Modifying a connector using Kafka Connect in SMM
Deleting a connector using Kafka Connect in SMM
▶︎
Monitoring connectors using Kafka Connect in SMM
Monitoring connector profile using Kafka Connect in SMM
Monitoring connector settings using Kafka Connect in SMM
Monitoring cluster profile using Kafka Connect in SMM
▶︎
Configuring Streams Replication Manager
Add Streams Replication Manager to an existing cluster
Enable high availability
▶︎
Defining and adding clusters for replication
Defining external Kafka clusters
Defining co-located Kafka clusters using a service dependency
Defining co-located Kafka clusters using Kafka credentials
Adding clusters to SRM's configuration
Configuring replications
Configuring the driver role target clusters
Configuring the service role target cluster
Configuring properties not exposed in Cloudera Manager
Configuring replication specific REST servers
Configuring automatic group offset synchronization
Configuring SRM Driver for performance tuning
New topic and consumer group discovery
▶︎
Configuration examples
Bidirectional replication example of two active clusters
Cross data center replication example of multiple clusters
▶︎
Using Streams Replication Manager
▶︎
SRM Command Line Tools
▶︎
srm-control
▶︎
Configuring srm-control
Configuring the SRM client's secure storage
Configuring TLS/SSL properties
Configuring Kerberos properties
Configuring properties for non-Kerberos authentication mechanisms
Setting the secure storage password as an environment variable
Topics and Groups Subcommand
Offsets Subcommand
Monitoring Replication with Streams Messaging Manager
Replicating Data
▶︎
How to Set up Failover and Failback
Configure SRM for Failover and Failback
Migrating Consumer Groups Between Clusters
▶︎
Securing Streams Replication Manager
Security overview
Enabling TLS/SSL for the SRM service
Enabling Kerberos for the SRM service
Configuring custom Kerberos principal for Streams Replication Manager
SRM security example
▶︎
Integrating with Schema Registry
▶︎
Integrating with NiFi
Understand the NiFi Record Based Processors and Controller Services
Configuring Schema Registry instance in NiFi
Adding and Configuring Record Reader and Writer Controller Services
Using Record-Enabled Processors
Integrating Kafka and Schema Registry
Integrating with Flink and SSB
Improve Performance in Schema Registry
▶︎
Using Schema Registry
Adding a new schema
Querying a schema
Evolving a schema
Deleting a schema
Importing Confluent Schema Registry schemas into Cloudera Schema Registry
▶︎
Securing Schema Registry
▶︎
TLS Encryption
TLS Certificate Requirements and Recommendations
Configure TLS Encryption Manually for Schema Registry
Schema Registry TLS Properties
▶︎
Schema Registry Authorization through Ranger Access Policies
Pre-defined Access Policies for Schema Registry
Add the user or group to a pre-defined access policy
Create a Custom Access Policy
Configuring custom Kerberos principal for Schema Registry
▶︎
Troubleshooting
▶︎
Troubleshooting Security Issues
Troubleshooting Security Issues
Error Messages and Various Failures
Authentication and Kerberos Issues
HDFS Encryption Issues
Key Trustee KMS Encryption Issues
TLS/SSL Issues
▶︎
YARN, MRv1, and Linux OS Security
TaskController Error Codes (MRv1)
ContainerExecutor Error Codes (YARN)
▶︎
Troubleshooting Apache Hive
HeapDumpPath (/tmp) in Hive data nodes gets full due to .hprof files
Query fails with "Counters limit exceeded" error message
HiveServer is unresponsive due to large queries running in parallel
▶︎
Troubleshooting Apache Impala
Troubleshooting Impala
Using Breakpad Minidumps for Crash Reporting
▶︎
Troubleshooting Apache Hadoop YARN
Troubleshooting Docker on YARN
Troubleshooting on YARN
Troubleshooting Linux Container Executor
▶︎
Troubleshooting Apache HBase
Troubleshooting HBase
▶︎
Using the HBCK2 tool to remediate HBase clusters
Running the HBCK2 tool
Finding issues
Fixing issues
HBCK2 tool command reference
Thrift Server crashes after receiving invalid data
HBase is using more disk space than expected
Troubleshoot RegionServer grouping
▶︎
Troubleshooting Apache Kudu
▶︎
Issues starting or restarting the master or the tablet server
Errors during hole punching test
Already present: FS layout already exists
Troubleshooting NTP stability problems
Disk space usage issue
▶︎
Performance issues
▶︎
Kudu tracing
Accessing the tracing web interface
RPC timeout traces
Kernel stack watchdog traces
Memory limits
Block cache size
Heap sampling
Slow name resolution and nscd
▶︎
Usability issues
ClassNotFoundException: com.cloudera.kudu.hive.KuduStorageHandler
Runtime error: Could not create thread: Resource temporarily unavailable (error 11)
Tombstoned or STOPPED tablet replicas
Corruption: checksum error on CFile block
Symbolizing stack traces
▶︎
Recover from a dead Kudu master
Prepare for the recovery
Perform the recovery
▶︎
Troubleshooting Operational Database powered by Apache Accumulo
Under‐replicated block exceptions or cluster failure occurs on small clusters
▶︎
HDFS storage demands due to retained HDFS trash
Change the HDFS trash settings in Cloudera Manager
Disable OpDB's use of HDFS trash
▶︎
Troubleshooting Cloudera Search
▶︎
Troubleshooting
Identifying problems
▶︎
Cloudera Search configuration and log files
Cloudera Search configuration files
View and modify Search configuration
Cloudera Search log files
View and modify log levels for Search and related services
▶︎
Troubleshooting Data Analytics Studio
▶︎
Problem area: Queries page
Queries are not appearing on the Queries page
Query column is empty but you can see the DAG ID and Application ID
Cannot see the DAG ID and the Application ID
Cannot view queries of other users
▶︎
Problem area: Compose page
Cannot see databases, or the query editor is missing
Unable to view new databases and tables, or unable to see changes to the existing databases or tables
Troubleshooting replication failure in the DAS Event Processor
Problem area: Reports page
Unable to start DAS
How DAS helps to debug Hive on Tez queries
▶︎
Troubleshooting Hue
The Hue load balancer not distributing users evenly across various Hue servers
Unable to authenticate users in Hue using SAML
Cleaning up old data to improve performance
Unable to connect to database with provided credential
Activating Hive query editor on Hue UI
Completed Hue query shows executing on CM
Finding the list of Hue superusers
Knox Gateway UI: incorrect username or password
HTTP 403 error while accessing Hue
'Type' error while accessing Hue from Knox Gateway
Unable to access Hue from Knox Gateway UI
Referer checking failed
Unable to view Snappy-compressed files
"Unknown Attribute Name" exception
Invalid query handle
Services backed by PostgreSQL fail or stop responding
Error validating LDAP user in Hue
502 Proxy Error while accessing Hue from the Load Balancer
Invalid method name: 'GetLog' error
Authorization Exception error
Cannot alter compressed tables in Hue
Connection failed error when accessing the Search app (Solr) from Hue
Downloading query results from Hue takes time
Hue Load Balancer does not start
Unable to terminate Hive queries from Job Browser
Unable to view or create Oozie workflows
MySQL: 1040, 'Too many connections' exception
Unable to connect Oracle database to Hue using SCAN
Increasing the maximum number of processes for Oracle database
UTF-8 codec error
ASCII codec error
Fixing authentication issues between HBase and Hue
Lengthy BalancerMember Route length
Enabling access to HBase browser from Hue
Fixing a warning related to accessing non-optimized Hue
Unable to use pip command in CDP
Hue load balancer does not start after enabling TLS
Unable to log into Hue with Knox
LDAP search fails with invalid credentials error
Disabling the web metric collection for Hue
Resolving "The user authorized on the connection does not match the session username" error
Requirements for compressing and extracting files using Hue File Browser
Resolving "You are accessing a non-optimized Hue" error
Fixing incorrect start time and duration on Hue Job Browser
▶︎
Troubleshooting Apache Sqoop
Unable to read Sqoop metastore created by an older HSQLDB version
Merge process stops during Sqoop incremental imports
Sqoop Hive import stops when HS2 does not use Kerberos authentication
▶︎
Reference
▶︎
Apache Hadoop YARN Reference
▶︎
Tuning Apache Hadoop YARN
YARN tuning overview
Step 1: Worker host configuration
Step 2: Worker host planning
Step 3: Cluster size
Steps 4 and 5: Verify settings
Step 6: Verify container settings on cluster
Step 6A: Cluster container capacity
Step 6B: Container parameters checking
Step 7: MapReduce configuration
Step 7A: MapReduce settings checking
Set properties in Cloudera Manager
Configure memory settings
YARN Configuration Properties
Use the YARN REST APIs to manage applications
▶︎
Comparison of Fair Scheduler with Capacity Scheduler
Why one scheduler?
Scheduler performance improvements
Feature comparison
Migration from Fair Scheduler to Capacity Scheduler
▶︎
Configuring and using Queue Manager REST API
Limitations
Using the REST API
Prerequisites
Start Queue
Stop Queue
Add Queue
Change Queue Capacities
Change Queue Properties
Delete Queue
▶︎
Data Access
▶︎
Apache Hive Materialized View Commands
ALTER MATERIALIZED VIEW REBUILD
ALTER MATERIALIZED VIEW REWRITE
CREATE MATERIALIZED VIEW
DESCRIBE EXTENDED and DESCRIBE FORMATTED
DROP MATERIALIZED VIEW
SHOW MATERIALIZED VIEWS
▶︎
Apache Hive Reference
▶︎
Apache Impala Reference
▶︎
Performance Considerations
Performance Best Practices
Query Join Performance
▶︎
Table and Column Statistics
Generating Table and Column Statistics
Runtime Filtering
▶︎
Partitioning
Partition Pruning for Queries
HDFS Caching
HDFS Block Skew
Understanding Performance using EXPLAIN Plan
Understanding Performance using SUMMARY Report
Understanding Performance using Query Profile
▶︎
Scalability Considerations
Scaling Limits and Guidelines
Dedicated Coordinator
▶︎
Hadoop File Formats Support
Using Text Data Files
Using Parquet Data Files
Using ORC Data Files
Using Avro Data Files
Using RCFile Data Files
Using SequenceFile Data Files
▶︎
Storage Systems Supports
Impala with HDFS
▶︎
Impala with Kudu
Configuring for Kudu Tables
▶︎
Impala DDL for Kudu
Partitioning for Kudu Tables
Impala DML for Kudu Tables
Impala with HBase
Impala with Azure Data Lake Store (ADLS)
▶︎
Impala with Amazon S3
Specifying Impala Credentials to Access S3
Ports Used by Impala
Migration Guide
Setting up Data Cache for Remote Reads
▶︎
Managing Metadata in Impala
On-demand Metadata
Automatic Invalidation of Metadata Cache
▶︎
Automatic Invalidation/Refresh of Metadata
Configuring Event Based Automatic Metadata Sync
Transactions
▶︎
Apache Impala SQL Reference
Apache Impala SQL Overview
▶︎
Schema objects
Impala aliases
Databases
Functions
Identifiers
Tables
Views
▶︎
Data types
ARRAY complex type
BIGINT data type
BOOLEAN data type
CHAR data type
DATE data type
DECIMAL data type
DOUBLE data type
FLOAT data type
INT data type
MAP complex type
REAL data type
SMALLINT data type
STRING data type
STRUCT complex type
▶︎
TIMESTAMP data type
Customizing time zones
TINYINT data type
VARCHAR data type
Complex types
Literals
Operators
Comments
▶︎
SQL statements
ROLE statements
DDL statements
DML statements
ALTER DATABASE statement
ALTER TABLE statement
ALTER VIEW statement
COMMENT statement
COMPUTE STATS statement
CREATE DATABASE statement
CREATE FUNCTION statement
CREATE ROLE statement
CREATE TABLE statement
CREATE VIEW statement
DELETE statement
DESCRIBE statement
DROP DATABASE statement
DROP FUNCTION statement
DROP ROLE statement
DROP STATS statement
DROP TABLE statement
DROP VIEW statement
EXPLAIN statement
GRANT statement
GRANT ROLE statement
INSERT statement
INVALIDATE METADATA statement
LOAD DATA statement
REFRESH statement
REFRESH AUTHORIZATION statement
REFRESH FUNCTIONS statement
REVOKE statement
REVOKE ROLE statement
▶︎
SELECT statement
Joins in Impala SELECT statements
ORDER BY clause
GROUP BY clause
HAVING clause
LIMIT clause
OFFSET clause
UNION clause
Subqueries in Impala SELECT statements
TABLESAMPLE clause
WITH clause
DISTINCT operator
SET statement
SHOW statement
SHOW ROLES statement
SHOW CURRENT ROLES statement
SHOW ROLE GRANT GROUP statement
SHUTDOWN statement
TRUNCATE TABLE statement
UPDATE statement
UPSERT statement
USE statement
VALUES statement
Optimizer hints
Query options
▶︎
Built-in functions
Mathematical functions
Bit functions
Conversion functions
Date and time functions
Conditional functions
String functions
Miscellaneous functions
▶︎
Aggregate functions
APPX_MEDIAN function
AVG function
COUNT function
GROUP_CONCAT function
MAX function
MIN function
NDV function
STDDEV, STDDEV_SAMP, STDDEV_POP functions
SUM function
VARIANCE, VARIANCE_SAMP, VARIANCE_POP, VAR_SAMP, VAR_POP functions
▶︎
Analytic functions
OVER
WINDOW
AVG
COUNT
CUME_DIST
DENSE_RANK
FIRST_VALUE
LAG
LAST_VALUE
LEAD
MAX
MIN
NTILE
PERCENT_RANK
RANK
ROW_NUMBER
SUM
▶︎
User-defined functions (UDFs)
UDF concepts
Runtime environment for UDFs
Installing the UDF development package
Writing UDFs
Writing user-defined aggregate functions (UDAFs)
Building and deploying UDFs
Performance considerations for UDFs
Examples of creating and using UDFs
Security considerations for UDFs
Limitations and restrictions for Impala UDFs
Transactions
Reserved words
Impala SQL and Hive SQL
SQL migration to Impala
▶︎
Cloudera Search solrctl Reference
solrctl Reference
Using solrctl with an HTTP proxy
▶︎
Cloudera Search Morphlines Reference
Implementing your own Custom Command
Morphline commands overview
kite-morphlines-core-stdio
kite-morphlines-core-stdlib
kite-morphlines-avro
kite-morphlines-json
kite-morphlines-hadoop-core
kite-morphlines-hadoop-parquet-avro
kite-morphlines-hadoop-rcfile
kite-morphlines-hadoop-sequencefile
kite-morphlines-maxmind
kite-morphlines-metrics-servlets
kite-morphlines-protobuf
kite-morphlines-tika-core
kite-morphlines-tika-decompress
kite-morphlines-saxon
kite-morphlines-solr-core
kite-morphlines-solr-cell
kite-morphlines-useragent
▶︎
Operational Database
▶︎
Apache Phoenix Frequently Asked Questions
Frequently asked questions
▶︎
Apache Phoenix Performance Tuning
Performance tuning
▶︎
Apache Phoenix Command Reference
Apache Phoenix SQL command reference
▶︎
Operational Database powered by Apache Accumulo Reference
Default ports of OpDB
▶︎
Apache Atlas Reference
Apache Atlas Advanced Search language reference
Apache Atlas Statistics reference
Apache Atlas metadata attributes
Defining Apache Atlas enumerations
▶︎
Purging deleted entities
Auditing purged entities
PUT /admin/purge/ API
POST /admin/audits/ API
▶︎
Apache Atlas technical metadata migration reference
System metadata migration
HDFS entity metadata migration
Hive entity metadata migration
Impala entity metadata migration
Spark entity metadata migration
AWS S3 entity metadata migration
▶︎
NiFi metadata collection
How Lineage strategy works
Understanding the data that flow into Atlas
NiFi lineage
Atlas NiFi relationships
Atlas NiFi audit entries
How the reporting task runs in a NiFi cluster
Analysing event analysis
Limitations of Atlas-NiFi integration
▶︎
HiveServer metadata collection
HiveServer actions that produce Atlas entities
HiveServer entities created in Atlas
HiveServer relationships
HiveServer lineage
HiveServer audit entries
▶︎
HBase metadata collection
HBase actions that produce Atlas entities
HBase entities created in Atlas
Hbase lineage
HBase audit entries
▶︎
Impala metadata collection
Impala actions that produce Atlas entities
Impala entities created in Atlas
Impala lineage
Impala audit entries
▶︎
Kafka metadata collection
Kafka actions that produce Atlas entities
Kafka relationships
Kafka lineage
Kafka audit entries
▶︎
Spark metadata collection
Spark actions that produce Atlas entities
Spark entities created in Apache Atlas
Spark lineage
Spark relationships
Spark audit entries
Spark troubleshooting
▶︎
Streams Messaging
▶︎
Kafka Connect Connector Reference
HDFS Sink Connector Properties Reference
Amazon S3 Sink Connector Properties Reference
Schema Registry REST API Reference
▶︎
Streams Replication Manager Reference
srm-control Options Reference
Configuration Properties Reference for Properties not Available in Cloudera Manager
Kafka credentials property reference
Streams Messaging Manager REST API Reference
Streams Replication Manager REST API Reference
Cruise Control REST API Reference
▶︎
Cloudera Manager Reference
▶︎
Cloudera Manager Configuration Properties Reference
▶︎
Cloudera Manager Configuration Properties Reference for Cloudera Runtime 7.1.7
ADLS Connector Properties in Cloudera Runtime 7.1.7
Atlas Properties in Cloudera Runtime 7.1.7
Core Configuration Properties in Cloudera Runtime 7.1.7
Data Analytics Studio Properties in Cloudera Runtime 7.1.7
Data Context Connector Properties in Cloudera Runtime 7.1.7
HBase Properties in Cloudera Runtime 7.1.7
HDFS Properties in Cloudera Runtime 7.1
Hive Properties in Cloudera Runtime 7.1.7
Hive LLAP Properties in Cloudera Runtime 7.1.7
Hive on Tez Properties in Cloudera Runtime 7.1.7
Hue Properties in Cloudera Runtime 7.1.7
Impala Properties in Cloudera Runtime 7.1.7
Java KeyStore KMS Properties in Cloudera Runtime 7.1.7
Kafka Properties in Cloudera Runtime 7.1.7
Key Trustee KMS Properties in Cloudera Runtime 7.1.7
Key Trustee Server Properties in Cloudera Runtime 7.1.7
Key-Value Store Indexer Properties in Cloudera Runtime 7.1.7
Knox Properties in Cloudera Runtime 7.1.7
Kudu Properties in Cloudera Runtime 7.1.7
Livy Properties in Cloudera Runtime 7.1.7
Livy for Spark 3 Properties in Cloudera Runtime 7.1.7
Oozie Properties in Cloudera Runtime 7.1.7
Ozone Properties in Cloudera Runtime 7.1.7
Phoenix Properties in Cloudera Runtime 7.1.7
Ranger Properties in Cloudera Runtime 7.1.7
S3 Connector Properties in Cloudera Runtime 7.1.7
Schema Registry Properties in Cloudera Runtime 7.1.7
Solr Properties in Cloudera Runtime 7.1.7
Spark Properties in Cloudera Runtime 7.1.7
Spark 3 Properties in Cloudera Runtime 7.1.7
SQOOP_CLIENT Properties in Cloudera Runtime 7.1.7
Streams Messaging Manager Properties in Cloudera Runtime 7.1.7
Streams Replication Manager Properties in Cloudera Runtime 7.1.7
Stub DFS Properties in Cloudera Runtime 7.1.7
Tez Properties in Cloudera Runtime 7.1.7
YARN Properties in Cloudera Runtime 7.1.7
YARN Queue Manager Properties in Cloudera Runtime 7.1.7
Zeppelin Properties in Cloudera Runtime 7.1.7
ZooKeeper Properties in Cloudera Runtime 7.1.7
Host Configuration Properties
Cloudera Manager Server Properties
Cloudera Management Service
▶︎
Cloudera Manager Metrics Reference
▶︎
Cloudera Manager Metrics
Accumulo Metrics
Active Database Metrics
Active Key Trustee Server Metrics
Activity Metrics
Activity Monitor - Unsupported Since 7.0.0 Metrics
Agent Metrics
Alert Publisher Metrics
Atlas Metrics
Atlas Server Metrics
Attempt Metrics
Authentication Server Metrics
Authentication Server Load Balancer Metrics
Authentication Service Metrics
Cloudera Management Service Metrics
Cloudera Manager Server Metrics
Cluster Metrics
Core Configuration Metrics
Cruise Control Metrics
Cruise Control Server Metrics
Data Analytics Studio Metrics
Data Analytics Studio Eventprocessor Metrics
Data Analytics Studio Webapp Server Metrics
Data Discovery Service Agent Metrics
DataNode Metrics
Directory Metrics
Disk Metrics
Docker Server Metrics
Ecs Agent Metrics
Ecs Server Metrics
Event Server Metrics
Failover Controller Metrics
Filesystem Metrics
Flink Metrics
Flink Dashboard Metrics
Flume Metrics
Flume Channel Metrics
Flume Sink Metrics
Flume Source Metrics
Garbage Collector Metrics
HBase Metrics
HBase REST Server Metrics
HBase RegionServer Replication Peer Metrics
HBase Thrift Server Metrics
HDFS Metrics
HDFS Cache Directive Metrics
HDFS Cache Pool Metrics
HRegion Metrics
HTable Metrics
History Server Metrics
Hive Metrics
Hive Execution Metrics
Hive LLAP Metrics
Hive Metastore Server Metrics
Hive Table Metrics
Hive on Tez Metrics
HiveServer2 Metrics
Host Metrics
Host Monitor Metrics
HttpFS Metrics
Hue Metrics
Hue Server Metrics
Impala Metrics
Impala Catalog Server Metrics
Impala Daemon Metrics
Impala Daemon Resource Pool Metrics
Impala Llama ApplicationMaster Metrics
Impala Pool Metrics
Impala Pool User Metrics
Impala Query Metrics
Impala StateStore Metrics
Isilon Metrics
Java KeyStore KMS Metrics
JobHistory Server Metrics
JobTracker Metrics
JournalNode Metrics
Kafka Metrics
Kafka Broker Metrics
Kafka Broker Log Directory Metrics
Kafka Broker Topic Metrics
Kafka Broker Topic Partition Metrics
Kafka Connect Metrics
Kafka Connect Connector Sink Task Metrics Metrics
Kafka Connect Connector Source Task Metrics Metrics
Kafka Connect Connector Task Error Metrics Metrics
Kafka Connect Connector Task Metrics Metrics
Kafka Consumer Group Metrics
Kafka MirrorMaker Metrics
Kafka Producer Metrics
Kafka Replica Metrics
Kerberos Ticket Renewer Metrics
Key Management Server Metrics
Key Management Server Proxy Metrics
Key Trustee KMS Metrics
Key Trustee Server Metrics
Key-Value Store Indexer Metrics
Knox Metrics
Knox Gateway Metrics
Knox IDBroker Metrics
Kudu Metrics
Kudu Replica Metrics
LLAP Proxy Metrics
Lily HBase Indexer Metrics
Livy Metrics
Livy Server Metrics
Livy Server for Spark 3 Metrics
Livy for Spark 3 Metrics
Load Balancer Metrics
MapReduce Metrics
Master Metrics
Materialized View Engine Metrics
Monitor Metrics
NFS Gateway Metrics
NameNode Metrics
Navigator Audit Server Metrics
Navigator HSM KMS backed by SafeNet Luna HSM Metrics
Navigator HSM KMS backed by Thales HSM Metrics
Navigator Luna KMS Metastore Metrics
Navigator Luna KMS Proxy Metrics
Navigator Metadata Server Metrics
Navigator Thales KMS Metastore Metrics
Navigator Thales KMS Proxy Metrics
Network Interface Metrics
NodeManager Metrics
Omid Metrics
Omid tso server Metrics
Oozie Metrics
Oozie Server Metrics
Ozone Metrics
Ozone DataNode Metrics
Ozone Manager Metrics
Ozone Prometheus Metrics
Ozone Recon Metrics
Passive Database Metrics
Passive Key Trustee Server Metrics
Phoenix Metrics
Profiler Admin Agent Metrics
Profiler Manager Metrics
Profiler Metrics Agent Metrics
Profiler Scheduler Metrics
Profiler Scheduler Agent Metrics
Query Processor Metrics
Query Server Metrics
Ranger Metrics
Ranger Admin Metrics
Ranger KMS Metrics
Ranger KMS Server Metrics
Ranger KMS Server with KTS Metrics
Ranger KMS with Key Trustee Server Metrics
Ranger RMS Metrics
Ranger RMS Server Metrics
Ranger Raz Metrics
Ranger Raz Server Metrics
Ranger Tagsync Metrics
Ranger Usersync Metrics
RegionServer Metrics
Reports Manager Metrics
ResourceManager Metrics
S3 Gateway Metrics
SQL Stream Builder Metrics
SRM Distributed Herder metrics Metrics
SRM Driver Metrics
SRM Service Metrics
Schema Registry Metrics
Schema Registry Server Metrics
SecondaryNameNode Metrics
Sentry Metrics
Sentry Server Metrics
Server Metrics
Service Monitor Metrics
Solr Metrics
Solr Replica Metrics
Solr Server Metrics
Solr Shard Metrics
Spark Metrics
Spark 3 Metrics
Sqoop 2 Metrics
Sqoop 2 Server Metrics
Storage Container Manager Metrics
Streaming SQL Console Metrics
Streaming SQL Engine Metrics
Streams Messaging Manager Metrics
Streams Messaging Manager Rest Admin Server Metrics
Streams Messaging Manager UI Server Metrics
Streams Replication Manager Metrics
Tablet Server Metrics
TaskTracker Metrics
Telemetry Publisher Metrics
Tez Metrics
Time Series Table Metrics
Tracer Metrics
User Metrics
WebHCat Server Metrics
YARN Metrics
YARN Pool Metrics
YARN Pool User Metrics
YARN Queue Manager Metrics
YARN Queue Manager Store Metrics
YARN Queue Manager Webapp Metrics
Zeppelin Metrics
Zeppelin Server Metrics
ZooKeeper Metrics
common.service.type.docker Metrics
common.service.type.ecs Metrics
▶︎
Cloudera Manager Health Tests Reference
▶︎
Cloudera Manager Health Tests
Active Database Health Tests
Active Key Trustee Server Health Tests
Activity Monitor - Unsupported Since 7.0.0 Health Tests
Alert Publisher Health Tests
Atlas Health Tests
Atlas Server Health Tests
Authentication Server Health Tests
Authentication Server Load Balancer Health Tests
Authentication Service Health Tests
Cloudera Management Service Health Tests
Cruise Control Health Tests
Cruise Control Server Health Tests
DOCKER Health Tests
Data Analytics Studio Eventprocessor Health Tests
Data Analytics Studio Webapp Server Health Tests
Data Discovery Service Agent Health Tests
DataNode Health Tests
Docker Server Health Tests
ECS Health Tests
Ecs Agent Health Tests
Ecs Server Health Tests
Event Server Health Tests
Failover Controller Health Tests
Flink Dashboard Health Tests
Flume Health Tests
Flume Agent Health Tests
Garbage Collector Health Tests
HBase Health Tests
HBase REST Server Health Tests
HBase Thrift Server Health Tests
HDFS Health Tests
History Server Health Tests
Hive Health Tests
Hive Execution Health Tests
Hive LLAP Health Tests
Hive Metastore Server Health Tests
Hive on Tez Health Tests
HiveServer2 Health Tests
Host Health Tests
Host Monitor Health Tests
HttpFS Health Tests
Hue Health Tests
Hue Server Health Tests
Impala Health Tests
Impala Catalog Server Health Tests
Impala Daemon Health Tests
Impala Llama ApplicationMaster Health Tests
Impala StateStore Health Tests
JobHistory Server Health Tests
JobTracker Health Tests
JournalNode Health Tests
Kafka Health Tests
Kafka Broker Health Tests
Kafka Connect Health Tests
Kafka MirrorMaker Health Tests
Kerberos Ticket Renewer Health Tests
Key Management Server Health Tests
Key Management Server Proxy Health Tests
Key-Value Store Indexer Health Tests
Knox Health Tests
Knox Gateway Health Tests
Knox IDBroker Health Tests
Kudu Health Tests
LLAP Proxy Health Tests
Lily HBase Indexer Health Tests
Livy Health Tests
Livy Server Health Tests
Livy Server for Spark 3 Health Tests
Livy for Spark 3 Health Tests
Load Balancer Health Tests
MapReduce Health Tests
Master Health Tests
Materialized View Engine Health Tests
Monitor Health Tests
NFS Gateway Health Tests
NameNode Health Tests
Navigator Audit Server Health Tests
Navigator Luna KMS Metastore Health Tests
Navigator Luna KMS Proxy Health Tests
Navigator Metadata Server Health Tests
Navigator Thales KMS Metastore Health Tests
Navigator Thales KMS Proxy Health Tests
NodeManager Health Tests
Omid Health Tests
Omid tso server Health Tests
Oozie Health Tests
Oozie Server Health Tests
Ozone Health Tests
Ozone DataNode Health Tests
Ozone Manager Health Tests
Ozone Prometheus Health Tests
Ozone Recon Health Tests
Passive Database Health Tests
Passive Key Trustee Server Health Tests
Phoenix Health Tests
Profiler Admin Agent Health Tests
Profiler Metrics Agent Health Tests
Profiler Scheduler Agent Health Tests
Query Processor Health Tests
Query Server Health Tests
Ranger Health Tests
Ranger Admin Health Tests
Ranger KMS Health Tests
Ranger KMS Server Health Tests
Ranger KMS Server with KTS Health Tests
Ranger KMS with Key Trustee Server Health Tests
Ranger RMS Health Tests
Ranger RMS Server Health Tests
Ranger Raz Health Tests
Ranger Raz Server Health Tests
Ranger Tagsync Health Tests
Ranger Usersync Health Tests
RegionServer Health Tests
Reports Manager Health Tests
ResourceManager Health Tests
S3 Gateway Health Tests
SRM Driver Health Tests
SRM Service Health Tests
Schema Registry Health Tests
Schema Registry Server Health Tests
SecondaryNameNode Health Tests
Sentry Health Tests
Sentry Server Health Tests
Service Monitor Health Tests
Solr Health Tests
Solr Server Health Tests
Spark Health Tests
Spark 3 Health Tests
Sqoop 2 Health Tests
Sqoop 2 Server Health Tests
Storage Container Manager Health Tests
Streaming SQL Console Health Tests
Streaming SQL Engine Health Tests
Streams Messaging Manager Health Tests
Streams Messaging Manager Rest Admin Server Health Tests
Streams Messaging Manager UI Server Health Tests
Streams Replication Manager Health Tests
Tablet Server Health Tests
TaskTracker Health Tests
Telemetry Publisher Health Tests
Tracer Health Tests
WebHCat Server Health Tests
YARN Health Tests
YARN Queue Manager Store Health Tests
YARN Queue Manager Webapp Health Tests
Zeppelin Health Tests
Zeppelin Server Health Tests
ZooKeeper Health Tests
ZooKeeper Server Health Tests
▶︎
Cloudera Manager Event Schema Reference
LOG_MESSAGE Category
ACTIVITY_EVENT Category
AUDIT_EVENT Category
HEALTH_CHECK Category
SYSTEM Category
HBASE Category
▶︎
Cloudera Manager Entities Reference
▶︎
Cloudera Manager Entity Types and Attributes
Cloudera Manager Entity Types
Cloudera Manager Entity Type Attributes
▶︎
Security
▶︎
Authorization
Migrating from Sentry to Ranger
Check MySQL isolation configuration
Ranger audit schema reference
Ranger database schema reference
Ranger policies allowing create privilege for Hadoop_SQL databases
Ranger policies allowing create privilege for Hadoop_SQL tables
Access required to Read/Write on Hadoop_SQL tables using SQL
Mapping Sentry permissions for Solr to Ranger policies
▶︎
Encryption
Auto-TLS Requirements and Limitations
Rotate Auto-TLS Certificate Authority and Host Certificates
Auto-TLS Agent File Locations
"Unknown Attribute Name" exception
'Type' error while accessing Hue from Knox Gateway
(Optional) Configuring the character set
(Optional) Upgrading cx_Oracle to 6.4.1
(Recommended) Enable Auto-TLS
.NET client
502 Proxy Error while accessing Hue from the Load Balancer
7.1.7
7.1.7 SP1
7.1.7 SP2
7.1.7 SP3
A List of S3A Configuration Properties
Aborting a Pending Command
About Apache Ozone integration with Apache Atlas
About Atlas High Availability
About HBase snapshots
About the Off-heap BucketCache
Access HDFS from the NFS Gateway
Access Ozone S3 Gateway using the S3A filesystem
Access required to Read/Write on Hadoop_SQL tables using SQL
Access the Recon web user interface
Access the YARN Web User Interface
Accessing Aggregate Statistics Through tsquery
Accessing Apache HBase
Accessing Atlas logs
Accessing Avro data files from Spark SQL applications
Accessing Azure Storage account container from spark-shell
Accessing Cloud Data
Accessing compressed files in Spark
Accessing data stored in Amazon S3 through Spark
Accessing external storage from Spark
Accessing Files Within an Encryption Zone
Accessing HDFS Files from Spark
Accessing Hive files in Ozone
Accessing Hive files in Ozone
Accessing Hive from Spark
Accessing ORC Data in Hive Tables
Accessing ORC files from Spark
Accessing Ozone object store with Amazon Boto3 client
Accessing Parquet files from Spark SQL applications
Accessing Spark SQL through the Spark shell
Accessing Storage Using Amazon S3
Accessing Storage Using Microsoft ADLS
Accessing the Cloudera Manager Admin Console
Accessing the Cloudera Manager Admin Console
Accessing the Cloudera Manager Admin Console
Accessing the Directory Usage Report
Accessing the License Page
Accessing the Oozie server with a browser
Accessing the Oozie server with the Oozie Client
Accessing the Ranger console
Accessing the Ranger KMS Web UI
Accessing the Spark History Server
Accessing the tracing web interface
Accessing the Web UI of a Completed Spark Application
Accessing the Web UI of a Running Spark Application
Accommodate HMS changes for Hive replication policies
Accumulo Metrics
Achieving cross-cluster availability through Hive Load Balancer failover
ACID operations
ACL examples
ACLS on HDFS features
ACLs supported by Ranger KMS and Ranger KMS Mapping
Activate read replicas on a table
Activating Hive query editor on Hue UI
Activating the Hive web UI
Active / Active Architecture
Active / Stand-by Architecture
Active Database Health Tests
Active Database Metrics
Active Directory Settings
Active Key Trustee Server Health Tests
Active Key Trustee Server Metrics
Activity Charts
Activity Metrics
Activity Monitor - Unsupported Since 7.0.0 Health Tests
Activity Monitor - Unsupported Since 7.0.0 Metrics
Activity, Application, and Query Reports
ACTIVITY_EVENT Category
Add a custom coprocessor
Add a custom descriptor to Apache Knox
Add a group
Add a new provider in an existing provider configuration
Add a new shared provider configuration
Add a role through Hive
Add a role through Ranger
Add a user
Add a ZooKeeper service
Add Accumulo on CDP service
Add Accumulo on CDP service
Add Accumulo on CDP service
Add custom service parameter to descriptor
Add custom service to existing descriptor in Apache Knox Proxy
Add HDFS system mount
Add or edit permissions
Add Queue
Add queues using YARN Queue Manager UI
Add secure Accumulo on CDP service to your cluster
Add secure Accumulo on CDP service to your cluster
Add secure Accumulo on CDP service to your cluster
Add source cluster as peer to use in replication policies
Add storage directories using Cloudera Manager
Add Streams Replication Manager to an existing cluster
Add the HttpFS role
Add the user or group to a pre-defined access policy
Add unsecure Accumulo on CDP service to your cluster
Add unsecure Accumulo on CDP service to your cluster
Add unsecure Accumulo on CDP service to your cluster
Add-on Services
Adding a Cluster Using Currently Managed Hosts
Adding a Cluster Using New Hosts
Adding a Compute Cluster and Data Context
Adding a custom banner in Hue
Adding a Filter
Adding a HiveServer role
Adding a HiveServer role
Adding a Host to a Cluster
Adding a Hue role instance with Cloudera Manager
Adding a Hue service with Cloudera Manager
Adding a load balancer
Adding a New Chart to the Custom Dashboard
Adding a new schema
Adding a policy label to a resource-based policy
Adding a Ranger security zone
Adding a Role Instance
Adding a Service
Adding a splash screen in Hue
Adding a tag-based PII policy
Adding a tag-based service
Adding an Event Filter
Adding and Configuring Record Reader and Writer Controller Services
Adding and Deleting Clusters
Adding and Removing Charts from a Dashboard
Adding and Removing Range Partitions
Adding attributes to Business Metadata
Adding attributes to classifications
Adding clusters to SRM's configuration
Adding Cruise Control as a service
Adding default service users and roles for Ranger
Adding Files to an Encryption Zone
Adding schema to Oozie using Cloudera Manager
Adding self-healing goals to Cruise Control in Cloudera Manager
Adding tag-based policies
Adding the Lily HBase Indexer Service
Adding the Oozie service using Cloudera Manager
Adding trusted realms to the cluster
Additional Configuration Options for GCS
Additional considerations when configuring TLS/SSL for Oozie HA
Additional HDFS haadmin commands to administer the cluster
Additional Security Topics
Additional Steps for Apache Ranger
Adjust the Solr replication factor for index files stored in HDFS
ADLS Connector Properties in Cloudera Runtime 7.1.7
ADLS Proxy Setup
ADLS Trash Folder Behavior
Admin ACLs
Administering Hue
Administering Ranger Reports
Administering Ranger Users, Groups, Roles, and Permissions
Administrative commands
Administrative tools for Hive Metastore integration
Admission Control and Query Queuing
Admission Control Sample Scenario
Advanced Committer Configuration
Advanced configuration for write-heavy workloads
Advanced erasure coding configuration
Advanced ORC properties
Advanced partitioning
Advantages of defining a schema for production use
Advantages of Parcels
Advantages of Separating Compute and Data Resources
After Evaluating Trial Software
After You Install
Agent Hosts
Agent Metrics
Aggregate functions
Aggregating and grouping data
Aggregation for Analytics
Alert Publisher
Alert Publisher Health Tests
Alert Publisher Metrics
Alerts
Allocating DataNode memory as storage
Allocating Hosts for Key Trustee Server and Key Trustee KMS
Already present: FS layout already exists
Alter a table
ALTER DATABASE statement
ALTER MATERIALIZED VIEW REBUILD
ALTER MATERIALIZED VIEW REWRITE
ALTER TABLE statement
ALTER VIEW statement
Amazon S3 Security
Amazon S3 Sink Connector
Amazon S3 Sink Connector Properties Reference
Analysing event analysis
Analytic functions
Analyzing Ranger RMS resources
Apache Atlas Advanced Search language reference
Apache Atlas dashboard tour
Apache Atlas metadata attributes
Apache Atlas metadata collection overview
Apache Atlas Reference
Apache Atlas Statistics reference
Apache Atlas technical metadata migration reference
Apache Hadoop HDFS Overview
Apache Hadoop YARN Overview
Apache Hadoop YARN Reference
Apache HBase Overview
Apache Hive 3 ACID transactions
Apache Hive 3 architectural overview
Apache Hive 3 tables
Apache Hive content roadmap
Apache Hive features
Apache Hive Materialized View Commands
Apache Hive Metastore Overview
Apache Hive Overview
Apache Hive Performance Tuning
Apache Hive query basics
Apache Hive Reference
Apache Hive-Kafka integration
Apache Impala Overview
Apache Impala Reference
Apache Impala SQL Overview
Apache Impala SQL Reference
Apache Kafka Overview
Apache Knox Authentication
Apache Knox Gateway Overview
Apache Knox Install Role Parameters
Apache Knox Install Role Parameters
Apache Knox Overview
Apache Kudu Background Operations
Apache Kudu Overview
Apache Kudu usage limitations
Apache Ozone Overview
Apache Phoenix and SQL
Apache Phoenix Command Reference
Apache Phoenix Frequently Asked Questions
Apache Phoenix Performance Tuning
Apache Phoenix SQL command reference
Apache Phoenix-Hive usage examples
Apache Ranger Access Control and Auditing
Apache Ranger Auditing
Apache Ranger Authorization
Apache Ranger User Management
Apache Spark executor task statistics
Apache Spark Overview
Apache Spark Overview
Apache Zeppelin Overview
API Compatibility changes in 7.1.7 SP3 for Spark
API Compatibility changes in 7.1.7 SP3 for Zookeeper
APIs for accessing HDFS
Application ACL evaluation
Application ACLs
Application logs' ACLs
Application not running message
Application reservations
Applications and permissions reference
Applying a Host Template to a Host
APPX_MEDIAN function
Architecture
Architecture
ARRAY complex type
ASCII codec error
Assign or unassign a node to a partition
Assign Roles
Assigning administrator privileges to users
Assigning superuser status to an LDAP user
Assigning terms to categories
Associate a table in a non-customized environment without Kerberos
Associate partitions with queues
Associate table in a customized Kerberos environment
Associating Business Metadata attributes with entities
Associating classifications with entities
Associating tables of a schema to a namespace
Associating terms with entities
Atlas
Atlas
Atlas
Atlas
Atlas classifications drive Ranger policies
Atlas Export and Import Operations
Atlas Health Tests
Atlas Hook for Sqoop
Atlas index repair configuration
Atlas metadata model overview
Atlas Metrics
Atlas NiFi audit entries
Atlas NiFi relationships
Atlas Properties in Cloudera Runtime 7.1.7
Atlas Server Health Tests
Atlas Server Metrics
Atlas Server Operations
Atlas Type Definitions
Attempt Metrics
Audit enhancements
Audit Operations
Audit Overview
Auditing Atlas Entities
Auditing purged entities
Audits
AUDIT_EVENT Category
Authenticating with ADLS Gen2
Authentication
Authentication
Authentication
Authentication
Authentication
Authentication and Kerberos Issues
Authentication Server Health Tests
Authentication Server Load Balancer Health Tests
Authentication Server Load Balancer Metrics
Authentication Server Metrics
Authentication Service Health Tests
Authentication Service Metrics
Authentication using Kerberos
Authentication using Knox SSO
Authentication using LDAP
Authentication using SAML
Authorization
Authorization
Authorization
Authorization Exception error
Authorizing external tables
Auto-TLS Agent File Locations
Auto-TLS Requirements and Limitations
Autoconfiguration
Automatic Invalidation of Metadata Cache
Automatic Invalidation of Metadata Cache
Automatic Invalidation/Refresh of Metadata
Automatic Invalidation/Refresh of Metadata
Automatic Logout
Automatic Logout
Automating partition discovery and repair
Automating Spark Jobs with Oozie Spark Action
AVG
AVG function
Avro
Avro
AWS S3 entity metadata migration
Back up HDFS metadata
Back up HDFS metadata using Cloudera Manager
Back up Key Trustee Server clients
Back up Key Trustee Server manually
Back up Key Trustee Server using Cloudera Manager
Back up Key Trustee Server using the ktbackup.sh script
Back up tables
Backing up a collection from HDFS
Backing up a collection from local file system
Backing up and Recovering Apache Kudu
Backing up and restoring data
Backing up Cloudera Manager databases
Backing Up Encryption Keys
Backing up HDFS metadata
Backing up Key Trustee Server and clients
Backing up NameNode metadata
Backing up the Cloudera Manager configuration
Backing up the Hue database
Backup directory structure
Backup tools
Balancer commands
Balancing data across an HDFS cluster
Balancing data across disks of a DataNode
Basic partitioning
Basics
Batch Indexing
Batch indexing into offline Solr shards
Batch indexing into online Solr servers using GoLive
Batch indexing to Solr using SparkApp framework
Before You Begin a Trial Installation
Before You Install
Before You Install
Behavioral changes in Apache HBase
Behavioral changes in Apache Hive
Behavioral changes in Apache Hive
Behavioral changes in Apache Hive
Behavioral changes in Apache Impala
Behavioral changes in Cloudera Runtime 7.1.7
Behavioral changes in Cloudera Runtime 7.1.7 SP1
Behavioral changes in Cloudera Runtime 7.1.7 SP2
Behavioral changes in Cloudera Runtime 7.1.7 SP3
Behavioral changes in Cloudera Runtime 7.1.7 SP3 CHF5
Behavioral changes in Cloudera Runtime 7.1.7 SP3 CHF6
Behavioral Changes in Cloudera Search
Behavioral Changes in Cloudera Search
Benefits and Capabilities
Benefits of centralized cache management in HDFS
Best practices for building Apache Spark applications
Best practices for performance tuning
Best practices for rack and node setup for EC
Best practices when adding new tablet servers
Best practices when using RegionServer grouping
Bidirectional replication example of two active clusters
Bidirectional Replication Flows
BIGINT data type
Bit functions
Block cache size
Block move execution
Block move scheduling
BOOLEAN data type
Bring a tablet that has lost a majority of replicas back online
Broker garbage log collection and log rotation
Broker log management
Broker migration
Broker Tuning
Brokers
Browse HBase tables
Browse HDFS directories
BucketCache IO engine
Bucketed tables in Hive
Building a Chart with Time-Series Data
Building and deploying UDFs
Building and running a Spark Streaming application
Building Cloudera Manager charts with Kafka metrics
Building reusable modules in Apache Spark applications
Building Spark Applications
Building the project and upload the JAR
Built-in functions
Bulk Write Access
Business Metadata overview
Bypass the BlockCache
Cache eviction priorities
Caching terminology
Calculating Infra Solr resource needs
Calculations for reports
Calling Hive user-defined functions (UDFs)
Calling the UDF in a query
Canary test for pyspark command
Cancelling a Query
Cannot alter compressed tables in Hue
Cannot see databases, or the query editor is missing
Cannot see the DAG ID and the Application ID
Cannot view queries of other users
Catalog operations
CDP 7.1.7 SP2 and 7.1.7 SP3 Components with API differences
CDP Private Cloud Base
CDP Private Cloud Base API Modifications and Removals
CDP Private Cloud Base Installation Guide
CDP Private Cloud Base Requirements and Supported Versions
CDP Private Cloud Base service groups and component reference
CDP Private Cloud Base Trial Download Information
CDP PVC Base - Data Engineering
CDP PVC Base - Data Warehouse
CDP PVC Base - Enterprise Essentials
CDP PVC Base - Operational Database
CDP Security Overview
CDS 3 Powered by Apache Spark
CDS 3.2.3 Maven Artifacts
CDS 3.2.3 Overview
CDS 3.2.3 Packaging, and Download
CDS 3.2.3 Requirements
Centralized cache management architecture
Certmanager Options - Using CM's GenerateCMCA API
Change master hostnames
Change Queue Capacities
Change Queue Properties
Change resource allocation mode
Change root user password
Change the HDFS trash settings in Cloudera Manager
Changed Behavior after HDFS Encryption is Enabled
Changes for a cluster
Changes for a service, role, or host
Changing a nameservice name for Highly Available HDFS using Cloudera Manager
Changing directory configuration
Changing Embedded PostgreSQL Database Passwords
Changing Hostnames
Changing Ranger audit storage location and migrating data
Changing the Anomaly Notifier Class value to self-healing
Changing the Chart Type
Changing the Configuration of a Service or Role Instance
Changing the Hive warehouse location
Changing the page logo in Hue
Changing the retention period of DAS event logs
Changing the Upgrade Domain for hosts
Channel encryption
Channel encryption
CHAR data type
CHAR data type support
Chart Properties
Charting Time-Series Data
Charts
Charts Library
Check Cluster Security Settings
Check Job History
Check Job Status
Check MySQL isolation configuration
Check trace table
Check trace table
Checking Host Heartbeats
Checking query execution
Choose the right import method
Choosing and Configuring Data Compression
Choosing and Running a Filter
Choosing and Running a Filter
Choosing Data Formats
Choosing manual TLS or Auto-TLS
Choosing the number of partitions for a topic
Choosing the Sufficient Security Level for Your Environment
Choosing Transformations to Minimize Shuffles
ClassNotFoundException: com.cloudera.kudu.hive.KuduStorageHandler
Cleaning up after failed jobs
Cleaning up old data to improve performance
Cleaning up old queries, DAG information, and reports data
Cleaning up old queries, DAG information, and reports data using Ambari
CLI commands to perform snapshot operations
CLI tool support
Client and broker compatibility across Kafka versions
Client authentication to secure Kudu clusters
Client authentication using delegation tokens
Client Configuration Files
Client connections to HiveServer
Client examples
Client examples
Closing HiveWarehouseSession operations
Cloud storage connectors overview
Cloudera Authorization
Cloudera license requirements for Replication Manager
Cloudera Logging is now available in CDP Private Cloud Base 7.1.7 SP1
Cloudera Management Service
Cloudera Management Service
Cloudera Management Service
Cloudera Management Service Health Tests
Cloudera Management Service Metrics
Cloudera Manager
Cloudera Manager
Cloudera Manager 7.11.3 Cumulative hotfix 4 (CDP Private Cloud Base 7.1.7 SP3)
Cloudera Manager 7.4.4 Release Notes
Cloudera Manager 7.6.1 Cumulative hotfix 1
Cloudera Manager 7.6.1 Cumulative hotfix 2
Cloudera Manager 7.6.1 Cumulative hotfix 3
Cloudera Manager 7.6.1 Cumulative hotfix 4
Cloudera Manager 7.6.1 Cumulative hotfix 5
Cloudera Manager 7.6.1 Cumulative hotfix 6
Cloudera Manager 7.6.1 Cumulative hotfix 7
Cloudera Manager 7.6.1 Cumulative hotfix 8
Cloudera Manager 7.6.1 Cumulative hotfix 9
Cloudera Manager 7.6.1 Release Notes (CDP Private Cloud Base 7.1.7 SP1)
Cloudera Manager 7.6.7 Cumulative hotfix 1
Cloudera Manager 7.6.7 Cumulative hotfix 10
Cloudera Manager 7.6.7 Cumulative hotfix 11
Cloudera Manager 7.6.7 Cumulative hotfix 12
Cloudera Manager 7.6.7 Cumulative hotfix 13
Cloudera Manager 7.6.7 Cumulative hotfix 2
Cloudera Manager 7.6.7 Cumulative hotfix 3
Cloudera Manager 7.6.7 Cumulative hotfix 4
Cloudera Manager 7.6.7 Cumulative hotfix 5
Cloudera Manager 7.6.7 Cumulative hotfix 6
Cloudera Manager 7.6.7 Cumulative hotfix 7
Cloudera Manager 7.6.7 Cumulative hotfix 8
Cloudera Manager 7.6.7 Cumulative hotfix 9
Cloudera Manager 7.6.7 Release Notes (CDP Private Cloud Base 7.1.7 SP2)
Cloudera Manager Admin Console
Cloudera Manager Agents
Cloudera Manager Agents
Cloudera Manager API
Cloudera Manager Configuration Properties Reference
Cloudera Manager Configuration Properties Reference for Cloudera Runtime 7.1.7
Cloudera Manager Download Information
Cloudera Manager Entities Reference
Cloudera Manager Entity Type Attributes
Cloudera Manager Entity Types
Cloudera Manager Entity Types and Attributes
Cloudera Manager Event Schema Reference
Cloudera Manager Health Tests
Cloudera Manager Health Tests Reference
Cloudera Manager Metrics
Cloudera Manager metrics overview
Cloudera Manager Metrics Reference
Cloudera Manager Overview
Cloudera Manager Release Notes
Cloudera Manager Server
Cloudera Manager Server Metrics
Cloudera Manager Server Properties
Cloudera Manager sudo command options
Cloudera Manager support for Cloudera Runtime and CDH
Cloudera Manager Trigger Use Cases
Cloudera Manager user accounts
Cloudera Manager User Roles
Cloudera Manager Version Information
Cloudera Navigator Key HSM Overview
Cloudera Navigator Key Trustee Server
Cloudera Navigator Key Trustee Server Overview
Cloudera Runtime
Cloudera Runtime 7.1.7 SP1 component versions
Cloudera Runtime 7.1.7 SP2 component versions
Cloudera Runtime 7.1.7 SP3 component versions
Cloudera Runtime component versions
Cloudera Runtime Download Information
Cloudera Runtime Release Notes
Cloudera Runtime Version Information
Cloudera Search and CDP
Cloudera Search architecture
Cloudera Search authentication
Cloudera Search config templates
Cloudera Search configuration and log files
Cloudera Search configuration files
Cloudera Search configuration files
Cloudera Search ETL
Cloudera Search log files
Cloudera Search Morphlines Reference
Cloudera Search Overview
Cloudera Search security aspects
Cloudera Search solrctl Reference
Cloudera Search tasks and processes
Cluster balancing algorithm
Cluster Configuration Overview
Cluster Lifecycle Management with Cloudera Manager
Cluster management limitations
Cluster management limitations
Cluster Metrics
Cluster Migration Architectures
Cluster sizing
Cluster Support Tokens using Cloudera Manager
Cluster Utilization Report overview
Cluster-Wide Configuration
Collecting metrics through HTTP
Column compression
Column design
Column encoding
Command Details
Command Details
Command Line Tools
Commands
Commands for configuring storage policies
Commands for managing buckets
Commands for managing keys
Commands for managing volumes
Commands for using cache pools and directives
COMMENT statement
Comments
Committing a transaction for Direct Reader
Common replication topologies
Common web interface pages
common.service.type.docker Metrics
common.service.type.ecs Metrics
Communication encryption
Compacting on-disk data
Compaction prerequisites
Compaction tasks
Compactor properties
Compare queries
Comparing Configurations for a Service Between Clusters
Comparing replication and erasure coding
Comparing Similar Activities
Comparing tables using ANY/SOME/ALL
Comparison of Fair Scheduler with Capacity Scheduler
Compatibility Considerations for Virtual Private Clusters
Compatibility policies
Completed Hue query shows executing on CM
Complex types
Component types and metrics for alert policies
Components
Components
Compose queries
Compound operators
COMPUTE STATS statement
Conditional functions
Configuration
Configuration Example
Configuration example for writing data to HDFS
Configuration example for writing data to Ozone FS
Configuration examples
Configuration for enabling mTLS in Ozone
Configuration options for Spark to work with o3fs
Configuration options to store Hive managed tables on Ozone
Configuration parameters migrated to Core Settings Service
Configuration properties
Configuration Properties Reference for Properties not Available in Cloudera Manager
Configuration to expose buckets under non-default volumes
Configurations and CLI options for the HDFS Balancer
Configure a resource-based policy: Atlas
Configure a resource-based policy: HadoopSQL
Configure a resource-based policy: HBase
Configure a resource-based policy: HDFS
Configure a resource-based policy: Kafka
Configure a resource-based policy: Knox
Configure a resource-based policy: NiFi
Configure a resource-based policy: NiFi Registry
Configure a resource-based policy: Solr
Configure a resource-based policy: YARN
Configure a resource-based service: Atlas
Configure a resource-based service: HadoopSQL
Configure a resource-based service: HBase
Configure a resource-based service: HDFS
Configure a resource-based service: Kafka
Configure a resource-based service: Knox
Configure a resource-based service: NiFi
Configure a resource-based service: NiFi Registry
Configure a resource-based service: Solr
Configure a resource-based service: YARN
Configure a resource-based storage handler policy: HadoopSQL
Configure a Spark job for dynamic resource allocation
Configure Access to GCS from Your Cluster
Configure Antivirus Software on CDP Hosts
Configure Apache Knox authentication for AD/LDAP
Configure Apache Knox authentication for PAM
Configure Apache Knox authentication for SAML
Configure archival storage
Configure Atlas authentication for AD
Configure Atlas authentication for LDAP
Configure Atlas file-based authentication
Configure Atlas PAM authentication
Configure Authentication for Amazon S3
Configure authentication using Active Directory
Configure authentication using an external program
Configure authentication using an LDAP-compliant identity service
Configure authentication using Kerberos (SPNEGO)
Configure authentication using SAML
Configure AWS Credentials
Configure Browser-based Interfaces to Require Authentication (SPNEGO)
Configure Browsers for Kerberos Authentication (SPNEGO)
Configure BucketCache IO engine
Configure bulk load replication
Configure clients on a producer or consumer level
Configure clients on an application level
Configure Cloudera Manager for FIPS
Configure cluster capacity with queues
Configure Cluster to Use Kerberos Authentication
Configure columns to store MOBs
Configure CPU scheduling and isolation
Configure Cross-Origin Support for YARN UIs and REST APIs
Configure data locality
Configure DataNode memory as storage
Configure Debug Delay
Configure Docker
Configure dynamic queue properties
Configure Encryption for Amazon S3
Configure encryption in HBase
Configure four-letter-word commands in ZooKeeper
Configure FPGA scheduling and isolation
Configure GPU scheduling and isolation
Configure HBase for use with Phoenix
Configure HBase garbage collection
Configure HBase in Cloudera Manager to store snapshots in Amazon S3
Configure HBase servers to authenticate with a secure HDFS cluster
Configure HBase-Spark connector using Cloudera Manager
Configure HDFS RPC protection
Configure High Availability for Hive-HDFS ACL Sync
Configure High Availability for Ranger KMS with DB
Configure High Availability for Ranger KMS with KTS
Configure Hive-HDFS ACL Sync
Configure HMS properties for authorization
Configure HSTS for HBase Web UIs
Configure JMX ephemeral ports
Configure Kafka brokers
Configure Kafka brokers
Configure Kafka brokers
Configure Kafka brokers
Configure Kafka brokers
Configure Kafka brokers
Configure Kafka clients
Configure Kafka clients
Configure Kafka clients
Configure Kafka clients
Configure Kafka clients
Configure Kafka clients
Configure Kafka MirrorMaker
Configure Kafka MirrorMaker
Configure Kerberos authentication for Apache Atlas
Configure Kerberos authentication for Apache Ranger
Configure Kerberos authentication for Solr
Configure Kerberos Authentication for the Kafka Connect role
Configure Kudu processes
Configure Lily HBase Indexer Service to Use Kerberos Authentication
Configure Lily HBase Indexer to use TLS/SSL
Configure Lily HBase Indexer to use TLS/SSL
Configure Log Aggregation
Configure memory settings
Configure mountable HDFS
Configure Network Names
Configure NodeManager heartbeat
Configure Oozie client when TLS/SSL is enabled
Configure Partitions
Configure Per Queue Properties
Configure Phoenix-Hive connector
Configure PostgreSQL as the backend database for Hue
Configure PostgreSQL for Streaming Components
Configure preemption
Configure queue ordering policies
Configure Ranger Admin High Availability
Configure Ranger Admin High Availability with a Load Balancer
Configure Ranger authentication for AD
Configure Ranger authentication for LDAP
Configure Ranger authentication for PAM
Configure Ranger authentication for UNIX
Configure Ranger authorization for Infra Solr
Configure Ranger Usersync for Deleted Users and Groups
Configure Ranger Usersync for invalid usernames
Configure Ranger with SSL/TLS enabled PostgreSQL Database
Configure read replicas using Cloudera Manager
Configure RegionServer grouping
Configure S3 credentials for working with Ozone
Configure Scheduler Properties at the Global Level
Configure secure HBase replication
Configure secure HBase replication
Configure secure replication
Configure session timeout for Ranger Admin Web UI
Configure snapshots
Configure source and destination realms in krb5.conf
Configure SRM for Failover and Failback
Configure storage balancing for DataNodes using Cloudera Manager
Configure the blocksize for a column family
Configure the Cluster Utilization Report
Configure the compaction speed using Cloudera Manager
Configure the dynamic resource pool used for exporting and importing snapshots in Amazon S3
Configure the graceful shutdown timeout property
Configure the HBase canary
Configure the HBase client TGT renewal period
Configure the HBase thrift server role
Configure the MOB cache using Cloudera Manager
Configure the NFS Gateway
Configure the off-heap BucketCache using Cloudera Manager
Configure the off-heap BucketCache using the command line
Configure the PostgreSQL server
Configure the resource-based Ranger service used for authorization
Configure the scanner heartbeat using Cloudera Manager
Configure the storage policy for WALs using Cloudera Manager
Configure the storage policy for WALs using the Command Line
Configure TLS encryption manually for Phoenix Query Server
Configure TLS encryption manually for Phoenix Query Server
Configure TLS Encryption Manually for Schema Registry
Configure TLS/SSL encryption for Solr
Configure TLS/SSL encryption for Solr
Configure TLS/SSL Encryption for the Kafka Connect Role
Configure TLS/SSL encryption manually for Apache Ranger
Configure TLS/SSL encryption manually for Apache Ranger
Configure TLS/SSL encryption manually for Ranger KMS
Configure TLS/SSL encryption manually for Ranger KMS
Configure TLS/SSL encryption manually for Ranger RMS
Configure TLS/SSL encryption manually for Ranger RMS
Configure TLS/SSL for Core Hadoop Services
Configure TLS/SSL for HBase REST Server
Configure TLS/SSL for HBase Thrift Server
Configure TLS/SSL for HBase Web UIs
Configure TLS/SSL for HDFS
Configure TLS/SSL for Oozie
Configure TLS/SSL for Oozie
Configure TLS/SSL for YARN
Configure transaction support
Configure ulimit for HBase using Cloudera Manager
Configure ulimit using Pluggable Authentication Modules using the Command Line
Configure User Impersonation for Access to Hive
Configure User Impersonation for Access to Phoenix
Configure Usersync assignment of Admin users
Configure work preserving recovery on NodeManager
Configure work preserving recovery on ResourceManager
Configure YARN for managing Docker containers
Configure YARN ResourceManager high availability
Configure YARN Security for Long-Running Applications
Configure YARN Services API to Manage Long-running Applications
Configure YARN Services using Cloudera Manager
Configure ZooKeeper client shell for Kerberos authentication
Configure ZooKeeper server for Kerberos authentication
Configure Zookeeper TLS/SSL support for Kafka
Configure Zookeeper TLS/SSL support for Kafka
Configure ZooKeeper TLS/SSL using Cloudera Manager
Configure ZooKeeper TLS/SSL using Cloudera Manager
Configuring /tmp directory for cluster hosts
Configuring a database for Ranger or Ranger KMS
Configuring a dedicated MIT KDC for cross-realm trust
Configuring a Local Package Repository
Configuring a Local Parcel Repository
Configuring a Mail Transfer Agent for Key Trustee Server
Configuring a PostgreSQL Database for Ranger or Ranger KMS
Configuring a Proxy Server
Configuring a Ranger audit filter policy
Configuring a Ranger or Ranger KMS Database: MySQL/MariaDB
Configuring a Ranger or Ranger KMS Database: Oracle
Configuring a Ranger or Ranger KMS Database: Oracle using /ServiceName format
Configuring a Secure Credential Storage Provider for Cloudera Manager (Technical Preview)
Configuring a secure Kudu cluster using Cloudera Manager
Configuring Access to Azure on CDP Public Cloud
Configuring Access to Azure on Cloudera Private Cloud Base
Configuring Access to Google Cloud Storage
Configuring access to Hive on YARN
Configuring Access to S3
Configuring Access to S3 on CDP Public Cloud
Configuring Access to S3 on Cloudera Private Cloud Base
Configuring ACLs on HDFS
Configuring Advanced Security Options for Apache Ranger
Configuring Alert Delivery
Configuring Alert Email Delivery
Configuring Alert SNMP Delivery
Configuring Alerts Transitioning Out of Alerting Health Threshold
Configuring an external database for Oozie
Configuring and Managing S3Guard
Configuring and Monitoring Atlas
Configuring and running the HDFS balancer using Cloudera Manager
Configuring and Starting the PostgreSQL Server
Configuring and tuning S3A block upload
Configuring and Using Hive-HDFS ACL Sync
Configuring and using Queue Manager REST API
Configuring and Using Ranger KMS
Configuring and Using Zeppelin Interpreters
Configuring Apache Hadoop YARN High Availability
Configuring Apache Hadoop YARN Log Aggregation
Configuring Apache Hadoop YARN Security
Configuring Apache HBase
Configuring Apache HBase for Apache Phoenix
Configuring Apache HBase High Availability
Configuring Apache Hive
Configuring Apache Impala
Configuring Apache Kafka
Configuring Apache Kudu
Configuring Apache Ranger High Availability
Configuring Apache Spark
Configuring Apache Zeppelin
Configuring Apache ZooKeeper
Configuring Atlas Authentication
Configuring Atlas Authorization
Configuring Atlas Authorization using Ranger
Configuring Atlas using Cloudera Manager
Configuring authentication for long-running Spark Streaming jobs
Configuring Authentication in Cloudera Manager
Configuring Authentication in Cloudera Manager
Configuring authentication with LDAP and Direct Bind
Configuring authentication with LDAP and Search Bind
Configuring Authorization
Configuring auto split policy in an HBase table
Configuring automatic group offset synchronization
Configuring block size
Configuring Built-in TLS Acceleration
Configuring capacity estimations
Configuring CDP Services for HDFS Encryption
Configuring Client Access to Impala
Configuring Cloudera Manager
Configuring Cloudera Manager Agents
Configuring Cloudera Manager Server Ports
Configuring Cloudera Manager to Use an Internal Remote Parcel Repository
Configuring Clusters
Configuring coarse-grained authorization with ACLs
Configuring collection of Cloudera Manager table data
Configuring compaction in Cloudera Manager
Configuring compaction using table properties
Configuring concurrent moves
Configuring Cruise Control
Configuring Custom Alert Scripts
Configuring Custom Cgroups
Configuring custom Kerberos principal for Apache Flink
Configuring custom Kerberos principal for Atlas
Configuring custom Kerberos principal for Cruise Control
Configuring custom Kerberos principal for Cruise Control
Configuring custom Kerberos principal for HBase
Configuring custom Kerberos principal for HDFS
Configuring custom Kerberos principal for Hive and Hive-on-Tez
Configuring custom Kerberos principal for HttpFS
Configuring custom Kerberos principal for Hue
Configuring custom Kerberos principal for Kafka
Configuring custom Kerberos principal for Kafka
Configuring custom Kerberos principal for Knox
Configuring custom Kerberos principal for Kudu
Configuring custom Kerberos principal for Kudu
Configuring custom Kerberos principal for Livy
Configuring custom Kerberos principal for NiFi and NiFi Registry
Configuring custom Kerberos principal for Omid
Configuring custom Kerberos principal for Oozie
Configuring custom Kerberos principal for Oozie
Configuring custom Kerberos principal for Ozone
Configuring custom Kerberos principal for Ozone
Configuring custom Kerberos principal for Phoenix
Configuring custom Kerberos principal for Schema Registry
Configuring custom Kerberos principal for Schema Registry
Configuring custom Kerberos principal for Spark
Configuring custom Kerberos principal for SQL Stream Builder
Configuring custom Kerberos principal for Streams Messaging Manager
Configuring custom Kerberos principal for Streams Replication Manager
Configuring custom Kerberos principal for Streams Replication Manager
Configuring custom Kerberos principal for Zeppelin
Configuring custom Kerberos principal for ZooKeeper
Configuring custom Kerberos principals and custom system users for Solr
Configuring custom Kerberos principals and custom system users for Solr
Configuring custom Kerberos principals and custom system users for Solr
Configuring Dashboards
Configuring Data Protection
Configuring Dedicated Coordinators and Executors
Configuring dedicated Impala coordinator
Configuring Delegation for Clients
Configuring Directories for Intermediate Data
Configuring Directory Monitoring
Configuring dynamic resource allocation
Configuring Dynamic Resource Pool
Configuring Encryption for Specific Buckets
Configuring Event Based Automatic Metadata Sync
Configuring Event Based Automatic Metadata Sync
Configuring external authentication and authorization for Cloudera Manager
Configuring Fault Tolerance
Configuring file and directory permissions for Hue
Configuring for HDFS high availability
Configuring for Kudu Tables
Configuring goals
Configuring group permissions
Configuring HBase BlockCache
Configuring HBase Hive integration
Configuring HBase MultiWAL
Configuring HBase snapshots
Configuring HBase to use HDFS HA
Configuring HBase-Spark connector when both are on same cluster
Configuring HBase-Spark connector when HBase is on remote cluster
Configuring HDFS ACLs
Configuring HDFS High Availability
Configuring HDFS properties to optimize log collection
Configuring HDFS trash
Configuring Health Monitoring
Configuring heap size to replicate large directories using replication policies
Configuring heterogeneous storage in HDFS
Configuring high availability for Hue
Configuring Hive access for S3A
Configuring Hive and Impala for high availability with Hue
Configuring Hive to use with HBase
Configuring HiveServer for ETL using YARN queues
Configuring HiveServer high availability using a load balancer
Configuring HiveServer high availability using Dynamic Service Discovery
Configuring HMS for high availability
Configuring Host Monitor Data Storage
Configuring Host Monitoring
Configuring Hosts
Configuring Hosts to Use the Internal Repository
Configuring HSTS for HDFS Web UIs
Configuring HSTS for Spark
Configuring HTTPS encryption
Configuring https endpoints in Ozone S3 Gateway to work with AWS CLI
Configuring Hue as a TLS/SSL client
Configuring Hue as a TLS/SSL client
Configuring Hue as a TLS/SSL server
Configuring Hue as a TLS/SSL server
Configuring Impala
Configuring Impala access for S3A
Configuring Impala Query Data Store Maximum Size
Configuring Impala Query Monitoring
Configuring Impala Query Monitoring
Configuring Impala TLS/SSL
Configuring Impala TLS/SSL
Configuring Impala to work with HDFS HA
Configuring Impala Web UI
Configuring Impyla for Impala
Configuring Infra Solr
Configuring JDBC for Impala
Configuring JVM options and system properties for Ranger services
Configuring Kafka ZooKeeper chroot
Configuring Kerberos Authentication
Configuring Kerberos Authentication
Configuring Kerberos authentication in Apache Knox shared providers
Configuring Kerberos properties
Configuring Key Trustee Server High Availability Using Cloudera Manager
Configuring LDAP Authentication
Configuring LDAP Group Mappings
Configuring LDAP on unmanaged clusters
Configuring legacy CREATE TABLE behavior
Configuring Lily HBase Indexer Security
Configuring Livy
Configuring Load Balancer for Impala
Configuring Local Package and Parcel Repositories
Configuring Log Alerts
Configuring Log Alerts
Configuring Log Directories
Configuring Log Events
Configuring log levels for command line tools
Configuring Logging Thresholds
Configuring Logs
Configuring Management Service Database Limits
Configuring MariaDB as the backend database for Hue
Configuring MariaDB for Oozie
Configuring MariaDB server
Configuring Maximum File Descriptors
Configuring Memory Allocations
Configuring metastore database properties
Configuring metastore location
Configuring Monitoring Settings
Configuring multiple listeners
Configuring multiple listeners
Configuring MultiWAL support using Cloudera Manager
Configuring MySQL 5 for Oozie
Configuring MySQL 8 for Oozie
Configuring MySQL as the backend database for Hue
Configuring MySQL for Streaming Components
Configuring MySQL server
Configuring Network Settings for a Proxy Server
Configuring Nginx for basic authentication
Configuring OAuth in Data Hub
Configuring OAuth with core-site.xml
Configuring OAuth with the Hadoop CredentialProvider
Configuring ODBC for Impala
Configuring Oozie data purge settings using Cloudera Manager
Configuring Oozie High Availability using Cloudera Manager
Configuring Oozie Sqoop1 Action workflow JDBC drivers
Configuring Oozie to enable MapReduce jobs to read or write from Amazon S3
Configuring oozie to use HDFS HA
Configuring Oozie to use HDFS HA
Configuring Oracle as backend database for Hue
Configuring Oracle for Oozie
Configuring Oracle for Streaming Components
Configuring other CDP components to use HDFS HA
Configuring Ozone
Configuring Ozone Security
Configuring Ozone to work as a pure object store
Configuring Ozone to work with Prometheus
Configuring PAM authentication using Apache Knox
Configuring PAM authentication with LDAP and SSSD
Configuring PAM authentication with Linux users
Configuring partitions for transactions
Configuring Per-Bucket Settings
Configuring Per-Bucket Settings to Access Data Around the World
Configuring Periodic Stacks Collection
Configuring Phoenix-Spark connector when both are on same cluster
Configuring Phoenix-Spark connector when Phoenix is on remote cluster
Configuring PostgreSQL for Oozie
Configuring properties for non-Kerberos authentication mechanisms
Configuring properties not exposed in Cloudera Manager
Configuring Proxy Users to Access HDFS
Configuring queue mapping to use the user name from the application tag using Cloudera Manager
Configuring queue mapping to use the user name from the application tag using Cloudera Manager
Configuring quotas
Configuring Ranger audit properties for HDFS
Configuring Ranger audit properties for Solr
Configuring Ranger audits to show actual client IP address
Configuring Ranger Authentication with UNIX, LDAP, AD, or PAM
Configuring Ranger Authentication with UNIX, LDAP, or AD
Configuring Ranger authorization
Configuring Ranger Authorization for Atlas
Configuring Ranger KMS High Availability
Configuring replication specific REST servers
Configuring replications
Configuring Resource Parameters
Configuring resource-based policies
Configuring resource-based services
Configuring Roles to Use a Custom Garbage Collection Parameter
Configuring S3Guard for Cluster Access to S3
Configuring SAML authentication on managed clusters
Configuring Schema Registry instance in NiFi
Configuring secure access between Solr and Hue
Configuring security for Storage Container Managers in High Availability
Configuring Service Monitor Data Storage
Configuring Service Monitoring
Configuring Services to Use LZO Compression
Configuring Simple Authorization in Atlas
Configuring SMM for basic authentication
Configuring SMM for monitoring Kafka cluster replications
Configuring SMM to recognize Prometheus's TLS certificate
Configuring Spark access for S3A
Configuring Spark application logging properties
Configuring Spark application properties in spark-defaults.conf
Configuring Spark Applications
Configuring Spark on YARN Applications
Configuring SRM Driver for performance tuning
Configuring srm-control
Configuring SSL/TLS certificate exchange between two Cloudera Manager instances
Configuring storage balancing for DataNodes
Configuring Streams Messaging Manager for Kafka Connect
Configuring Streams Replication Manager
Configuring Suppression of Health Tests Before Tests Run
Configuring tablet servers
Configuring temporary table storage
Configuring the ABFS Connector
Configuring the Atlas hook in Kafka
Configuring the balancer threshold
Configuring the compaction check interval
Configuring the Database for Streaming Components
Configuring the driver role target clusters
Configuring the Frequency of Diagnostic Data Collection
Configuring the Hive Delegation Token Store
Configuring the Hive Metastore to use HDFS HA
Configuring the HiveServer load balancer
Configuring the Hue Server to Store Data in the Oracle database
Configuring the Kafka Connect Role
Configuring the Kudu master
Configuring the Livy Thrift Server
Configuring the number of objects displayed in Hue
Configuring the number of storage container copies for a DataNode
Configuring the Ozone trash checkpoint values
Configuring the resource capacity of root queue
Configuring the server work directory path for a Ranger service
Configuring the service role target cluster
Configuring the SRM client's secure storage
Configuring the storage policy for the Write-Ahead Log (WAL)
Configuring Time-Series Query Results
Configuring timezone for Hue
Configuring TLS Encryption for Cloudera Manager Using Auto-TLS
Configuring TLS encryption manually for Apache Atlas
Configuring TLS encryption manually for Apache Atlas
Configuring TLS encryption manually for Schema Registry
Configuring TLS/SSL encryption
Configuring TLS/SSL encryption for Kudu using Cloudera Manager
Configuring TLS/SSL encryption for Kudu using Cloudera Manager
Configuring TLS/SSL encryption manually for Apache Knox
Configuring TLS/SSL encryption manually for CDP Services
Configuring TLS/SSL encryption manually for DAS using Cloudera Manager
Configuring TLS/SSL encryption manually for DAS using Cloudera Manager
Configuring TLS/SSL encryption manually for Key Trustee Server
Configuring TLS/SSL encryption manually for Livy
Configuring TLS/SSL encryption manually for NiFi and NiFi Registry
Configuring TLS/SSL encryption manually for Ozone
Configuring TLS/SSL encryption manually for Spark
Configuring TLS/SSL encryption manually for Zeppelin
Configuring TLS/SSL for Core Hadoop Services
Configuring TLS/SSL for HBase
Configuring TLS/SSL for HBase
Configuring TLS/SSL for HBase REST Server
Configuring TLS/SSL for HBase Thrift Server
Configuring TLS/SSL for HBase Web UIs
Configuring TLS/SSL for HDFS
Configuring TLS/SSL for Hue
Configuring TLS/SSL for Hue
Configuring TLS/SSL for the KMS
Configuring TLS/SSL for YARN
Configuring TLS/SSL manually
Configuring TLS/SSL properties
Configuring TLSv1.2-enforced MySQL server
Configuring Transparent Data Encryption for Ozone
Configuring ulimit for HBase
Configuring Upgrade Domains
Configuring Upgrade Domains
Configuring user authentication
Configuring user authentication using LDAP
Configuring user authentication using SPNEGO
Configuring Which Log Messages Become Events
Configuring YARN Application Monitoring
Configuring YARN Application Monitoring
Configuring YARN Docker Containers Support
Configuring Zeppelin caching
Confirm the election status of a ZooKeeper service
Connect to Phoenix Query Server
Connect to Phoenix Query Server through Apache Knox
Connect workers
Connecting Hive to BI tools using a JDBC/ODBC driver
Connecting KeySecure HSM to CipherTrust Manager after migration from Key Secure HSM
Connecting to an Apache Hive endpoint through Apache Knox
Connecting to Impala Daemon in Impala Shell
Connecting to PQS using JDBC
Connecting to the Apache Livy Thrift Server
Connection failed error when accessing the Search app (Solr) from Hue
Connectors
Connectors
Considerations for backfill inserts
Considerations for configuring High Availability on Storage Container Manager
Considerations for configuring High Availability on the Ozone Manager
Considerations for enabling SCM HA security
Considerations for Knox
Considerations for Oozie to work with AWS
Considerations for realm names to use for replication
Considerations for working with HDFS snapshots
ContainerExecutor Error Codes (YARN)
Contents of the BlockCache
Control access to queues using ACLs
Controlling Data Access with Tags
Conversion functions
Convert DER, JKS, PEM Files for TLS/SSL Artifacts
Converting a managed non-transactional table to external
Converting a queue to a Managed Parent Queue
Converting an HDFS file to ORC
Converting from an NFS-mounted shared edits directory to Quorum-Based Storage
Converting from Device Names to UUIDs for Encrypted Devices
Converting Hive CLI scripts to Beeline
Converting instance directories to configs
Copy sample tweets to HDFS
Copying data between a secure and an insecure cluster using DistCp and WebHDFS
Copying data with Hadoop DistCp
Core Configuration Metrics
Core Configuration Properties in Cloudera Runtime 7.1.7
Core Settings Service
Corruption: checksum error on CFile block
COUNT
COUNT function
Create a bucket
Create a collection for tweets
Create a Collection in Cloudera Search
Create a Collection in Cloudera Search
Create a Custom Access Policy
Create a Custom Role
Create a custom YARN service
Create a GCP Service Account
Create a Hadoop archive
Create a Hive authorizer URL policy
Create a Kafka Topic to Store your Events
Create a new Kudu table from Impala
Create a read-only Admin user (Auditor)
Create a snapshot policy
Create a standard YARN service
Create a Streams Cluster on CDP Private Cloud Base
Create a table in Hive
Create a test collection
Create a time-bound policy
Create a topology map
Create a topology script
Create a user-defined function
Create and Run a Note
CREATE DATABASE statement
Create empty table on the destination cluster
CREATE FUNCTION statement
Create indexer Maven project
CREATE MATERIALIZED VIEW
Create new YARN services using UI
Create partitions
Create placement rules
CREATE ROLE statement
Create snapshots on a directory
Create snapshots using Cloudera Manager
CREATE TABLE statement
CREATE VIEW statement
Creating a connector using Kafka Connect in SMM
Creating a CRUD transactional table
Creating a Custom Cluster Utilization Report
Creating a Dashboard
Creating a default directory for managed tables
Creating a group in Hue
Creating a Hive external table replication policy
Creating a Host Template
Creating a Hue user
Creating a JAAS configuration file
Creating a Kafka topic
Creating a Lily HBase Indexer Configuration File
Creating a Lily HBase Indexer Configuration File
Creating a Morphline Configuration File
Creating a Morphline Configuration File
Creating a new Dynamic Configuration
Creating a notifier
Creating a Permanent Internal Repository
Creating a Pre-Deployed Cloudera Manager Host
Creating a Pre-Deployed Worker Host
Creating a replica of an existing shard
Creating a Role Group
Creating a Runtime Cluster Using a Cloudera Manager Template
Creating a Solr collection
Creating a Sqoop import command
Creating a table for a Kafka stream
Creating a Temporary Internal Repository
Creating a temporary table
Creating a trace user in unsecure Accumulo deployment
Creating a Trigger for CPU Capacity
Creating a Trigger for Memory Capacity
Creating a Trigger Using the Expression Editor
Creating a truststore file in PEM format
Creating a truststore file in PEM format
Creating a view from Spark
Creating an alert policy
Creating an insert-only transactional table
Creating an Ozone-based external table
Creating and managing snapshot policies
Creating and using a materialized view
Creating and using a partitioned materialized view
Creating Business Metadata
Creating categories
Creating classifications
Creating Encryption Zones
Creating glossaries
Creating HDFS replication policy to replicate HDFS data
Creating Hue Schema in Oracle database
Creating labels
Creating partitions dynamically
Creating Static Pools
Creating system tables to run query on Hive and Tez DAG events
Creating tables
Creating terms
Creating the Hue database
Creating the Hue database
Creating the tables and view
Creating the Template
Creating the UDF class
Creating trace user in unsecure OpDB deployment
Creating Triggers from Charts
Creating Virtual Images of Cluster Hosts
Creating, using, and dropping an external table
Cross Data Center Replication
Cross data center replication example of multiple clusters
Cruise Control
Cruise Control
Cruise Control
Cruise Control
Cruise Control Health Tests
Cruise Control Metrics
Cruise Control Overview
Cruise Control REST API endpoints
Cruise Control Server Health Tests
Cruise Control Server Metrics
CUME_DIST
Cumulative hotfix CDP Private Cloud Base 7.1.7.3008-2 (SP3 Cumulative hotfix1)
Cumulative hotfix CDP Private Cloud Base 7.1.7.3010-1 (SP3 Cumulative hotfix2)
Cumulative hotfix CDP Private Cloud Base 7.1.7.3011-1 (SP3 Cumulative hotfix3)
Cumulative hotfix CDP Private Cloud Base 7.1.7.3013-1 (SP3 Cumulative hotfix4)
Cumulative hotfix CDP Private Cloud Base 7.1.7.3014-1 (SP3 Cumulative hotfix5)
Cumulative hotfix CDP Private Cloud Base 7.1.7.3016-1 (SP3 Cumulative hotfix6)
Cumulative hotfix CDP PvC Base 7.1.7.2002-1 (SP2 cumulative hotfix1)
Cumulative hotfix CDP PvC Base 7.1.7.2009-1 (SP2 cumulative hotfix2)
Cumulative hotfix CDP PvC Base 7.1.7.2010-1 (SP2 cumulative hotfix3)
Cumulative hotfix CDP PvC Base 7.1.7.2011-1 (SP2 cumulative hotfix4)
Cumulative hotfix CDP PvC Base 7.1.7.2013-1 (SP2 cumulative hotfix5)
Cumulative hotfix CDP PvC Base 7.1.7.2016-1 (SP2 cumulative hotfix6)
Cumulative hotfix CDP PvC Base 7.1.7.2021-1 (SP2 cumulative hotfix7)
Cumulative hotfix CDP PvC Base 7.1.7.2023-1 (SP2 cumulative hotfix8)
Cumulative hotfix CDP PvC Base 7.1.7.2024-1 (SP2 cumulative hotfix9)
Cumulative hotfix CDP PvC Base 7.1.7.2025-2 (SP2 cumulative hotfix10)
Cumulative hotfix CDP PvC Base 7.1.7.2026-3 (SP2 cumulative hotfix11)
Cumulative hotfix CDP PvC Base 7.1.7.2030-1 (SP2 cumulative hotfix12)
Cumulative hotfix CDP PvC Base 7.1.7.2032-1 (SP2 cumulative hotfix13)
Cumulative hotfix CDP PvC Base 7.1.7.2035-2 (SP2 cumulative hotfix14)
Cumulative hotfix CDP PvC Base 7.1.7.2038-1 (SP2 cumulative hotfix15)
Cumulative hotfix CDP PvC Base 7.1.7.2040-4 (SP2 cumulative hotfix16)
Cumulative hotfix CDP PvC Base 7.1.7.2046-1 (SP2 cumulative hotfix17)
Cumulative hotfix CDP PvC Base 7.1.7.2047-1 (SP2 cumulative hotfix18)
Cumulative hotfix CDP PvC Base 7.1.7.2050-1 (SP2 cumulative hotfix19)
Cumulative hotfix CDS 3.2.7172000.10-1
Cumulative hotfix CDS 3.2.7172000.12-1
Cumulative hotfix CDS 3.2.7172000.13-4
Cumulative hotfix CDS 3.2.7172000.14-1
Cumulative hotfix CDS 3.2.7172000.15-1
Cumulative hotfix CDS 3.2.7172000.16-1
Cumulative hotfix CDS 3.2.7172000.3-3
Cumulative hotfix CDS 3.2.7172000.6-1
Cumulative hotfix CDS 3.2.7172000.8-1
Cumulative hotfix CDS 3.2.7172000.9-1
Cumulative hotfix CDS 3.2.7173000.2-1
Cumulative hotfix CDS 3.2.7173000.3-1
Cumulative hotfixes
Cumulative hotfixes
Cumulative hotfixes
Cumulative hotfixes
Cumulative hotfixes
Cumulative hotfixes for CDS
Custom Configuration
Custom Installation Scenarios
Custom Installation Solutions
Customize dynamic resource allocation settings
Customize interpreter settings in a note
Customize the HDFS home directory
Customizing HDFS
Customizing Kerberos principals
Customizing Per-Bucket Secrets Held in Credential Files
Customizing the Hue web UI
Customizing time zones
DAS
DAS
DAS administration using Ambari in CDP
DAS administration using Cloudera Manager in CDP
DAS architecture
Dashboard Types
Dashboards
Data Access
Data Access
Data Analytics Studio (DAS)
Data Analytics Studio Eventprocessor Health Tests
Data Analytics Studio Eventprocessor Metrics
Data Analytics Studio Metrics
Data Analytics Studio Overview
Data Analytics Studio overview
Data Analytics Studio Properties in Cloudera Runtime 7.1.7
Data Analytics Studio Webapp Server Health Tests
Data Analytics Studio Webapp Server Metrics
Data at Rest Encryption Reference Architecture
Data at Rest Encryption Requirements
Data at Rest Encryption Requirements
Data compaction
Data Context Connector Properties in Cloudera Runtime 7.1.7
Data Discovery Service Agent Health Tests
Data Discovery Service Agent Metrics
Data Encryption Components and Solutions
Data Granularity and Time-Series Metric Data
Data migration to Apache Hive
Data protection
Data Science
Data Stewardship with Apache Atlas
Data Storage for Monitoring Data
Data storage metrics
Data types
Database Requirements
Databases
Databases and Table Names
DataNode Health Tests
DataNode Metrics
DataNodes
DataNodes
DataNodes page
Date and time functions
DATE data type
DDL statements
Deactivate and Remove Parcels
Debug Web UI for Catalog Server
Debug Web UI for Impala Daemon
Debug Web UI for StateStore
Decide to use the BucketCache
DECIMAL data type
Decimal type
Decommission or remove a tablet server
Decommissioning Hosts
Decommissioning Ozone DataNodes
Decommissioning Role Instances
Decrease Reserve Space
Dedicated Coordinator
Default EXPIRES ON tag policy
Default ports of OpDB
Default Ranger audit filters
Default User Roles
Default view of Kafka Connect in the SMM UI
Defining a backup target in solr.xml
Defining and adding clusters for replication
Defining Apache Atlas enumerations
Defining co-located Kafka clusters using a service dependency
Defining co-located Kafka clusters using Kafka credentials
Defining external Kafka clusters
Defining related terms
Delegation token based authentication
Delete a bucket
Delete a group
Delete a Key
Delete a role
Delete a user
Delete data
Delete HBase snapshots from Amazon S3
Delete Objects
Delete placement rules
Delete Queue
Delete queues
Delete snapshots using Cloudera Manager
DELETE statement
Delete the Cluster
Deleting a Cluster
Deleting a collection
Deleting a connector using Kafka Connect in SMM
Deleting a Host from Cloudera Manager
Deleting a Host Template
Deleting a Kafka topic
Deleting a notifier
Deleting a schema
Deleting all documents in a collection
Deleting an alert policy
Deleting data from a table
Deleting dynamically created child queues
Deleting Encryption Zone Keys
Deleting Encryption Zones
Deleting Hosts
Deleting partitions
Deleting Role Instances
Deleting Services
Deleting tables
Deletion
Dell EMC PowerScale
DENSE_RANK
Deploy and manage services on YARN
Deploy HBase replication
Deploying and configuring Oozie Sqoop1 Action JDBC drivers
Deploying Atlas service
Deploying Clients
Deployment Planning for Cloudera Search
Deprecation notices in Cloudera Manager 7.11.3 CHF4
Deprecation notices in Cloudera Runtime 7.1.7
Deprecation notices in Cloudera Runtime 7.1.7 SP3
DESCRIBE EXTENDED and DESCRIBE FORMATTED
DESCRIBE statement
Describing a materialized view
Designating Directories to Include in Disk Usage Reports
Detecting slow DataNodes
Determining the table type
Developing and running an Apache Spark WordCount application
Developing Apache Kafka Applications
Developing Apache Spark Applications
Developing Applications with Apache Kudu
Diagnostic Data Collection
Diagnostics logging
Dimensioning guidelines
Direct Reader configuration properties
Direct Reader limitations
Direct Reader mode introduction
Directory configurations
Directory Metrics
Directory Usage Report
Disable a provider in an existing provider configuration
Disable loading of coprocessors
Disable OpDB's use of HDFS trash
Disable proxy for a known service in Apache Knox
Disable RegionServer grouping
Disable replication at the peer level
Disable the BoundedByteBufferPool
Disable the Firewall
Disabling an alert policy
Disabling and redeploying HDFS HA
Disabling auto queue deletion
Disabling automatic compaction
Disabling impersonation (doas)
Disabling Oozie High Availability
Disabling Oozie UI using Cloudera Manager
Disabling or changing the Credential Storage Provider (Technical Preview)
Disabling redaction
Disabling Redaction of sensitive information when using the Cloudera Manager API
Disabling replication of parameters during Hive replication
Disabling Static Service Pools
Disabling the Automatic Sending of Diagnostic Data from a Manually Triggered Collection
Disabling the Firewall
Disabling the reporting feature
Disabling the share option in Hue
Disabling the web metric collection for Hue
Disabling TLS protocols on JMX ports
Disabling Transparent Hugepages (THP)
Disassociate partitions from queues
Discovering possible predicates
Disk Balancer commands
Disk management
Disk Metrics
Disk Removal
Disk Replacement
Disk space usage issue
Disk space versus namespace
Disk Usage Reports
Disk Usage Reports
Displaying Chart Details
DistCp and Proxy Settings
Distcp between secure clusters in different Kerberos realms
Distcp syntax and examples
DISTINCT operator
DML statements
DOCKER Health Tests
Docker on YARN configuration properties
Docker on YARN example: DistributedShell
Docker on YARN example: MapReduce job
Docker on YARN example: Spark-on-Docker-on-YARN
Docker Server Health Tests
Docker Server Metrics
Documentation Errata in Cloudera Manager 7.11.3 Cumulative hotfix 4 (CDP Private Cloud Base 7.1.7 SP3)
Documentation Errata in Cloudera Manager 7.6.1 (CDP Private Cloud Base 7.1.7 SP1)
Documentation Errata in Cloudera Runtime 7.1.7 SP1
Documentation Errata in Cloudera Runtime 7.1.7 SP2
Documentation Errata in Cloudera Runtime 7.1.7 SP3
DOUBLE data type
Download a file
Download and install PostgreSQL
Download the Cluster Utilization Report
Download the Trial version of CDP Private Cloud Base
Download the Trial version of CDP Private Cloud Base
Downloading and exporting data from Hue
Downloading and installing MariaDB database
Downloading and installing MySQL database
Downloading and Publishing the Package Repository
Downloading and Publishing the Parcel Repository
Downloading Audit Events
Downloading Client Configuration Files
Downloading HDFS Directory Access Permission Reports
Downloading Hdfsfindtool from the CDH archives
Downloading query results from Hue takes time
Downloading Reports as CSV and XLS Files
Downloading, staging, and activating the Oracle Instant Client parcel
Driver inter-node coordination
Drop a Kudu table
DROP DATABASE statement
DROP FUNCTION statement
DROP MATERIALIZED VIEW
DROP ROLE statement
DROP STATS statement
DROP TABLE statement
DROP VIEW statement
Dropping a materialized view
Dropping an external table along with data
Dumping the Oozie database
Dynamic allocation
Dynamic Queue Scheduling [Technical Preview]
Dynamic resource allocation properties
Dynamic Resource Pool Settings
Dynamic resource-based column masking in Hive with Ranger policies
Dynamic tag-based column masking in Hive with Ranger policies
Dynamically loading a custom filter
Ecs Agent Health Tests
Ecs Agent Metrics
ECS Health Tests
Ecs Server Health Tests
Ecs Server Metrics
Edit a group
Edit a role
Edit a user
Edit or delete a snapshot policy
Editing a Chart
Editing a Host Template
Editing rack assignments for hosts
Editing tables
Editing the S3Guard Configuration
Editing, Deleting, Suppressing, or Deleting a Trigger
Effects of WAL rolling on replication
Elements of the Recon web user interface
Enable Access Control for Data
Enable Access Control for Interpreter, Configuration, and Credential Settings
Enable Access Control for Notebooks
Enable an NTP Service
Enable an NTP Service
Enable and disable snapshot creation using Cloudera Manager
Enable asynchronous scheduler
Enable authorization for additional HDFS web UIs
Enable authorization for HDFS web UIs
Enable authorization in Kafka with Ranger
Enable bulk load replication using Cloudera Manager
Enable Cgroups
Enable core dump
Enable detection of slow DataNodes
Enable disk IO statistics
Enable document-level authorization
Enable garbage collector logging
Enable GZipCodec as the default compression codec
Enable HBase high availability using Cloudera Manager
Enable HBase indexing
Enable hedged reads for HBase
Enable high availability
Enable HTTPS communication
Enable Intra-Queue Preemption for a specific queue
Enable Kerberos authentication
Enable Kerberos authentication in Solr
Enable LDAP authentication in Solr
Enable multi-threaded faceting
Enable namespace mapping
Enable node label on a cluster to configure partition
Enable or disable authentication with delegation tokens
Enable override of default queue mappings
Enable Phoenix ACLs
Enable preemption for a specific queue
Enable proxy for a known service in Apache Knox
Enable Ranger Admin login using kerberos authentication
Enable Ranger authorization in Solr
Enable RegionServer grouping using Cloudera Manager
Enable replication on a specific table
Enable Replication on HBase Column Families
Enable security for Cruise Control
Enable security for Cruise Control
Enable Sensitive Data Redaction
Enable server-server mutual authentication
Enable snapshot creation on a directory
Enable the AdminServer
Enable the Cluster Utilization Report
Enabling a multi-threaded environment for Hue
Enabling Access Control for Zeppelin Elements
Enabling access to HBase browser from Hue
Enabling ACL for RegionServer grouping
Enabling Admission Control
Enabling all scheduled queries
Enabling an alert policy
Enabling and Configuring Static Service Pools
Enabling and disabling HDFS snapshots
Enabling and Disabling Log Event Capture
Enabling and disabling trash
Enabling CDS 3.2.3 with GPU Support
Enabling Configuration Change Alerts
Enabling Configuration Change Alerts
Enabling custom Kerberos principal support in a Queue Manager cluster
Enabling custom Kerberos principal support in a Queue Manager cluster
Enabling custom Kerberos principal support in YARN
Enabling custom Kerberos principal support in YARN
Enabling DEBUG
Enabling dynamic child creation in weight mode
Enabling Fast Upload using Cloudera Manager
Enabling fault-tolerant processing in Spark Streaming
Enabling HBase Alerts
Enabling HDFS HA
Enabling Health Alerts
Enabling High Availability and automatic failover
Enabling httpd log rotation for Hue
Enabling Hue applications with Cloudera Manager
Enabling Hue as a TLS/SSL client
Enabling Hue as a TLS/SSL client
Enabling Hue as a TLS/SSL server using Cloudera Manager
Enabling Hue as a TLS/SSL server using Cloudera Manager
Enabling interceptors
Enabling Intra-Queue preemption
Enabling Kerberos authentication and RPC encryption
Enabling Kerberos Authentication for CDP
Enabling Kerberos Authentication for the KMS
Enabling Kerberos for the SRM service
Enabling LazyPreemption
Enabling LDAP Authentication for impala-shell
Enabling LDAP authentication with HiveServer2 and Impala
Enabling LDAP for in Hue
Enabling Native Acceleration For MLlib
Enabling Oozie High Availability
Enabling Oozie SLA with Cloudera Manager
Enabling or disabling anonymous usage date collection
Enabling Ranger authorization
Enabling replication between clusters with Kerberos authentication
Enabling Resource Management with Control Groups
Enabling SASL in HiveServer
Enabling scheduled queries
Enabling security for Apache Flink
Enabling self-healing for all or individual anomaly types
Enabling self-healing in Cruise Control
Enabling Snapshots
Enabling Solr clients to authenticate with a secure Solr
Enabling Spark authentication
Enabling Spark Encryption
Enabling Spark rolling event log files in CDP
Enabling Speculative Execution
Enabling SSE-C
Enabling SSE-KMS
Enabling SSE-S3
Enabling the Dynamic Queue Scheduling feature
Enabling the Hive Metastore integration
Enabling the Oozie web console on managed clusters
Enabling the SQL editor autocompleter
Enabling TLS Encryption for SMM on CDP Private Cloud
Enabling TLS/SSL communication with HiveServer2
Enabling TLS/SSL communication with HiveServer2
Enabling TLS/SSL communication with Impala
Enabling TLS/SSL communication with Impala
Enabling TLS/SSL for HiveServer
Enabling TLS/SSL for HiveServer
Enabling TLS/SSL for Hue Load Balancer
Enabling TLS/SSL for Hue Load Balancer
Enabling TLS/SSL for the SRM service
Enabling TLS/SSL for the SRM service
Enabling vectorized query execution
Encrypting an S3 Bucket with Amazon S3 Default Encryption
Encrypting and Decrypting Data Using Cloudera Navigator Encrypt
Encrypting Data at Rest
Encrypting Data at Rest
Encrypting Data in Transit
Encrypting Data in Transit
Encrypting data in transit between clusters
Encrypting Data on S3
Encryption
Encryption
Encryption in SSB
Encryption Zones and Keys
End to end latency overview
End to end latency use case
Ending a CDP Private Cloud Base Trial
Enforcing TLS version 1.2 for Hue
Enhancements related to bulk glossary terms import
Enter Required Parameters
Environment variables for sizing NameNode heap memory
Erasure coding CLI command
Erasure coding examples
Erasure coding overview
Error Messages and Various Failures
Error validating LDAP user in Hue
Errors during hole punching test
Escaping an invalid identifier
Essential metrics to monitor
ETL with Cloudera Morphlines
Event Server
Event Server Health Tests
Event Server Metrics
Events
Evolving a schema
Example - Placement rules creation
Example configuration to add to the sudoers file
Example for using THttpClient API in secure cluster
Example for using THttpClient API in unsecure cluster
Example for using TSaslClientTransport API in secure cluster without HTTP
Example of Cruise Control goal configuration
Example use cases
Example workload
Example: Configuration for work preserving recovery
Example: Running SparkPi on YARN
Example: Using the HBase-Spark connector
Examples
Examples of accessing Amazon S3 data from Spark
Examples of Audit Operations
Examples of controlling data access using classifications
Examples of creating and using UDFs
Examples of DistCp commands using the S3 protocol and hidden credentials
Examples of estimating NameNode heap memory
Examples of interacting with Schema Registry
Examples of overlapping quota policies
Examples of using the AWS CLI for Ozone S3 Gateway
Examples of using the S3A filesystem with Ozone S3 Gateway
Examples of writing data in various file formats
Excluding audits for specific users, groups, and roles
Exit statuses for the HDFS Balancer
Experimental flags
EXPLAIN statement
Exploring using Lineage
Export a Note
Export a snapshot to another cluster
Export all resource-based policies for all services
Export Ranger reports
Export resource-based policies for a specific service
Export tag-based policies
Exporting Data from Charts
Exporting the Cluster Configuration
Expose HBase metrics to a Ganglia server
Extending Atlas to Manage Metadata from Additional Sources
Extending Cloudera Manager
External table access
Failover Controller Health Tests
Failover Controller Metrics
Failures during INSERT, UPDATE, UPSERT, and DELETE operations
Fan-in and Fan-out Replication Flows
FAQ
Feature comparison
Feature Comparisons
Fetching Spark Maven dependencies
File descriptor limits
File descriptors
File system partitioning recommendations
Files and directories
Files and directories
Filesystem Metrics
Filesystems
Filter Attributes
Filter Attributes
Filter Expressions
Filter Expressions
Filter HMS results
Filter service access logs from Ranger UI
Filter types
Filtering Audit Events
Filtering by Day of Week or Hour of Day
Filtering Events
Filtering Jobs
Filtering Logs
Filtering Metrics
Filtering Metrics
Filtering Queries
Filtering the Activities List
Filtering the Tasks List
Filters
Find latest OpDB keytab
Finding issues
Finding the list of Hue superusers
Finding the list of Hue superusers
FIRST_VALUE
Fixed Common Vulnerabilities and Exposures 7.1.7 SP1
Fixed Common Vulnerabilities and Exposures 7.1.7 SP2
Fixed Common Vulnerabilities and Exposures 7.1.7 SP3
Fixed Common Vulnerabilities and Exposures in Cloudera Manager 7.6.7 (CDP Private Cloud Base 7.1.7 SP2)
Fixed Issues in Apache Atlas
Fixed Issues in Apache Atlas
Fixed Issues in Apache Atlas
Fixed Issues in Apache Avro
Fixed Issues in Apache Avro
Fixed Issues in Apache Avro
Fixed issues in Apache Calcite
Fixed issues in Apache Calcite
Fixed Issues in Apache Hadoop
Fixed Issues in Apache Hadoop
Fixed Issues in Apache Hadoop
Fixed Issues in Apache HBase
Fixed Issues in Apache HBase
Fixed Issues in Apache HBase
Fixed Issues in Apache HDFS
Fixed Issues in Apache HDFS
Fixed Issues in Apache HDFS
Fixed Issues in Apache Hive
Fixed Issues in Apache Hive
Fixed Issues in Apache Hive
Fixed Issues in Apache Impala
Fixed Issues in Apache Impala
Fixed Issues in Apache Impala
Fixed Issues in Apache Kafka
Fixed Issues in Apache Kafka
Fixed Issues in Apache Kafka
Fixed Issues in Apache Knox
Fixed Issues in Apache Knox
Fixed Issues in Apache Knox
Fixed Issues in Apache Kudu
Fixed Issues in Apache Kudu
Fixed Issues in Apache Kudu
Fixed Issues in Apache Oozie
Fixed Issues in Apache Oozie
Fixed Issues in Apache Oozie
Fixed issues in Apache Ozone
Fixed issues in Apache Ozone
Fixed issues in Apache Ozone
Fixed Issues in Apache Parquet
Fixed Issues in Apache Parquet
Fixed Issues in Apache Parquet
Fixed Issues in Apache Phoenix
Fixed Issues in Apache Phoenix
Fixed Issues in Apache Ranger
Fixed Issues in Apache Ranger
Fixed Issues in Apache Ranger
Fixed Issues in Apache Solr
Fixed Issues in Apache Solr
Fixed Issues in Apache Spark
Fixed Issues in Apache Spark
Fixed Issues in Apache Spark
Fixed Issues in Apache Sqoop
Fixed Issues in Apache Sqoop
Fixed Issues in Apache Sqoop
Fixed Issues in Apache Tez
Fixed Issues in Apache Tez
Fixed Issues in Apache Tez
Fixed Issues in Apache YARN
Fixed Issues in Apache YARN and YARN Queue Manager
Fixed Issues in Apache YARN and YARN Queue Manager
Fixed Issues in Apache Zookeeper
Fixed Issues in Apache Zookeeper
Fixed Issues in Apache Zookeeper
Fixed issues in Cloud Connectors
Fixed issues in Cloud Connectors
Fixed Issues in Cloudera Manager 7.11.3 Cumulative hotfix 4 (CDP Private Cloud Base 7.1.7 SP3)
Fixed Issues in Cloudera Manager 7.4.4
Fixed Issues in Cloudera Manager 7.6.1 (CDP Private Cloud Base 7.1.7 SP1)
Fixed Issues in Cloudera Manager 7.6.7 (CDP Private Cloud Base 7.1.7 SP2)
Fixed issues in Cloudera Runtime 7.1.7
Fixed issues in Cloudera Runtime 7.1.7 SP1
Fixed issues in Cloudera Runtime 7.1.7 SP2
Fixed issues in Cloudera Runtime 7.1.7 SP3
Fixed Issues in Cloudera Search
Fixed Issues in Cloudera Search
Fixed Issues in Cloudera Search
Fixed issues in Cruise Control
Fixed issues in Cruise Control
Fixed issues in Cruise Control
Fixed issues in Data Analytics Studio
Fixed issues in Data Analytics Studio
Fixed issues in Data Analytics Studio
Fixed Issues in Hue
Fixed Issues in Hue
Fixed Issues in Hue
Fixed Issues in Kerberos
Fixed Issues in Kerberos
Fixed Issues in Livy
Fixed Issues in Livy
Fixed Issues in MapReduce
Fixed Issues in MapReduce
Fixed Issues in Navigator Encrypt
Fixed Issues in Navigator Encrypt
Fixed Issues in Navigator Encrypt
Fixed Issues in Phoenix
Fixed Issues in Schema Registry
Fixed Issues in Schema Registry
Fixed Issues in Schema Registry
Fixed Issues in Streams Messaging Manager
Fixed Issues in Streams Messaging Manager
Fixed Issues in Streams Messaging Manager
Fixed Issues in Streams Replication Manager
Fixed Issues in Streams Replication Manager
Fixed Issues in Streams Replication Manager
Fixed Issues in Zeppelin
Fixed Issues in Zeppelin
Fixed Issues in Zeppelin
Fixing a warning related to accessing non-optimized Hue
Fixing authentication issues between HBase and Hue
Fixing block inconsistencies
Fixing incorrect start time and duration on Hue Job Browser
Fixing issues
Flink Dashboard Health Tests
Flink Dashboard Metrics
Flink Metrics
FLOAT data type
Flume Agent Health Tests
Flume Channel Metrics
Flume Health Tests
Flume Metrics
Flume Sink Metrics
Flume Source Metrics
Flush options
Flushing data to disk
Format for using Hadoop archives with MapReduce
Frequently asked questions
Functions
Garbage Collector Health Tests
Garbage Collector Metrics
General Quota Syntax
General Settings
Generate a table list
Generating a New Certificate
Generating and viewing Apache Hive statistics
Generating collection configuration using configs
Generating Solr collection configuration using instance directories
Generating statistics
Generating surrogate keys
Generating Table and Column Statistics
Getting Metrics for Streams Messaging Manager
Getting scheduled query information and monitor the query
Getting Started on your Streams Cluster
Getting the JDBC driver
Getting the ODBC driver
Glossaries overview
Governance
Governance
Governance Overview
Graceful HBase shutdown
Gracefully shut down an HBase RegionServer
Gracefully shut down the HBase service
GRANT ROLE statement
GRANT statement
Granularity of metrics for end-to-end latency
GROUP BY clause
Grouping (Faceting) Time Series
Groups and fetching
GROUP_CONCAT function
Guidelines for Schema Design
Guidelines to add or delete source data during replication job run
Guidelines to use snapshot diff-based replication
Hadoop
Hadoop
Hadoop archive components
Hadoop File Formats Support
Hadoop File System commands
Hadoop Users (user:group) and Kerberos Principals
Handling disk failures
Handling large messages
Hardware Requirements
Hash and hash partitioning
Hash and range partitioning
Hash partitioning
Hash partitioning
HashTable/SyncTable tool configuration
HAVING clause
HBase
HBase
HBase
HBase
HBase actions that produce Atlas entities
HBase audit entries
HBase authentication
HBase authorization
HBase backup and disaster recovery strategies
HBASE Category
HBase entities created in Atlas
HBase filtering
HBase Health Tests
HBase I/O components
HBase is using more disk space than expected
Hbase lineage
HBase MCC Configurations
HBase MCC Restrictions
HBase MCC Usage in Spark with Java
HBase MCC Usage in Spark with Scala
HBase MCC Usage with Kerberos
HBase metadata collection
HBase Metrics
HBase online merge
HBase Properties in Cloudera Runtime 7.1.7
HBase read replicas
HBase RegionServer Replication Peer Metrics
HBase REST Server Health Tests
HBase REST Server Metrics
HBase Shell example
HBase snapshots on Amazon S3 with Kerberos enabled
HBase Thrift Server Health Tests
HBase Thrift Server Metrics
HBaseMapReduceIndexerTool command line reference
HBCK2 tool command reference
HDFS
HDFS
HDFS
HDFS
HDFS
HDFS ACLs
HDFS Block Skew
HDFS Cache Directive Metrics
HDFS Cache Pool Metrics
HDFS Caching
HDFS commands for metadata files and directories
HDFS Encryption Issues
HDFS entity metadata migration
HDFS Health Tests
HDFS Metrics
HDFS Metrics
HDFS Properties in Cloudera Runtime 7.1
HDFS replication in Sentry-enabled clusters
HDFS replication policies
HDFS replication policy considerations
HDFS Sink Connector
HDFS Sink Connector Properties Reference
HDFS storage demands due to retained HDFS trash
HDFS storage policies
HDFS storage types
HDFS storage types
HDFS to Apache Hive data migration
HDFS Transparent Encryption
Head a bucket
Head an object
Health Tests
Health Tests and Health History
Health Tests and Health History
HEALTH_CHECK Category
Heap sampling
HeapDumpPath (/tmp) in Hive data nodes gets full due to .hprof files
Hierarchical namespaces vs. non-namespaces
Hierarchical queue characteristics
High Availability on HDFS clusters
Highly Available Kafka Architectures
History Server Health Tests
History Server Metrics
Hive
Hive
Hive
Hive
Hive
Hive access authorization
Hive authentication
Hive entity metadata migration
Hive Execution Health Tests
Hive Execution Metrics
Hive external table replication policies
Hive Health Tests
Hive LLAP Health Tests
Hive LLAP Metrics
Hive LLAP Properties in Cloudera Runtime 7.1.7
Hive Metastore Server Health Tests
Hive Metastore Server Metrics
Hive Metrics
Hive on Tez configurations
Hive on Tez Health Tests
Hive on Tez introduction
Hive on Tez Metrics
Hive on Tez Properties in Cloudera Runtime 7.1.7
Hive Properties in Cloudera Runtime 7.1.7
Hive replication policy considerations
Hive Table Metrics
Hive tables and DDL commands
Hive unsupported interfaces and features
Hive Warehouse Connector for accessing Apache Spark data
Hive Warehouse Connector Interfaces
Hive-HDFS ACL Sync Reference
Hive-HDFS ACL Sync Use Cases
Hive/Impala replication using snapshots
HiveServer actions that produce Atlas entities
HiveServer audit entries
HiveServer entities created in Atlas
HiveServer is unresponsive due to large queries running in parallel
HiveServer lineage
HiveServer metadata collection
HiveServer relationships
HiveServer2 Health Tests
HiveServer2 Metrics
HMS table storage
Home Page
Host Configuration Properties
Host Details
Host Health Tests
Host Inspector
Host Management
Host Metrics
Host Monitor
Host Monitor and Service Monitor Memory Configuration
Host Monitor Health Tests
Host Monitor Metrics
Host Templates
Hosts Disks Overview
Hotfixes in Cloudera Runtime 7.1.7
Hotfixes in Cloudera Runtime 7.1.7 SP1
Hotfixes in Cloudera Runtime 7.1.7 SP2
How Client Configurations are Deployed
How Cloudera Search works
How DAS helps to debug Hive on Tez queries
How Integration works
How Lineage strategy works
How NameNode manages blocks on a failed DataNode
How NFS Gateway authenticates and maps users
How Ozone manages read operations
How Ozone manages write operations
How tag-based access control works
How the reporting task runs in a NiFi cluster
How to add a coarse URI check for Hive agent
How to Add Root and Intermediate CAs to Truststore for TLS/SSL
How to Authenticate Kerberos Principals Using Java
How to change the password for Ranger users
How to clear Ranger Admin access logs
How to Configure a MapReduce Job to Access S3 with an HDFS Credstore
How to configure Ranger HDFS plugin configs per (NameNode) Role Group
How to full sync the Ranger RMS database
How to pass JVM options to Ranger KMS services
How to read the Placement Rules table
How to read the Schedule table
How to set audit filters in Ranger Admin Web UI
How to Set up Failover and Failback
How to suppress database connection notifications
How to: Compute
How to: Data Access
How to: Data Science
How to: Governance
How to: Jobs Management
How to: Next-Gen Storage
How to: Operational Database
How to: Security
How to: Storage
How to: Streams Messaging
HRegion Metrics
HSM-Specific Setup for Cloudera Navigator Key HSM
HTable Metrics
HTTP 403 error while accessing Hue
HttpFS authentication
HttpFS Health Tests
HttpFS Metrics
Hue
Hue
Hue
Hue
Hue Advanced Configuration Snippet
Hue configuration files
Hue configurations in CDP Runtime
Hue Health Tests
Hue in a Virtual Private Cluster Environment
Hue Load Balancer does not start
Hue load balancer does not start after enabling TLS
Hue logs
Hue Metrics
Hue Overview
Hue overview
Hue Properties in Cloudera Runtime 7.1.7
Hue Server Health Tests
Hue Server Metrics
Hue service Django logs
Hue supported browsers
HWC and DataFrame API limitations
HWC and DataFrame APIs
HWC API Examples
HWC authorization
HWC authorization
HWC integration with pyspark, sparklyr, and Zeppelin
HWC limitations
HWC supported types mapping
IAM Role permissions for working with SSE-KMS
IBM Spectrum Scale
Identifiers
Identify Roles that Use the Embedded Database Server
Identifying problems
Identity Management
Impact of quota violation policy
Impala
Impala
Impala
Impala
Impala
Impala actions that produce Atlas entities
Impala aliases
Impala audit entries
Impala Authentication
Impala Authorization
Impala Best Practices
Impala Catalog Server Health Tests
Impala Catalog Server Metrics
Impala Daemon Health Tests
Impala Daemon Metrics
Impala Daemon Resource Pool Metrics
Impala database containment model
Impala DDL for Kudu
Impala DML for Kudu Tables
Impala entities created in Atlas
Impala entity metadata migration
Impala Health Tests
Impala integration limitations
Impala integration limitations
Impala lineage
Impala lineage
Impala Llama ApplicationMaster Health Tests
Impala Llama ApplicationMaster Metrics
Impala Logs
Impala metadata collection
Impala Metrics
Impala Pool Metrics
Impala Pool User Metrics
Impala Properties in Cloudera Runtime 7.1.7
Impala query counter metrics
Impala Query Metrics
Impala Requirements
Impala Shell Command Reference
Impala Shell Configuration File
Impala Shell Configuration Options
Impala Shell Tool
Impala SQL and Hive SQL
Impala StateStore Health Tests
Impala StateStore Metrics
Impala Tab
Impala with Amazon S3
Impala with Azure Data Lake Store (ADLS)
Impala with HBase
Impala with HDFS
Impala with Kudu
Implementing your own Custom Command
Import a Note
Import and sync LDAP users and groups
Import command options
Import Data from RDBMS into an S3 Bucket
Import Data into an External Hive Table Backed by S3
Import Data into S3 Bucket in Incremental Mode
Import External Packages
Import resource-based policies for a specific service
Import resource-based policies for all services
Import tag-based policies
Importance of a Secure Cluster
Importing and exporting resource-based policies
Importing and exporting tag-based policies
Importing Business Metadata associations in bulk
Importing Confluent Schema Registry schemas into Cloudera Schema Registry
Importing Data into Amazon S3 Using Sqoop
Importing data into HBase
Importing Data into Microsoft Azure Data Lake Store (Gen1 and Gen2) Using Sqoop
Importing Glossary terms in bulk
Importing Hive Metadata using Command-Line (CLI) utility
Importing RDBMS data into Hive
Importing RDBMS data to HDFS
Importing Sentry privileges into Ranger policies
Importing the Template to a New Cluster
Imports into Hive
Improve network latency during replication job run
Improve Performance in Schema Registry
Improving Performance for S3A
Improving Performance in Shuffle Handler and IFile Reader
Improving performance with centralized cache management
Improving performance with short-circuit local reads
Improving Software Performance
Increasing StateStore Timeout
Increasing storage capacity with HDFS compression
Increasing the maximum number of processes for Oracle database
Incrementally updating an imported table
Index sample data
Indexing
Indexing Data
Indexing Data Using Morphlines
Indexing Data Using Spark-Solr Connector
Indexing data with MapReduceIndexerTool in Solr backup format
Indexing sample Tweets with Cloudera Search
Information and debugging
Ingestion
Initializing Navigator Key HSM
Initializing Standalone Key Trustee Server
Initializing Standalone Key Trustee Server Using Cloudera Manager
Initiate replication when data already exist
Initiating automatic compaction in Cloudera Manager
Initiating HDFS failover using the Cloudera Manager API
INSERT and primary key uniqueness violations
Insert data
Insert data in test_table through Spark
INSERT statement
Inserting data into a table
Inspecting Network Performance
Install Accumulo
Install Accumulo 1.10 parcel
Install Accumulo CSD file
Install Accumulo parcel using Local Parcel Repository
Install Accumulo using Remote Parcel Repository
Install and configure additional required components
Install and Configure MariaDB for CDP
Install and Configure MySQL for CDP
Install and Configure Oracle Database
Install and Configure PostgreSQL for CDP
Install CDP
Install CDP
Install CDP
Install Cloudera Manager Packages
Install Cloudera Runtime
Install Cloudera Runtime
Install Docker
Install OpDB
Install OpDB
Install OpDB CSD file
Install OpDB CSD file
Install OpDB parcel
Install OpDB parcel
Install OpDB parcel using Local Parcel Repository
Install OpDB parcel using Local Parcel Repository
Install OpDB parcel using Remote Parcel Repository
Install OpDB parcel using Remote Parcel Repository
Install the NFS Gateway
Installation Reference
Installation Wizard
Installing a Java Keystore KMS
Installing a Kafka-centric cluster
Installing a Trial Cluster
Installing a Trial Streaming Cluster
Installing Accumulo Parcel 1.0.0
Installing Accumulo Parcel 1.1.0
Installing Accumulo Parcel 1.10
Installing and Configuring CDP with FIPS
Installing and configuring MariaDB on RHEL 8
Installing and configuring MySQL on RHEL 8
Installing and configuring the Oracle server
Installing Apache Knox
Installing Apache Knox
Installing Atlas in HA using CDP Private Cloud Base cluster
Installing Atlas using Add Service
Installing CDP Private Cloud Base
Installing CDS 3.2.3
Installing Cloudera Manager, Cloudera Runtime, and Managed Services
Installing Cloudera Navigator Encrypt
Installing Cloudera Navigator Key HSM
Installing Connectors
Installing Hive on Tez and adding a HiveServer role
Installing OpenJDK for CDP Runtime
Installing OpenJDK on Cloudera Manager
Installing Operational Database powered by Apache Accumulo
Installing Oracle JDK for CDP Runtime
Installing Postgres JDBC Driver
Installing PostgreSQL Server
Installing Ranger KMS backed by a Database and HA
Installing Ranger KMS backed with a Key Trustee Server and HA
Installing Ranger RMS
Installing Ranger using Add Service
Installing the GPL Extras Parcel
Installing the Kafka Connect Role
Installing the psycopg2 Python package for PostgreSQL-backed Hue
Installing the REST Server using Cloudera Manager
Installing the UDF development package
Instantiating a Cloudera Manager Image
Instantiating a worker host
INT data type
Integrating Apache Hive with Apache Spark and BI
Integrating Atlas with Ozone
Integrating Components for Encrypting Data at Rest
Integrating Hive and a BI tool
Integrating Kafka and Schema Registry
Integrating Key HSM with Key Trustee Server
Integrating MIT Kerberos and Active Directory
Integrating Ranger KMS DB with CipherTrust Manager HSM
Integrating Ranger KMS DB with Google Cloud HSM
Integrating Ranger KMS DB with SafeNet Keysecure HSM
Integrating the Hive Metastore with Apache Kudu
Integrating with Flink and SSB
Integrating with NiFi
Integrating with Schema Registry
Integrating your identity provider's SAML server with Hue
Inter-broker security
Inter-broker security
Interacting with Hive views
Internal and external Impala tables
Introducing the S3A Committers
Introduction
Introduction
Introduction
Introduction
Introduction
Introduction
Introduction
Introduction
Introduction
Introduction
Introduction to alert policies in Streams Messaging Manager
Introduction to Apache HBase
Introduction to Apache Phoenix
Introduction to Azure Storage and the ABFS Connector
Introduction to HBase Multi-cluster Client
Introduction to HBase Multi-cluster Client
Introduction to HDFS metadata files and directories
Introduction to Hive metastore
Introduction to Kafka Connect
Introduction to monitoring Kafka cluster replications in SMM
Introduction to Ozone
Introduction to Parcels
Introduction to Streams Messaging Manager
Invalid method name: 'GetLog' error
Invalid query handle
INVALIDATE METADATA statement
Isilon Metrics
ISR management
Issues starting or restarting the master or the tablet server
Java API example
Java client
Java KeyStore KMS Metrics
Java KeyStore KMS Properties in Cloudera Runtime 7.1.7
Java Requirements
JBOD
JBOD Disk migration
JBOD setup
JDBC connection string syntax
JDBC connection string syntax
JDBC mode configuration properties
JDBC mode limitations
JDBC read mode introduction
JobHistory Server Health Tests
JobHistory Server Metrics
JobTracker Health Tests
JobTracker Metrics
Joins in Impala SELECT statements
JournalNode Health Tests
JournalNode Metrics
JournalNodes
JournalNodes
JVM and garbage collection
Kafka
Kafka
Kafka
Kafka
Kafka
Kafka
Kafka actions that produce Atlas entities
Kafka Architecture
Kafka audit entries
Kafka Broker Health Tests
Kafka Broker Log Directory Metrics
Kafka Broker Metrics
Kafka Broker Topic Metrics
Kafka Broker Topic Partition Metrics
Kafka brokers and Zookeeper
Kafka clients and ZooKeeper
Kafka cluster load balancing using Cruise Control
Kafka Connect
Kafka Connect API Security
Kafka Connect Connector Reference
Kafka Connect Connector Sink Task Metrics Metrics
Kafka Connect Connector Source Task Metrics Metrics
Kafka Connect Connector Task Error Metrics Metrics
Kafka Connect Connector Task Metrics Metrics
Kafka Connect Health Tests
Kafka Connect Metrics
Kafka Connect Overview
Kafka Connect property configuration in Cloudera Manager for Prometheus
Kafka Connect Setup
Kafka Consumer Group Metrics
Kafka consumers
Kafka credentials property reference
Kafka FAQ
Kafka Health Tests
Kafka Introduction
Kafka lineage
Kafka metadata collection
Kafka Metrics
Kafka MirrorMaker Health Tests
Kafka MirrorMaker Metrics
Kafka Producer Metrics
Kafka producers
Kafka Properties in Cloudera Runtime 7.1.7
Kafka property configuration in Cloudera Manager for Prometheus
Kafka public APIs
Kafka relationships
Kafka Replica Metrics
Kafka security hardening with Zookeeper ACLs
Kafka storage handler and table properties
Kafka Streams
kafka-*-perf-test
kafka-configs
kafka-console-consumer
kafka-console-producer
kafka-consumer-groups
kafka-delegation-tokens
kafka-log-dirs
kafka-reassign-partitions
kafka-topics
Kafka-ZooKeeper performance tuning
Keep replicas current
Kerberos
Kerberos
Kerberos authentication
Kerberos authentication for non-default users
Kerberos configuration for Ozone
Kerberos Configuration Strategies for CDP
Kerberos configurations for HWC
Kerberos connectivity test
Kerberos principal and keytab properties for Ozone service daemons
Kerberos Security Artifacts Overview
Kerberos setup guidelines for Distcp between secure clusters
Kerberos Ticket Renewer Health Tests
Kerberos Ticket Renewer Metrics
Kernel stack watchdog traces
Key Concepts and Architecture
Key Features
Key Management Server Health Tests
Key Management Server Metrics
Key Management Server Proxy Health Tests
Key Management Server Proxy Metrics
Key management using ofs
Key Trustee KMS Encryption Issues
Key Trustee KMS Metrics
Key Trustee KMS operations not supported by Ranger KMS
Key Trustee KMS Properties in Cloudera Runtime 7.1.7
Key Trustee Server
Key Trustee Server Metrics
Key Trustee Server Properties for TLS
Key Trustee Server Properties in Cloudera Runtime 7.1.7
Key Trustee Server System Requirements
Key-Value Store Indexer Health Tests
Key-Value Store Indexer Metrics
Key-Value Store Indexer Properties in Cloudera Runtime 7.1.7
Keystores and the Key Management Server
kite-morphlines-avro
kite-morphlines-core-stdio
kite-morphlines-core-stdlib
kite-morphlines-hadoop-core
kite-morphlines-hadoop-parquet-avro
kite-morphlines-hadoop-rcfile
kite-morphlines-hadoop-sequencefile
kite-morphlines-json
kite-morphlines-maxmind
kite-morphlines-metrics-servlets
kite-morphlines-protobuf
kite-morphlines-saxon
kite-morphlines-solr-cell
kite-morphlines-solr-core
kite-morphlines-tika-core
kite-morphlines-tika-decompress
kite-morphlines-useragent
KMS ACL Configuration for Hive
Known issues and limitations
Known Issues for Apache Sqoop
Known Issues for Apache Sqoop
Known Issues for Apache Sqoop
Known Issues for IBM PowerPC
Known Issues in Apache Atlas
Known Issues in Apache Atlas
Known Issues in Apache Atlas
Known Issues in Apache Avro
Known Issues in Apache Avro
Known Issues in Apache Avro
Known Issues in Apache Calcite
Known issues in Apache Calcite
Known Issues in Apache Hadoop
Known Issues in Apache Hadoop
Known Issues in Apache Hadoop
Known Issues in Apache HBase
Known Issues in Apache HBase
Known Issues in Apache HBase
Known Issues in Apache Hive
Known Issues in Apache Hive
Known Issues in Apache Hive
Known Issues in Apache Impala
Known Issues in Apache Impala
Known Issues in Apache Impala
Known Issues in Apache Kafka
Known Issues in Apache Kafka
Known Issues in Apache Kafka
Known Issues in Apache Knox
Known Issues in Apache Knox
Known Issues in Apache Knox
Known Issues in Apache Kudu
Known Issues in Apache Kudu
Known Issues in Apache Kudu
Known Issues in Apache Oozie
Known Issues in Apache Oozie
Known Issues in Apache Oozie
Known Issues in Apache Ozone
Known Issues in Apache Ozone
Known Issues in Apache Ozone
Known Issues in Apache Parquet
Known Issues in Apache Parquet
Known Issues in Apache Parquet
Known Issues in Apache Phoenix
Known Issues in Apache Phoenix
Known Issues in Apache Phoenix
Known Issues in Apache Ranger
Known Issues in Apache Ranger
Known Issues in Apache Ranger
Known Issues in Apache Spark
Known Issues in Apache Spark
Known Issues in Apache Spark
Known Issues in Apache Zeppelin
Known Issues in Apache Zeppelin
Known Issues in Apache Zeppelin
Known Issues in Apache ZooKeeper
Known Issues in Apache ZooKeeper
Known Issues in Apache ZooKeeper
Known Issues in Cloudera Manager 7.11.3 Cumulative hotfix 4 (CDP Private Cloud Base 7.1.7 SP3)
Known Issues in Cloudera Manager 7.4.4
Known Issues in Cloudera Manager 7.6.1 (CDP Private Cloud Base 7.1.7 SP1)
Known Issues in Cloudera Manager 7.6.7 (CDP Private Cloud Base 7.1.7 SP2)
Known issues in Cloudera Runtime 7.1.7
Known issues in Cloudera Runtime 7.1.7 SP1
Known issues in Cloudera Runtime 7.1.7 SP2
Known Issues in Cloudera Runtime 7.1.7 SP3
Known Issues in Cloudera Search
Known Issues in Cloudera Search
Known Issues in Cloudera Search
Known Issues in Cruise Control
Known issues in Cruise Control
Known issues in Cruise Control
Known Issues in Data Analytics Studio
Known Issues in Data Analytics Studio
Known Issues in Data Analytics Studio
Known Issues in HDFS
Known Issues in HDFS
Known Issues in HDFS
Known Issues in Hue
Known Issues in Hue
Known Issues in Hue
Known Issues in Kerberos
Known Issues in MapReduce and YARN
Known Issues in MapReduce and YARN
Known Issues in MapReduce and YARN
Known Issues in Navigator Encrypt
Known Issues in Navigator Encrypt
Known Issues in Navigator Encrypt
Known Issues in Schema Registry
Known Issues in Schema Registry
Known Issues in Schema Registry
Known issues in Streams Messaging Manager
Known issues in Streams Messaging Manager
Known issues in Streams Messaging Manager
Known Issues in Streams Replication Manager
Known Issues in Streams Replication Manager
Known Issues in Streams Replication Manager
Knox
Knox
Knox Gateway Health Tests
Knox Gateway Metrics
Knox Gateway UI: incorrect username or password
Knox Health Tests
Knox IDBroker Health Tests
Knox IDBroker Metrics
Knox Metrics
Knox Properties for TLS
Knox Properties in Cloudera Runtime 7.1.7
Knox Supported Services Matrix
Knox Topology Management in Cloudera Manager
Kudu
Kudu
Kudu
Kudu
Kudu
Kudu and Apache Ranger integration
Kudu architecture in a CDP private cloud base deployment
Kudu authentication
Kudu authentication tokens
Kudu authentication with Kerberos
Kudu authorization policies
Kudu authorization tokens
Kudu backup
Kudu coarse-grained authorization
Kudu concepts
Kudu example applications
Kudu fine-grained authorization
Kudu Health Tests
Kudu integration with Spark
Kudu introduction
Kudu master web interface
Kudu metrics
Kudu Metrics
Kudu network architecture
Kudu Properties in Cloudera Runtime 7.1.7
Kudu Python client
Kudu recovery
Kudu Replica Metrics
Kudu schema design
Kudu security considerations
Kudu security limitations
Kudu security limitations
Kudu tablet server web interface
Kudu tracing
Kudu transaction semantics
Kudu web interfaces
Kudu-Impala integration
LAG
LAST_VALUE
Launch a YARN service
Launch distcp
Launch Zeppelin
Launching Apache Phoenix Thin Client
LAZY_PERSIST memory storage policy
LDAP authentication
LDAP properties
LDAP search fails with invalid credentials error
LDAP Settings
LEAD
Leader positions and in-sync replicas
Lengthy BalancerMember Route length
Leveraging Business Metadata
Lifecycle and Security Auditing
Lily HBase batch indexing for Cloudera Search
Lily HBase Indexer Health Tests
Lily HBase Indexer Metrics
Lily HBase Near Real Time Indexing for Cloudera Search
LIMIT clause
Limit CPU usage with Cgroups
Limitations
Limitations
Limitations
Limitations and restrictions for Impala UDFs
Limitations of Amazon S3
Limitations of Atlas-NiFi integration
Limitations of erasure coding
Limitations of Phoenix-Hive connector
Limitations of the S3A Committers
Limiting concurrent connections
Limiting the speed of compactions
Lineage lifecycle
Lineage overview
Linux Container Executor
Linux Control Groups (cgroups)
List and Create Keys
List buckets
List files in Hadoop archives
List of APIs verified
List of supported non-alphanumeric characters for file and directory names in Hue
List of Thrift API and HBase configurations
Listing available metrics
Listing Repositories
Literals
Live write access
Livy API reference for batch jobs
Livy API reference for interactive sessions
Livy batch object
Livy for Spark 3 Health Tests
Livy for Spark 3 Metrics
Livy for Spark 3 Properties in Cloudera Runtime 7.1.7
Livy Health Tests
Livy interpreter configuration
Livy Metrics
Livy objects for interactive sessions
Livy Properties in Cloudera Runtime 7.1.7
Livy Server for Spark 3 Health Tests
Livy Server for Spark 3 Metrics
Livy Server Health Tests
Livy Server Metrics
LLAP Proxy Health Tests
LLAP Proxy Metrics
Load Balancer Health Tests
Load Balancer Metrics
LOAD DATA statement
Loading ORC data into DataFrames using predicate push-down
Loading the Oozie database
Local file system support
Locating Hive tables and changing the location
Log a Security Support Case
Log Aggregation File Controllers
Log Aggregation Properties
Log cleaner
Log Details
Log support in Cloudera Manager for ECS cluster
Logical Architecture
Logical operators, comparison operators and comparators
Logs
Logs and Events
Logs and log segments
Logs List
LOG_MESSAGE Category
Main Use Cases
Maintaining Cloudera Navigator Encrypt
Maintenance manager
Maintenance Mode
Manage databases and tables
Manage dynamic queues
Manage HBase snapshots on Amazon S3 in Cloudera Manager
Manage HBase snapshots using Cloudera Manager
Manage HBase snapshots using the HBase shell
Manage individual delegation tokens
Manage placement rules
Manage Policies for HBase snapshots in Amazon S3
Manage queries
Manage Queues
Manage reports
Manage the YARN service life cycle through the REST API
Managed Parent Queues
Management basics
Management of existing Apache Knox shared providers
Management of Knox shared providers in Cloudera Manager
Management of Service Parameters for Apache Knox via Cloudera Manager
Management of services for Apache Knox via Cloudera Manager
Managing Access Control Lists
Managing alert policies and notifiers in SMM
Managing Alert Policies using Streams Messaging Manager
Managing Alerts
Managing and Allocating Cluster Resources using Capacity Scheduler
Managing Anonymous Usage Data Collection
Managing Apache Hadoop YARN Services
Managing Apache HBase
Managing Apache HBase Security
Managing Apache Hive
Managing Apache Impala
Managing Apache Kafka
Managing Apache Kudu
Managing Apache Kudu Security
Managing Apache Phoenix Security
Managing Apache Phoenix security
Managing Apache ZooKeeper
Managing Apache ZooKeeper Security
Managing Auditing with Ranger
Managing Business Terms with Atlas Glossaries
Managing Cloudera Manager
Managing Cloudera Manager Server Logs
Managing Cloudera Runtime Services
Managing Cloudera Search
Managing Clusters
Managing collection configuration
Managing collections
Managing columns
Managing Cruise Control
Managing Dashboards
Managing Data Storage
Managing Disk Space for Log Files
Managing dynamic child creation enabled parent queues
Managing Dynamic Configurations
Managing dynamically created child queues
Managing Encryption Keys and Zones
Managing HDFS snapshots in Cloudera Manager
Managing Hosts
Managing Hue permissions
Managing Kafka Topics using Streams Messaging Manager
Managing Kerberos credentials using Cloudera Manager
Managing Key Trustee Server Certificates
Managing Key Trustee Server Organizations
Managing Licenses
Managing Logs
Managing Metadata in Impala
Managing Metadata in Impala
Managing Operational Database powered by Apache Accumulo
Managing Parcels
Managing partition retention time
Managing partitions
Managing query rewrites
Managing Re-encryption Operations
Managing replication policies
Managing Resources in Impala
Managing Role Groups
Managing Roles
Managing snapshot policies using Cloudera Manager
Managing Spark Driver Logs
Managing storage elements by using the command-line interface
Managing Suppressed Validations
Managing tables
Managing the Cloudera Manager Agent Logs
Managing the Navigator Key HSM Service
Managing topics across multiple Kafka clusters
Managing YARN Docker Containers
Managing YARN queue users
Managing, Deploying and Monitoring Connectors
Manually configuring SAML authentication
Manually Configuring TLS Encryption for Cloudera Manager
Manually Configuring TLS Encryption on the Agent Listening Port
Manually failing over to the standby NameNode
Manually Install Cloudera Manager Agent Packages
Manually Install Cloudera Software Packages
Manually Redeploying Client Configuration Files
Manually Triggering Collection and Transfer of Diagnostic Data to Cloudera
MAP complex type
Mapping Apache Phoenix schemas to Apache HBase namespaces
Mapping Kerberos Principals to Short Names
Mapping Sentry permissions for Solr to Ranger policies
MapReduce Health Tests
MapReduce indexing
MapReduce Job ACLs
MapReduce Metrics
MapReduceIndexerTool
MapReduceIndexerTool input splits
MapReduceIndexerTool metadata
MapReduceIndexerTool usage syntax
Master Health Tests
Master Metrics
Materialized View Engine Health Tests
Materialized View Engine Metrics
Materialized views
Mathematical functions
Maven Artifacts for Cloudera Runtime 7.1.7 SP1
Maven Artifacts for Cloudera Runtime 7.1.7.0
MAX
MAX function
Memory
Memory limits
Merge process stops during Sqoop incremental imports
Merging data in tables
Metric Aggregation
Metric Expression Functions
Metric Expressions
Metrics
Metrics and Insight
Metrics and queries
Migrate brokers by modifying broker IDs in meta.properties
Migrate data on the same host
Migrate Databases from the Embedded Database Server to the External PostgreSQL Database Server
Migrate from the Cloudera Manager External PostgreSQL Database Server to a MySQL/Oracle Database Server
Migrate ResourceManager to another host
Migrate the Ranger Admin role instance to a new host
Migrate the Ranger KMS db role instance to a new host
Migrate the Ranger KMS KTS role instance to a new host
Migrate to multiple Kudu masters
Migrate to strongly consistent indexing
Migrating ACLs from Key Trustee KMS to Ranger KMS
Migrating Consumer Groups Between Clusters
Migrating Data Using Sqoop
Migrating database configuration to a new location
Migrating Embedded PostgreSQL Database to External PostgreSQL Database
Migrating from PostgreSQL Database Server to MySQL/Oracle Database Server
Migrating from Sentry to Ranger
Migrating from the Cloudera Manager Embedded PostgreSQL Database Server to an External PostgreSQL Database
Migrating Hue service by adding new role instances
Migrating Hue service using Add Service wizard
Migrating Keys from a Java KeyStore to Cloudera Navigator Key Trustee Server
Migrating Ranger Key Management Server Role Instances to a New Host
Migrating Ranger Usersync and Tagsync role groups
Migrating Solr replicas
Migration from Fair Scheduler to Capacity Scheduler
Migration Guide
MIN
MIN function
Minimize cluster distruption during planned downtime
Miscellaneous functions
Missing Containers page
MOB cache properties
Modify a provider in an existing provider configuration
Modify custom service parameter in descriptor
Modify GCS Bucket Permissions
Modify interpreter settings
Modifying a collection configuration generated using an instance directory
Modifying a connector using Kafka Connect in SMM
Modifying a Kafka topic
Modifying Configuration Properties Using Cloudera Manager
Modifying Impala Startup Options
Modifying the Health Threshold
Modifying the session cookie timeout value
Monitor cluster health with ksck
Monitor Health Tests
Monitor Metrics
Monitor RegionServer grouping
Monitor the BlockCache
Monitor your Cluster from the SMM UI
Monitoring
Monitoring a Cluster Using Cloudera Manager
Monitoring Activities
Monitoring and Debugging Spark Applications
Monitoring and Diagnostics
Monitoring and Diagnostics
Monitoring Apache Impala
Monitoring Apache Kudu
Monitoring checkpoint latency for cluster replication
Monitoring cluster profile using Kafka Connect in SMM
Monitoring Clusters
Monitoring connector profile using Kafka Connect in SMM
Monitoring connector settings using Kafka Connect in SMM
Monitoring connectors using Kafka Connect in SMM
Monitoring end to end latency for Kafka topic
Monitoring End-to-End Latency using Streams Messaging Manager
Monitoring heap memory usage
Monitoring Hosts
Monitoring Impala Queries
Monitoring Kafka brokers
Monitoring Kafka cluster replications by quick ranges
Monitoring Kafka Cluster Replications using Streams Messaging Manager
Monitoring Kafka clusters
Monitoring Kafka Clusters using Streams Messaging Manager
Monitoring Kafka Connect using Streams Messaging Manager
Monitoring Kafka consumers
Monitoring Kafka producers
Monitoring Kafka topics
Monitoring replication latency for cluster replication
Monitoring replication throughput and latency by values
Monitoring Replication with Streams Messaging Manager
Monitoring Service Status
Monitoring Services
Monitoring Spark Applications
Monitoring status of the clusters to be replicated
Monitoring the performance of HDFS replication policies
Monitoring the performance of Hive/Impala replication policies
Monitoring throughput for cluster replication
Monitoring topics to be replicated
Monitoring YARN Applications
More Resources
Morphline commands overview
Move HBase Master Role to another host
Moving a Host Between Clusters
Moving a NameNode to a different host using Cloudera Manager
Moving and Resizing Charts
Moving highly available NameNode, failover controller, and JournalNode roles using the Migrate Roles wizard
Moving Monitoring Data on an Active Cluster
Moving NameNode roles
Moving the Cloudera Manager Server to a New Host
Moving the Hue service to a different host
Moving the JournalNode edits directory for a role group using Cloudera Manager
Moving the JournalNode edits directory for a role instance using Cloudera Manager
Moving the Oozie service to a different host
Multi-Raft configuration for efficient write performances
Multi-server LDAP/AD autentication
Multilevel partitioning
Multipart upload
MySQL: 1040, 'Too many connections' exception
NameNode architecture
NameNode Health Tests
NameNode Metrics
NameNodes
NameNodes
Navigator Audit Server Health Tests
Navigator Audit Server Metrics
Navigator Encrypt
Navigator Encrypt
Navigator Encrypt
Navigator Encrypt
Navigator Encrypt Access Control List
Navigator Encrypt Overview
Navigator HSM KMS backed by SafeNet Luna HSM Metrics
Navigator HSM KMS backed by Thales HSM Metrics
Navigator Key HSM
Navigator Key Trustee Server
Navigator Luna KMS Metastore Health Tests
Navigator Luna KMS Metastore Metrics
Navigator Luna KMS Proxy Health Tests
Navigator Luna KMS Proxy Metrics
Navigator Metadata Server Health Tests
Navigator Metadata Server Metrics
Navigator Thales KMS Metastore Health Tests
Navigator Thales KMS Metastore Metrics
Navigator Thales KMS Proxy Health Tests
Navigator Thales KMS Proxy Metrics
NDV function
Near Real Time Indexing
Network and I/O threads
Network Interface Metrics
Networking and Security Requirements
Networking Considerations for Virtual Private Clusters
Networking parameters
New topic and consumer group discovery
NFS Gateway Health Tests
NFS Gateway Metrics
Nginx configuration for Prometheus
Nginx installtion
Nginx proxy configuration over Prometheus
NiFi lineage
NiFi metadata collection
NiFi Registry TLS/SSL Properties
NiFi TLS/SSL properties
NodeManager Health Tests
NodeManager Metrics
Non-covering range partitions
Notes about replication
Notifiers
NTILE
Number-of-Regions Quotas
Number-of-Tables Quotas
Obtain and Deploy Keys and Certificates for TLS/SSL
Obtaining client to Ozone through session
Obtaining resources to Ozone
Obtaining Time-Series Data Using the API
Off-heap BucketCache
Offloading Application Logs to Ozone
OFFSET clause
Offsets Subcommand
Omid Health Tests
Omid Metrics
Omid tso server Health Tests
Omid tso server Metrics
On-demand Metadata
On-demand Metadata
On-premise to Cloud and Kafka Version Upgrade
Oozie
Oozie
Oozie
Oozie configurations with CDP services
Oozie database configurations
Oozie Health Tests
Oozie High Availability
Oozie Load Balancer configuration
Oozie Metrics
Oozie Properties in Cloudera Runtime 7.1.7
Oozie scheduling examples
Oozie security enhancements
Oozie Server Health Tests
Oozie Server Metrics
OpDB overview
Operating System Requirements
Operating system requirements
Operational Database
Operational Database
Operational Database Overview
Operational Database overview
Operational Database powered by Apache Accumulo Overview
Operational Database powered by Apache Accumulo Reference
Operators
Optimize mountable HDFS
Optimize performance for evaluating SQL predicates
Optimizer hints
Optimizing data storage
Optimizing HBase I/O
Optimizing NameNode disk space with Hadoop archives
Optimizing performance
Optimizing Performance for HDFS Transparent Encryption
Optimizing Performance in Cloudera Runtime
Optimizing queries using partition pruning
Optimizing S3A read performance for different file types
Options to determine differences between contents of snapshots
Options to rerun Oozie workflows in Hue
ORC file format
ORC vs Parquet formats
Orchestrate a rolling restart with no downtime
ORDER BY clause
Orphaned snapshots
Other known issues
Other Tasks and Settings
OVER
Overriding Configuration Properties
Overriding custom keystore alias on a Ranger KMS Server
Overriding custom keystore alias on a Ranger KMS Server
Overview
Overview
Overview
Overview
Overview
Overview
Overview
Overview of Hadoop archives
Overview of HDFS
Overview of Oozie
Overview of Parcels
Overview of proxy usage and load balancing for Search
Overview of Storage Container Manager in High Availability
Overview of the Ozone Manager in High Availability
Overview page
Overview Tab
Ozone
Ozone
Ozone
Ozone
Ozone architecture
Ozone configuration options to work with CDP components
Ozone DataNode Health Tests
Ozone DataNode Metrics
Ozone Health Tests
Ozone Manager Health Tests
Ozone Manager Metrics
Ozone Manager nodes in High Availability
Ozone Metrics
Ozone Prometheus Health Tests
Ozone Prometheus Metrics
Ozone Properties in Cloudera Runtime 7.1.7
Ozone Recon Health Tests
Ozone Recon Metrics
Ozone security architecture
Ozone trash overview
Packaging different versions of libraries with an Apache Spark application
PAM authentication
Parameters to configure the Disk Balancer
Parcel Configuration Settings
Parcel Life Cycle
Parcel Locations
Parcels
Parquet
Parquet
Partition pruning
Partition Pruning for Queries
Partition refresh and configuration
Partitioning
Partitioning
Partitioning examples
Partitioning for Kudu Tables
Partitioning guidelines
Partitioning limitations
Partitioning limitations
Partitioning tables
Partitions
Partitions and performance
Passive Database Health Tests
Passive Database Metrics
Passive Key Trustee Server Health Tests
Passive Key Trustee Server Metrics
Pausing a Cluster in AWS
PERCENT_RANK
Perform a backup of the HDFS metadata
Perform a disk hot swap for DataNodes using Cloudera Manager
Perform ETL by ingesting data from Kafka into Hive
Perform master hostname changes
Perform scans using HBase Shell
Perform the migration
Perform the recovery
Perform the removal
Performance and Scalability
Performance and scalability limitations to consider for replication policies
Performance and storage considerations for Spark SQL DROP TABLE PURGE
Performance Best Practices
Performance comparison between Cloudera Manager and Prometheus
Performance Considerations
Performance considerations
Performance Considerations
Performance considerations for UDFs
Performance Impact of Encryption
Performance improvement using partitions
Performance issues
Performance Management
Performance Trade Offs
Performance tuning
Performance tuning for Ozone
Performant .NET producer
Performing Maintenance on a Cluster Host
Periodic Stacks Collection
Periodically rebuilding a materialized view
Phoenix
Phoenix
Phoenix
Phoenix
Phoenix Health Tests
Phoenix Metrics
Phoenix Properties in Cloudera Runtime 7.1.7
Phoenix-Spark connector usage examples
Physical backups of an entire node
Pillars of Security
Pipelines page
Placement rule policies
Placing Ozone DataNodes in offline mode
Plan the data movement across disks
Planning for Apache Impala
Planning for Apache Kudu
Planning for Infra Solr
Planning for Streams Replication Manager
Planning overview
Platform and OS
Platform and OS
Pluggable authentication modules in HiveServer
Populating an HBase Table
Port and network requirements for Replication Manager on CDP Private Cloud Base
Ports
Ports Used by Cloudera Manager
Ports Used by Cloudera Navigator Key Trustee Server
Ports Used by Cloudera Runtime Components
Ports Used by DistCp
Ports Used by Impala
Ports Used by Third-Party Components
POST /admin/audits/ API
Post-migration verification
Pre-defined Access Policies for Schema Registry
Predicate push-down optimization
Predicates
Preloaded resource-based services and policies
Prepare for master hostname changes
Prepare for removal
Prepare for the migration
Prepare for the recovery
Prepare Kerberos authentication-enabled clusters for replication
Prepare to back up the HDFS metadata
Prepare to replicate using replication policies
Preparing a New Cluster
Preparing a thrift server and client
Preparing for Encryption Using Cloudera Navigator Encrypt
Preparing the hardware resources for HDFS High Availability
Prerequisite
Prerequisites
Prerequisites
Prerequisites
Prerequisites
Prerequisites
Prerequisites and Assumptions
Prerequisites and exceptions for the example configuration
Prerequisites for configuring short-ciruit local reads
Prerequisites for configuring TLS/SSL for Oozie
Prerequisites for enabling erasure coding
Prerequisites for enabling HDFS HA using Cloudera Manager
Prerequisites for installing Atlas
Prerequisites for installing Docker
Prerequisites for Prometheus configuration
Prerequisites for setting up Atlas HA
Prerequisites to configure TLS/SSL for HBase
Prerequisites to configure TLS/SSL for HBase
Presentation of Aggregate Data
Preventing inadvertent deletion of directories
Previewing tables using Data Preview
Primary key design
Primary key index
Principal name mapping
Principal name mapping
Privileged commands for Cloudera Manager installation
Problem area: Compose page
Problem area: Queries page
Problem area: Reports page
Process Management
Processes
Production Installation
Profiler Admin Agent Health Tests
Profiler Admin Agent Metrics
Profiler Manager Metrics
Profiler Metrics Agent Health Tests
Profiler Metrics Agent Metrics
Profiler Scheduler Agent Health Tests
Profiler Scheduler Agent Metrics
Profiler Scheduler Metrics
Prometheus configuration for SMM
Prometheus for SMM limitations
Prometheus metrics overview
Prometheus properties configuration
Propagating classifications through lineage
Propagation of tags as deferred actions
Properties for configuring centralized caching
Properties for configuring short-circuit local reads on HDFS
Properties for configuring the Balancer
Properties to set the size of the NameNode edits directory
Protocol between consumer and broker
Provide Read-only access to Queue Manager UI
Provide user permissions
Provide user permissions
Proxy Cloudera Manager through Apache Knox
Purging deleted entities
Purposely using a stale materialized view
PUT /admin/purge/ API
Putting all Hosts in an Upgrade Domain group into Maintenance Mode
Queries are not appearing on the Queries page
Query an existing Kudu table from Impala
Query column is empty but you can see the DAG ID and Application ID
Query Details
Query fails with "Counters limit exceeded" error message
Query Join Performance
Query options
Query Processor Health Tests
Query Processor Metrics
Query results cache
Query sample data
Query scheduling
Query Server Health Tests
Query Server Metrics
Query vectorization
Query vectorization properties
Querying
Querying a schema
Querying correlated data
Querying files into a DataFrame
Querying Kafka data
Querying live data from Kafka
Querying metric data
Querying the information_schema database
Queue ACLs
Quick Start Deployment for a Streams Cluster
Quota enforcement
Quota violation policies
Quotas
Rack awareness
Rack awareness (Location awareness)
Range partitioning
Range partitioning
Ranger
Ranger
Ranger
Ranger
Ranger
Ranger access conditions
Ranger AD Integration
Ranger Admin Health Tests
Ranger Admin Metrics
Ranger Audit Filters
Ranger audit schema reference
Ranger console navigation
Ranger database schema reference
Ranger Health Tests
Ranger KMS
Ranger KMS Health Tests
Ranger KMS Metrics
Ranger KMS Server Health Tests
Ranger KMS Server Metrics
Ranger KMS Server with KTS Health Tests
Ranger KMS Server with KTS Metrics
Ranger KMS with Key Trustee Server Health Tests
Ranger KMS with Key Trustee Server Metrics
Ranger Metrics
Ranger policies allowing create privilege for Hadoop_SQL databases
Ranger policies allowing create privilege for Hadoop_SQL tables
Ranger policies for Kudu
Ranger Policies Overview
Ranger Properties in Cloudera Runtime 7.1.7
Ranger Raz Health Tests
Ranger Raz Metrics
Ranger Raz Server Health Tests
Ranger Raz Server Metrics
Ranger RMS - HIVE-HDFS ACL Sync Overview
Ranger RMS Health Tests
Ranger RMS Metrics
Ranger RMS Server Health Tests
Ranger RMS Server Metrics
Ranger Security Zones
Ranger special entities
Ranger tag-based policies
Ranger Tagsync Health Tests
Ranger Tagsync Metrics
Ranger UI authentication
Ranger UI authorization
Ranger user management
Ranger Usersync
Ranger Usersync Health Tests
Ranger Usersync Metrics
RANK
Re-encrypting an EDEK
Re-encrypting Encrypted Data Encryption Keys (EDEKs)
Read access
Read and write operations
Read and write requests with Ozone Manager in High Availability
Read operations (scans)
Read replica properties
Read the Events
Reading and writing Hive tables in R
Reading and writing Hive tables in Zeppelin
Reading data from HBase
Reading data through HWC
Reading Hive ORC tables
Reads (scans)
REAL data type
Reassigning replicas between log directories
Reassignment examples
Rebalance after adding Kafka broker
Rebalance after demoting Kafka broker
Rebalance after removing Kafka broker
Rebalancing partitions
Rebalancing with Cruise Control
Rebuild a Kudu filesystem layout
Recommendations for client development
Recommendations for managing Docker containers on YARN
Recommended configurations for the Balancer
Recommended configurations for the balancer
Recommended deployment architecture
Recommended Hive configurations when using Ozone
Recommissioning an Ozone DataNode
Recommissioning Hosts
Recommissioning Role Instances
Record management
Record order and assignment
Record User Data Paths
Records
Recover data from a snapshot
Recover from a dead Kudu master
Recover from disk failure
Recover from full disks
Recovering a Key Trustee Server
Redaction of Sensitive Information from Diagnostic Bundles
Redeploying the Oozie ShareLib
Redeploying the Oozie sharelib using Cloudera Manager
Reducing the Size of Data Structures
Refer to a table using dot notation
Reference architecture
Referencing Amazon S3 in URIs
Referencing S3 Credentials for YARN, MapReduce, or Spark Clients
Referencing S3 Data in Applications
Referer checking failed
Refining query search using filters
REFRESH AUTHORIZATION statement
REFRESH FUNCTIONS statement
REFRESH statement
RegionServer Health Tests
RegionServer Metrics
Registering a Lily HBase Indexer Configuration with the Lily HBase Indexer Service
Registering Cloudera Navigator Encrypt with Key Trustee Server
Registering the UDF
Relax WAL durability
Release notes
Reloading, viewing, and filtering functions
Remote Topics
Remove a DataNode
Remove a RegionServer from RegionServer grouping
Remove Cloudera Manager, User Data, and Databases
Remove custom service parameter from descriptor
Remove Kudu masters
Remove or add storage directories for NameNode data directories
Remove storage directories using Cloudera Manager
Removing a Chart from a Custom Dashboard
Removing a Filter
Removing a Host From a Cluster
Removing an Event Filter
Removing Ozone DataNodes from the cluster
Removing scratch directories
Renaming a Cluster
Renaming a Service
Renew and Redistribute Certificates
Renewing a License
Reorder placement rules
Repairing partitions manually using MSCK repair
Replace a disk on a DataNode host
Replace a ZooKeeper disk
Replace a ZooKeeper role on an unmanaged cluster
Replace a ZooKeeper role with ZooKeeper service downtime
Replace a ZooKeeper role without ZooKeeper service downtime
Replacing Key Trustee Server Certificates
Replicate pre-exist data in an active-active deployment
Replicating Data
Replicating data to Impala clusters
Replicating from unsecure to secure clusters
Replication
Replication across three or more clusters
Replication caveats
Replication Flows Overview
Replication Manager
Replication Manager in CDP Private Cloud Base
Replication of encrypted data
Replication of Impala and Hive User Defined Functions (UDFs)
Replication requirements
Report craches using breakpad
Reports
Reports Manager
Reports Manager Health Tests
Reports Manager Metrics
Repository Configuration Files
Request a timeline-consistent read
Required Databases
Required ports in Kerberos authentication-enabled clusters for replication
Requirements for compressing and extracting files using Hue File Browser
Requirements for Oozie High Availability
Reserved words
Resetting Configuration Properties to the Default Value
Resetting Hue user password
Resolving "The user authorized on the connection does not match the session username" error
Resolving "You are accessing a non-optimized Hue" error
Resource allocation overview
Resource distribution workflow
Resource Management
Resource Management
Resource Planning for Data at Rest Encryption
Resource Scheduling and Management
Resource Tuning Example
Resource-based Services and Policies
ResourceManager Health Tests
ResourceManager Metrics
Resources
REST endpoints supported on Ozone S3 Gateway
Restarting a Cloudera Runtime Service
Restarting Services and Instances after Configuration Changes
Restarting the Cloudera Management Service
Restore an HBase snapshot from Amazon S3
Restore an HBase snapshot from Amazon S3 with a new name
Restore data from a replica
Restore HDFS metadata from a backup using Cloudera Manager
Restore Key Trustee Server from ktbackup.sh backups
Restore Key Trustee Server in package-based installations
Restore Key Trustee Server in parcel-based installations
Restore tables from backups
Restoring a collection
Restoring HDFS snapshots
Restoring NameNode metadata
Restoring Navigator Key Trustee Server
Restoring the Cloudera Manager configuration
Restricting access to Kafka metadata in Zookeeper
Restricting classifications based on user permission
Restricting supported ciphers for Hue
Restricting user login
Results Tab
Results Tab
Retaining logs for Replication Manager
Retries
Retrieving log directory replica assignment information
Retrieving metric data
Retrieving the clusterstate.json file
Review Changes
REVOKE ROLE statement
REVOKE statement
Role Assignments
Role Groups
Role Instance Reference
Role Instances
ROLE statements
Roll Over an Existing Key
Rolling Encryption Keys
Rolling Restart
Rotate Auto-TLS Certificate Authority and Host Certificates
Rotate the master key/secret
Row-level filtering and column masking in Hive
Row-level filtering in Hive with Ranger policies
Row-level filtering in Impala with Ranger policies
ROW_NUMBER
RPC timeout traces
Run a tablet rebalancing tool in Cloudera Manager
Run a tablet rebalancing tool in command line
Run a tablet rebalancing tool on a rack-aware cluster
Run the Cloudera Manager Server Installer
Run the Cloudera Manager Server Installer
Run the Disk Balancer plan
Run the spark-submit job
Run the tablet rebalancing tool
Running a Hive command
Running a MapReduce Job
Running a query on a different Hive instance
Running a query on a different Hive instance
Running a Spark MLlib example
Running an interactive session with the Livy API
Running Apache Spark Applications
Running applications with CDS 3.2.3 with GPU Support
Running Commands and SQL Statements in Impala Shell
Running Diagnostic Commands for Roles
Running Dockerized Applications on YARN
Running HBaseMapReduceIndexerTool
Running PySpark in a virtual environment
Running sample Spark applications
Running shell commands
Running Spark 3 Applications
Running Spark 3 Applications with CDS 3.2.3
Running Spark applications on secure clusters
Running Spark applications on YARN
Running Spark Python applications
Running the balancer
Running the HBCK2 tool
Running the Host Inspector
Running the Prune Command Using Cloudera Manager Admin Console
Running the Prune Command Using the Cloudera Manager API
Running YARN Services
Running your first Spark application
Runtime 7.1.7.2000-305
Runtime 7.1.7.2002-1
Runtime 7.1.7.2009-1
Runtime 7.1.7.2010-1
Runtime 7.1.7.2011-1
Runtime 7.1.7.2013-1
Runtime 7.1.7.2016-1
Runtime 7.1.7.2021-1
Runtime 7.1.7.2023-1
Runtime 7.1.7.2024-1
Runtime 7.1.7.2025-2
Runtime 7.1.7.2026-3
Runtime 7.1.7.2030-1
Runtime 7.1.7.2032-1
Runtime 7.1.7.2035-2
Runtime 7.1.7.2038-1
Runtime 7.1.7.2040-4
Runtime 7.1.7.2046-1
Runtime 7.1.7.2047-1
Runtime 7.1.7.2050-1
Runtime 7.1.7.3000-77
Runtime 7.1.7.3008-2
Runtime 7.1.7.3010-1
Runtime 7.1.7.3011-1
Runtime 7.1.7.3013-1
Runtime 7.1.7.3014-1
Runtime 7.1.7.3016-1
Runtime Cluster Hosts and Role Assignments
Runtime environment for UDFs
Runtime error: Could not create thread: Resource temporarily unavailable (error 11)
Runtime Filtering
S3 Connector Properties in Cloudera Runtime 7.1.7
S3 Gateway Health Tests
S3 Gateway Metrics
S3 Performance Checklist
S3A and Checksums (Advanced Feature)
S3Guard with Sqoop
Safely Writing to S3 Through the S3A Committers
SAML properties
Sample Custom Alert Script
Sample pom.xml file for Spark Streaming with Kafka
Sample Python Code
Sample script to connect Spark to Ozone
SAN Certificates
Save a YARN service definition
Saving a Chart
Saving aliases
Saving Charts to a New Dashboard
Saving Charts to an Existing Dashboard
Saving Charts to Dashboards
Saving searches
Saving the search results
Scalability Considerations
Scaling Kudu
Scaling Limits and Guidelines
Scaling recommendations and limitations
Scaling recommendations and limitations
Scheduler performance improvements
Scheduling among queues
Scheduling in Oozie using cron-like syntax
Schema alterations
Schema design limitations
Schema design limitations
Schema Entities
Schema objects
Schema Registry
Schema Registry
Schema Registry
Schema Registry Authorization through Ranger Access Policies
Schema Registry Component Architecture
Schema Registry Concepts
Schema Registry Health Tests
Schema Registry Metrics
Schema Registry Overview
Schema Registry Overview
Schema Registry Properties in Cloudera Runtime 7.1.7
Schema Registry Server Health Tests
Schema Registry Server Metrics
Schema Registry TLS Properties
Schema Registry Use Cases
Schemaless mode overview and best practices
Script with HBase Shell
SDX
Search
Search
Search
Search
Search
Search and other Runtime components
Search applications
Search Ranger reports
Search Tutorial
Searching by topic name
Searching for entities using Business Metadata attributes
Searching for entities using classifications
Searching for Properties
Searching Kafka cluster replications by source
Searching metadata tags
Searching overview
Searching queries
Searching tables
Searching using terms
Searching with Metadata
Searching Within the File System
Secondary Sort
SecondaryNameNode Health Tests
SecondaryNameNode Metrics
Secure access mode introduction
Secure by Design
Secure Prometheus for SMM
Secure Your Cluster
Securing Access to Hadoop Cluster: Apache Knox
Securing an endpoint under AutoTLS
Securing Apache Hive
Securing Apache Impala
Securing Apache Kafka
Securing Atlas
Securing Atlas
Securing Cloudera Search
Securing configs with ZooKeeper ACLs and Ranger
Securing Cruise Control
Securing database connections with TLS/SSL
Securing database connections with TLS/SSL
Securing DataNodes
Securing Hive metastore
Securing HiveServer using LDAP
Securing Hue
Securing Hue passwords with scripts
Securing Impala
Securing Kafka Connect
Securing Schema Registry
Securing sensitive information using a Secure Credential Storage Provider (Technical Preview)
Securing sessions
Securing Streams Messaging Manager
Securing Streams Messaging Manager
Securing Streams Replication Manager
Securing the Key Management System (KMS)
Securing the S3A Committers
Security considerations for encrypted data during replication
Security considerations for UDFs
Security examples
Security examples
Security Levels
Security Management
Security Management Model
Security Model and Operations on S3
Security overview
Security Terms
Security tokens in Ozone
Security Zones Administration
Security Zones Example Use Cases
Select Services
SELECT statement
Selecting a Point In Time or a Time Range
Selecting Columns to Show in the Activities List
Selecting Columns to Show in the Tasks List
Sending Diagnostic Data to Cloudera for YARN Applications
Sending Usage and Diagnostic Data to Cloudera
Sentry Health Tests
Sentry Metrics
Sentry Server Health Tests
Sentry Server Metrics
Sentry to Ranger replication for Hive external tables
Server and Client Configuration
Server management limitations
Server management limitations
Server Metrics
Service Dependencies in Cloudera Manager
Service Monitor Health Tests
Service Monitor Metrics
Service Monitor Requirements
Service Summary
Services backed by PostgreSQL fail or stop responding
Set Application-Master resource-limit for a specific queue
Set credentials for Ranger Usersync
Set default Application Master resource limit
Set global application limits
Set HADOOP_CONF to the destination cluster
Set HDFS quotas
Set Maximum Application limit for a specific queue
Set Ordering policies within a specific queue
Set properties in Cloudera Manager
Set proxy server authentication for clusters using Kerberos
SET statement
Set up
Set Up a Cluster Using the Wizard
Set Up a Gateway Host to Restrict Access to the Cluster
Set up a PostgreSQL database
Set up a storage policy for HDFS
Set Up a Streaming Cluster
Set Up Access to Cloudera EDH (Microsoft Azure Marketplace)
Set Up an Environment
Set up an Oracle database
Set up GCP Cloud HSM for Ranger KMS, KTS, and KeyHSM
Set up Luna 6 HSM for Ranger KMS, KTS, and KeyHSM
Set up Luna 7 HSM for Ranger KMS w/database
Set up Luna 7 HSM for Ranger KMS, KTS, and KeyHSM
Set up MariaDB or MySQL database
Set up MirrorMaker in Cloudera Manager
Set up SSD storage using Cloudera Manager
Set up WebHDFS on a secure cluster
Set user limits within a queue
Setting an Advanced Configuration Snippet for a Cloudera Runtime Service
Setting an Advanced Configuration Snippet for a Cluster
Setting capacity estimations and goals
Setting consumer and producer table properties
Setting global maximum application priority
Setting HDFS quotas in Cloudera Manager
Setting Java system properties for Solr
Setting Oozie permissions
Setting Python path variables for Livy
Setting Quotas
Setting SELinux Mode
Setting the cache timeout
Setting the Idle Query and Idle Session Timeouts
Setting the Oozie database timezone
Setting the secure storage password as an environment variable
Setting the trash interval
Setting the vm.swappiness Linux Kernel Parameter
Setting Timeout and Retries for Thrift Connections to Backend Client
Setting Timeouts in Impala
Setting up a JDBC URL connection override
Setting Up a Web Server
Setting Up a Web Server
Setting up and configuring the ABFS connector
Setting up Atlas High Availability
Setting up Atlas Kafka import tool
Setting up basic authentication with TLS for Prometheus
Setting up CipherTrust HSM for Ranger KMS, KTS, and KeyHSM
Setting Up Data at Rest Encryption for HDFS
Setting up Data Cache for Remote Reads
Setting up Data Cache for Remote Reads
Setting Up HDFS Caching
Setting up JDBCStorageHandler for Postgres
Setting Up Key Trustee Server High Availability
Setting up mTLS for Prometheus
Setting up o3fs
Setting up secure access mode
Setting Up Sqoop
Setting up the backend Hive metastore database
Setting up the cost-based optimizer and statistics
Setting up the development environment
Setting up the metastore database
Setting up TLS for Prometheus
Setting user limits for HBase
Setting user limits for Kafka
Settings to avoid data loss
Setup Database
Shell commands
Shiro Settings: Reference
shiro.ini Example
SHOW CURRENT ROLES statement
SHOW MATERIALIZED VIEWS
SHOW ROLE GRANT GROUP statement
SHOW ROLES statement
SHOW statement
Showing Atlas Server status
Showing materialized views
Shut Down Impala
SHUTDOWN statement
Shutting Down and Starting Up the Cluster
Simple .NET consumer
Simple .NET producer
Simple Java consumer
Simple Java producer
Single tablet write operations
Size the BlockCache
Sizing estimation based on network and disk message throughput
Sizing NameNode heap memory
Slow name resolution and nscd
SMALLINT data type
SMM property configuration in Cloudera Manager for Prometheus
Snapshot failures
Snapshot policies in Replication Manager
Snapshots
Snapshots history
Software Distribution Management
Solr and HDFS - the block cache
Solr Health Tests
Solr Metrics
Solr Properties in Cloudera Runtime 7.1.7
Solr Replica Metrics
Solr Server Health Tests
Solr Server Metrics
Solr server tuning categories
Solr Shard Metrics
solrctl Reference
Solutions to Common Problems
Sorting the Activities List
Sorting the Tasks List
Space quotas
Spark
Spark
Spark
Spark
Spark 3 Health Tests
Spark 3 Metrics
Spark 3 Properties in Cloudera Runtime 7.1.7
Spark actions that produce Atlas entities
Spark application model
Spark audit entries
Spark cluster execution overview
Spark entities created in Apache Atlas
Spark entity metadata migration
Spark execution model
Spark Health Tests
Spark indexing using morphlines
Spark integration best practices
Spark integration known issues and limitations
Spark integration limitations
Spark Job ACLs
Spark lineage
Spark metadata collection
Spark Metrics
Spark on YARN deployment modes
Spark Properties in Cloudera Runtime 7.1.7
Spark relationships
Spark security
Spark SQL example
Spark Streaming and Dynamic Allocation
Spark Streaming Example
Spark troubleshooting
Spark tuning
spark-submit command options
Specify the JDBC connection string
Specify truststore properties
Specifying domains or pages to which Hue can redirect users
Specifying hosts to improve HDFS replication policy performance
Specifying hosts to improve Hive replication policy performance
Specifying HTTP request methods
Specifying Impala Credentials to Access S3
Specifying Racks for Hosts
Specifying racks for hosts
Specifying the Diagnostic Data Directory
Specifying TLS/SSL Minimum Allowed Version and Ciphers
Specifying trusted users
Speeding up Job Commits by Increasing the Number of Threads
Spooling Query Results
SQL migration to Impala
SQL statements
SQL Stream Builder Metrics
SQLContext and HiveContext
Sqoop
Sqoop
Sqoop
Sqoop 2 Health Tests
Sqoop 2 Metrics
Sqoop 2 Server Health Tests
Sqoop 2 Server Metrics
Sqoop Hive import stops when HS2 does not use Kerberos authentication
Sqoop Import into ADLS
Sqoop Import into Amazon S3
SQOOP_CLIENT Properties in Cloudera Runtime 7.1.7
SRM Command Line Tools
SRM Distributed Herder metrics Metrics
SRM Driver Health Tests
SRM Driver Metrics
SRM security example
SRM Service Health Tests
SRM Service Metrics
srm-control
srm-control Options Reference
SSE-C: Server-Side Encryption with Customer-Provided Encryption Keys
SSE-KMS: Amazon S3-KMS Managed Encryption Keys
SSE-S3: Amazon S3-Managed Encryption Keys
Stale Configurations
Standard stream logs
Start and stop Kudu processes
Start and stop queues
Start and stop the NFS Gateway services
Start HBase
Start Prometheus
Start Queue
Start the NFS Gateway services
Starting a Cloudera Runtime Service on All Hosts
Starting All the Roles on a Host
Starting and Stopping Apache Impala
Starting and Stopping Cloudera Management Service Roles
Starting and stopping HBase using Cloudera Manager
Starting Apache Hive
Starting compaction manually
Starting Hive on an insecure cluster
Starting Hive using a password
Starting the Cloudera Management Service
Starting the Embedded PostgreSQL Database
Starting the Lily HBase NRT Indexer Service
Starting the Oozie server
Starting, Stopping, and Restarting Cloudera Manager Agents
Starting, Stopping, and Restarting Role Instances
Starting, Stopping, and Restarting the Cloudera Manager Server
Starting, Stopping, Refreshing, and Restarting a Cluster
State Management
Static Service Pools
Statistics generation and viewing commands
Status
Status Summary
Status Summary
STDDEV, STDDEV_SAMP, STDDEV_POP functions
Step 11: Inspect Cluster
Step 1: Configuration changes on HDP and CDP clusters
Step 1: Configure a Repository for Cloudera Manager
Step 1: Enabling hdfs user to run YARN jobs
Step 1: Identify Roles that Use the Embedded Database Server
Step 1: Install Cloudera Manager and CDP
Step 1: Welcome (Add Cluster - Installation)
Step 1: Welcome (Add Cluster - Installation)
Step 1: Worker host configuration
Step 2: Cluster Basics
Step 2: Cluster Basics
Step 2: Configuration changes on the CDP cluster
Step 2: Configuring user to run YARN jobs on both the clusters
Step 2: Install Java Development Kit
Step 2: Install JCE policy files for AES-256 encryption
Step 2: Migrate Databases from the Embedded Database Server to the External PostgreSQL Database Server
Step 2: Worker host planning
Step 3: Cluster size
Step 3: Create the Kerberos Principal for Cloudera Manager Server
Step 3: Deploy Cloudera Manager Server and Cloudera Manager Agents
Step 3: Install Cloudera Manager Server
Step 3: Running DistCp job on CDP cluster
Step 3: Running the DistCp job on the HDP cluster
Step 3: Setup Auto-TLS
Step 3: Setup Auto-TLS
Step 4. Install and Configure Databases
Step 4: Enable Kerberos using the wizard
Step 4: Specify Hosts
Step 4: Specify Hosts
Step 5: Create the HDFS superuser
Step 5: Select Repository
Step 5: Select Repository
Step 5: Set up and configure the Cloudera Manager database
Step 6: Get or create a Kerberos principal for each user account
Step 6: Install Parcels
Step 6: Select JDK
Step 6: Start the Cloudera Manager Server and Agents
Step 6: Verify container settings on cluster
Step 6A: Cluster container capacity
Step 6B: Container parameters checking
Step 7: Enter Login Credentials
Step 7: MapReduce configuration
Step 7: Prepare the cluster for each user
Step 7: Set Up a Cluster Using the Wizard
Step 7A: MapReduce settings checking
Step 8: Inspect Cluster
Step 8: Install Agents
Step 8: Verify that Kerberos security is working
Step 9: (Optional) Enable authentication for HTTP web consoles for Hadoop roles
Step 9: Install Parcels
Steps 4 and 5: Verify settings
Stop all Services
Stop HBase
Stop Queue
Stop replication in an emergency
Stop the NFS Gateway services
Stopping a Cloudera Runtime Service on All Hosts
Stopping All the Roles on a Host
Stopping the Cloudera Management Service
Stopping the Embedded PostgreSQL Database
Stopping the Oozie server
Storage
Storage Container Manager Health Tests
Storage Container Manager Metrics
Storage Container Manager operations in High Availability
Storage group classification
Storage group pairing
Storage Space Planning for Cloudera Manager
Storage Systems Supports
Store HBase snapshots on Amazon S3
Storing Data Using Ozone
Storing medium objects (MOBs)
Streaming SQL Console Health Tests
Streaming SQL Console Metrics
Streaming SQL Engine Health Tests
Streaming SQL Engine Metrics
Streams Messaging
Streams Messaging
Streams Messaging Manager
Streams Messaging Manager
Streams Messaging Manager
Streams Messaging Manager Health Tests
Streams Messaging Manager Metrics
Streams Messaging Manager Overview
Streams Messaging Manager Properties in Cloudera Runtime 7.1.7
Streams Messaging Manager Rest Admin Server Health Tests
Streams Messaging Manager Rest Admin Server Metrics
Streams Messaging Manager UI Server Health Tests
Streams Messaging Manager UI Server Metrics
Streams Replication Manager
Streams Replication Manager
Streams Replication Manager
Streams Replication Manager
Streams Replication Manager Architecture
Streams Replication Manager Driver
Streams Replication Manager Health Tests
Streams Replication Manager Metrics
Streams Replication Manager Overview
Streams Replication Manager Properties in Cloudera Runtime 7.1.7
Streams Replication Manager Reference
Streams Replication Manager requirements
Streams Replication Manager Service
STRING data type
String functions
STRUCT complex type
Stub DFS Properties in Cloudera Runtime 7.1.7
Submitting a Python app
Submitting a Scala or Java application
Submitting batch applications using the Livy API
Submitting Spark applications
Submitting Spark Applications to YARN
Submitting Spark applications using Livy
Subqueries in Impala SELECT statements
Subquery restrictions
Subscribing to a topic
SUM
SUM function
Summary
Summary
Support matrix for Replication Manager on CDP Private Cloud Base
Suppressing a Configuration Validation in Cloudera Manager
Suppressing a Health Test
Suppressing Configuration and Parameter Validation Warnings
Suppressing Configuration Validations Before They Trigger Warnings
Suppressing Health Test Results
Symbolizing stack traces
Synchronize table data using HashTable/SyncTable tool
Synchronizing the contents of JournalNodes
Syntax for scm_prepare_database.sh
SYSTEM Category
System Level Broker Tuning
System metadata migration
System requirements
System Requirements for POC Streams Cluster
Table and Column Statistics
Tables
TABLESAMPLE clause
Tablet history garbage collection and the ancient history mark
Tablet Server Health Tests
Tablet Server Metrics
Tag-based Services and Policies
Tags and policy evaluation
Take a snapshot using a shell script
Take HBase snapshots
Taking and deleting HDFS snapshots
Task architecture and load-balancing
Task Attempts
TaskController Error Codes (MRv1)
TaskTracker Health Tests
TaskTracker Hosts
TaskTracker Metrics
Telemetry Publisher Health Tests
Telemetry Publisher Metrics
Terminology
Terminology
Terms
Test MOB storage and retrieval performance
Testing the Installation
Testing the LDAP configuration
Testing with Hue
Tez
Tez Metrics
Tez Properties in Cloudera Runtime 7.1.7
The Actions Menu
The Cloud Storage Connectors
The File Browser
The HDFS mover command
The Hue load balancer not distributing users evenly across various Hue servers
The perfect schema
The Processes Tab
The S3A Committers and Third-Party Object Stores
The Task Distribution Chart
Third-party filesystems
Thread Tuning for S3A Data Upload
Threads
Thrift Server crashes after receiving invalid data
Throttle quota examples
Throttle quotas
Time Line
Time Series Attributes
Time Series Entities and their Attributes
Time Series Table Metrics
Timeline consistency
TIMESTAMP compatibility for Parquet files
TIMESTAMP data type
TINYINT data type
Tips and Best Practices for Jobs
TLS Certificate Requirements and Recommendations
TLS Encryption
TLS Mutual Authentication
TLS/SSL certificate requirements and recommendations
TLS/SSL client authentication
TLS/SSL client authentication
TLS/SSL Issues
TLS/SSL settings for Streams Messaging Manager
Tombstoned or STOPPED tablet replicas
Tool usage
Top-down process for adding a new metadata source
Topics
Topics and Groups Subcommand
Tracer Health Tests
Tracer Metrics
Tracking an Apache Hive query in YARN
Tracking Hive on Tez query execution
Transactional table access
Transactions
Transactions
Transparent Encryption Recommendations for HBase
Transparent Encryption Recommendations for Hive
Transparent Encryption Recommendations for Hue
Transparent Encryption Recommendations for Impala
Transparent Encryption Recommendations for MapReduce and YARN
Transparent Encryption Recommendations for Search
Transparent Encryption Recommendations for Spark
Transparent Encryption Recommendations for Sqoop
Trash behavior with HDFS Transparent Encryption enabled
Trial Installation
Triggers
Troubleshoot RegionServer grouping
Troubleshooting
Troubleshooting ABFS
Troubleshooting Apache Hadoop YARN
Troubleshooting Apache HBase
Troubleshooting Apache Hive
Troubleshooting Apache Impala
Troubleshooting Apache Kudu
Troubleshooting Apache Sqoop
Troubleshooting Cloudera Search
Troubleshooting Cluster Configuration and Operation
Troubleshooting Data Analytics Studio
Troubleshooting Docker on YARN
Troubleshooting HBase
Troubleshooting Hue
Troubleshooting Impala
Troubleshooting Installation Problems
Troubleshooting Linux Container Executor
Troubleshooting NTP stability problems
Troubleshooting on YARN
Troubleshooting Operational Database powered by Apache Accumulo
Troubleshooting Performance of Decommissioning
Troubleshooting Prometheus for SMM
Troubleshooting replication failure in the DAS Event Processor
Troubleshooting replication policies between on-premises clusters
Troubleshooting S3
Troubleshooting SAML authentication
Troubleshooting Security Issues
Troubleshooting Security Issues
Troubleshooting the S3A Committers
TRUNCATE TABLE statement
tsquery Language
tsquery Syntax
Tuning and Troubleshooting Host Decommissioning
Tuning Apache Hadoop YARN
Tuning Apache Impala
Tuning Apache Kafka Performance
Tuning Apache Spark
Tuning Apache Spark Applications
Tuning Cloudera Search
Tuning garbage collection
Tuning HBase Prior to Decommissioning DataNodes
Tuning HDFS Prior to Decommissioning DataNodes
Tuning Hue
Tuning JVM Garbage Collection
Tuning replication
Tuning Resource Allocation
Tuning S3A Uploads
Tuning Spark Shuffle Operations
Tuning the metastore
Tuning the Number of Partitions
Turning safe mode on HA NameNodes
Tutorial
Tutorial: Using Impala, Hive and Hue with Virtual Private Clusters
UDF concepts
UI Tools
Unable to access Hue from Knox Gateway UI
Unable to authenticate users in Hue using SAML
Unable to connect Oracle database to Hue using SCAN
Unable to connect to database with provided credential
Unable to log into Hue with Knox
Unable to read Sqoop metastore created by an older HSQLDB version
Unable to start DAS
Unable to terminate Hive queries from Job Browser
Unable to use pip command in CDP
Unable to view new databases and tables, or unable to see changes to the existing databases or tables
Unable to view or create Oozie workflows
Unable to view Snappy-compressed files
Unaffected Components in this release
Understand the NiFi Record Based Processors and Controller Services
Understanding --go-live and HDFS ACLs
Understanding co-located and external clusters
Understanding erasure coding policies
Understanding HBase garbage collection
Understanding Hue users and groups
Understanding Impala integration with Kudu
Understanding Keystores and Truststores
Understanding Package Management
Understanding Performance using EXPLAIN Plan
Understanding Performance using Query Profile
Understanding Performance using SUMMARY Report
Understanding Replication Flows
Understanding SRM properties, their configuration and hierarchy
Understanding the data that flow into Atlas
Understanding the extractHBaseCells Morphline Command
Understanding the extractHBaseCells Morphline Command
Understanding the kafka-run-class Bash Script
Understanding YARN architecture
Under‐replicated block exceptions or cluster failure occurs on small clusters
Uninstall Cloudera Manager Agent and Managed Software
Uninstall the Cloudera Manager Server
Uninstalling a Runtime Component From a Single Host
Uninstalling Cloudera Manager and Managed Software
UNION clause
Unlocking access to Kafka metadata in Zookeeper
Unsupported Apache Spark Features
Unsupported command line tools
Unsuppressing Health Tests
Update data
UPDATE statement
Updating a notifier
Updating an alert policy
Updating data in a table
Updating Spark 2 apps for Spark 3
Updating Spark 2 apps for Spark 3.x
Updating the schema in a collection
Upgrading existing Kudu tables for Hive Metastore integration
Upgrading from a CDP Private Cloud Base Trial to CDP Private Cloud Base
Upload a file
Uploading tables
Upsert a row
Upsert option in Kudu Spark
UPSERT statement
Usability issues
Use a CTE in a query
Use a custom MapReduce job
Use BulkLoad
Use Case 1: Registering and Querying a Schema for a Kafka Topic
Use case 1: Use Cloudera Manager to generate internal CA and corresponding certificates
Use case 2: Enabling Auto-TLS with an intermediate CA signed by an existing Root CA
Use Case 2: Reading/Deserializing and Writing/Serializing Data from and to a Kafka Topic
Use Case 3: Dataflow Management with Schema-based Routing
Use case 3: Enabling Auto-TLS with Existing Certificates
Use Case Architectures
Use cases
Use cases for ACLs on HDFS
Use cases for BulkLoad
Use cases for centralized cache management
Use Cgroups
Use cluster names in the kudu command line tool
Use cluster replication
Use CopyTable
Use CPU scheduling
Use CPU scheduling with distributed shell
Use CREATE TABLE AS SELECT
Use curl to access a URL protected by Kerberos HTTP SPNEGO
Use Digest Authentication Provider
Use DistCp to migrate HDFS data from HDP to CDP
Use FPGA scheduling
Use FPGA with distributed shell
Use GPU scheduling
Use GPU scheduling with distributed shell
Use GZipCodec with a one-time job
Use HashTable and SyncTable Tool
Use multiple ZooKeeper services
Use partitions when submitting a job
Use rsync to copy files from one broker to another
Use Self-Signed Certificates for TLS
Use snapshots
Use Spark
Use Spark with a secure Kudu cluster
Use Sqoop
USE statement
Use strongly consistent indexing
Use the Charts Library
Use the Cluster Utilization Report to manage resources
Use the HBase APIs for Java
Use the HBase command-line utilities
Use the HBase REST server
Use the HBase shell
Use the Hue HBase app
Use the JDBC interpreter to access Hive
Use the Livy interpreter to access Spark
Use the Network Time Protocol (NTP) with HBase
Use the YARN CLI to View Logs for Applications
Use the YARN REST APIs to manage applications
Use the yarn rmadmin tool to administer ResourceManager high availability
Use transactions with tables
Use wildcards with SHOW DATABASES
User Account Requirements
User authentication in Hue
User authorization configuration for Oozie
User Management
User management in Hue
User Metrics
User-defined functions (UDFs)
Using --go-live with SSL or Kerberos
Using a Credential Provider to Secure S3 Credentials
Using a credential provider to secure S3 credentials
Using a custom Kerberos keytab retrieval script
Using a load balancer
Using a load balancer
Using a Local Parcel Repository
Using a subquery
Using ABFS using CLI
Using advanced search
Using an Internally Hosted Remote Parcel Repository
Using Apache HBase Backup and Disaster Recovery
Using Apache Hive
Using Apache Impala with Apache Kudu
Using Apache Phoenix to Store and Access Data
Using Apache Phoenix-Hive connector
Using Apache Phoenix-Spark connector
Using Apache Zeppelin
Using Atlas-Hive import utility with Ozone entities
Using auth-to-local rules to isolate cluster users
Using Avro Data Files
Using Basic Search
Using Breakpad Minidumps for Crash Reporting
Using CLI commands to create and list ACLs
Using Cloudera Manager to manage HDFS HA
Using common table expressions
Using Configuration Properties to Authenticate
Using constraints
Using Context-Sensitive Variables in Charts
Using custom JAR files with Search
Using custom libraries with Spark
Using Data Analytics Studio
Using dfs.datanode.max.transfer.threads with HBase
Using Direct Reader mode
Using DistCp
Using DistCp between HA clusters using Cloudera Manager
Using DistCp to copy files
Using DistCp to migrate data from secure HDP to secure CDP using DistCp
Using DistCp to migrate data from secure HDP to unsecure CDP
Using DistCp with Amazon S3
Using DistCp with Highly Available remote clusters
Using DNS with HBase
Using EC2 Instance Metadata to Authenticate
Using Environment Variables to Authenticate
Using erasure coding for existing data
Using erasure coding for new data
Using Fast Upload with Amazon S3
Using Free-text Search
Using functions
Using governance-based data discovery
Using HBase blocksize
Using HBase coprocessors
Using HBase replication
Using HBase scanner heartbeat
Using HDFS snapshots for data protection
Using HdfsFindTool to find files
Using hedged reads
Using Hive Metastore with Apache Kudu
Using Hive Warehouse Connector with Oozie Spark Action
Using HttpFS to provide access to HDFS
Using Hue
Using Hue
Using HWC for streaming
Using Impala to query Kudu tables
Using impala-shell and Hive
Using import utility tools with Atlas
Using JDBC API
Using JDBC read mode
Using JdbcStorageHandler to query RDBMS
Using JdbcStorageHandler to query RDBMS
Using JMX for accessing HDFS metrics
Using Kafka Connect
Using Livy with interactive notebooks
Using Livy with Spark
Using Load Balancer with HttpFS
Using MapReduce batch indexing to index sample Tweets
Using MariaDB database with Hue
Using metadata for cluster governance
Using Morphlines to index Avro
Using Morphlines with Syslog
Using MySQL database with Hue
Using non-JDBC drivers
Using optimizations from a subquery
Using Oracle database with Hue
Using ORC Data Files
Using Ozone S3 Gateway to work with storage elements
Using Parquet Data Files
Using Per-Bucket Credentials to Authenticate
Using PostgreSQL database with Hue
Using PySpark
Using quota management
Using rack awareness for read replicas
Using Ranger client libraries
Using Ranger to Provide Authorization in CDP
Using Ranger to Provide Authorization in CDP
Using Ranger with Ozone
Using RCFile Data Files
Using Record-Enabled Processors
Using RegionServer grouping
Using Schema Registry
Using Search filters
Using secondary indexing
Using secure access mode
Using SequenceFile Data Files
Using session cookies to validate Ranger policies
Using snapshots with replication
Using solrctl with an HTTP proxy
Using Spark Hive Warehouse and HBase Connector Client .jar files with Livy
Using Spark MLlib
Using Spark SQL
Using Spark Streaming
Using Sqoop actions with Oozie
Using Streams Replication Manager
Using tag attributes and values in Ranger tag-based policy conditions
Using Tags in Cloudera Manager
Using Text Data Files
Using the Apache HBase Hive integration
Using the Apache Thrift Proxy API
Using the AWS CLI with Ozone S3 Gateway
Using the CDS 3.2.3 Maven Repo
Using the Cloudera Manager API
Using the Cloudera Manager API for Cluster Automation
Using the Cloudera Manager API to backup and restore clusters
Using the Cloudera Manager API to Manage and Configure Clusters
Using the Cloudera Manager API to Obtain Configuration Files
Using the Cloudera Manager API to Set Advanced Configuration Snippets (Safety Valves)
Using the Cloudera Runtime Maven repository 7.1.7
Using the Cloudera Runtime Maven repository 7.1.7 SP1
Using the Cloudera Runtime Maven repository 7.1.7 SP2
Using the Cloudera Runtime Maven repository 7.1.7 SP3
Using the Database Explorer
Using the Directory Committer in MapReduce
Using the Directory Usage Report
Using the HBCK2 tool to remediate HBase clusters
Using the Indexer HTTP Interface
Using the Lily HBase NRT Indexer Service
Using the Livy API to run Spark jobs
Using the NFS Gateway for accessing HDFS
Using the Note Toolbar
Using the Ranger Console
Using the Ranger Key Management Service
Using the REST API
Using the REST API
Using the REST proxy API
Using the S3Guard Command to List and Delete Uploads
Using the Spark DataFrame API
Using transactions
Using Unique Filenames to Avoid File Update Inconsistency
Using YARN Web UI and CLI
Using Zeppelin Interpreters
UTF-8 codec error
Validating Hadoop Key Operations
Validating Key HSM Settings
Validating the Cloudera Search deployment
Validation of Configuration Properties
VALUES statement
VARCHAR data type
Varchar type
VARIANCE, VARIANCE_SAMP, VARIANCE_POP, VAR_SAMP, VAR_POP functions
Variations on Put
Verifing use of a query rewrite
Verify that replication works
Verify the ZooKeeper authentication
Verify validity of the NFS services
Verify your Accumulo installation
Verify your OpDB installation
Verify your OpDB installation
Verifying Cloudera Navigator Key Trustee Server Operations
Verifying if a memory limit is sufficient
Verifying That an S3A Committer Was Used
Verifying that Indexing Works
Verifying the Impala dependency on Kudu
Verifying the setup
Version and Download Information
Versions
View All Applications
View and modify log levels for Search and related services
View and modify Search configuration
View application details
View audit details
View Cluster Overview
View HDFS directory structure of Compute clusters
View HDFS replication policy details
View historical details for an HDFS replication policy
View Nodes and Node Details
View partitions
View query details
View Queues and Queue Details
View Ranger reports
View the API documentation
Viewing a Job's Task Attempts
Viewing a List of All Suppressed Validations
Viewing a List of Suppressed Health Tests
Viewing Activity Details in a Report Format
Viewing All Hosts
Viewing and Debugging Spark Applications Using Logs
Viewing and Downloading Stacks Logs
Viewing and Editing Host Overrides
Viewing and Editing Overridden Configuration Properties
Viewing and Reverting Configuration Changes
Viewing Audit Events
Viewing Charts for Cluster, Service, Role, and Host Instances
Viewing Cloudera Manager Agent Logs in the Logs Page
Viewing Cloudera Manager Server Logs in the Logs Page
Viewing compaction progress
Viewing Current Disk Usage by User, Group, or Directory
Viewing detailed information
Viewing Events
Viewing existing collections
Viewing Health Test Results
Viewing Historical Disk Usage by User, Group, or Directory
Viewing Host and Service Monitor Data Storage
Viewing Host Details
Viewing Host Role Assignments
Viewing Host Status
Viewing Individual Hosts
Viewing Jobs
Viewing Kafka cluster replication details
Viewing lineage
Viewing Logs
Viewing Parcel Usage
Viewing Past Host Inspector Results
Viewing Past Status
Viewing Past Status
Viewing Queries
Viewing racks assigned to cluster hosts
Viewing Role Instance Status
Viewing Running and Recent Commands
Viewing Running and Recent Commands For a Cluster
Viewing Running and Recent Commands for a Service or Role
Viewing Service Instance Details
Viewing Service Status
Viewing storage information
Viewing table and column statistics
Viewing the Cloudera Manager Agent Log
Viewing the Cloudera Manager Agent Logs
Viewing the Cloudera Manager Server Log
Viewing the Cloudera Manager Server Log
Viewing the DAG counters
Viewing the DAG flow
Viewing the Disks Overview
Viewing the Distribution of Task Attempts
Viewing the Health and Status of a Role Instance
Viewing the Hive configurations for a query
Viewing the Hosts in a Cluster
Viewing the Jobs in a Pig, Oozie, or Hive Activity
Viewing the Join report
Viewing the Maintenance Mode Status of a Cluster
Viewing the Maintenance Mode Status of a Cluster
Viewing the query details
Viewing the query recommendations
Viewing the query timeline
Viewing the Read and Write report
Viewing the Status of a Service Instance
Viewing the task-level DAG information
Viewing the Tez configurations for a query
Viewing the URLs of the Client Configuration Files
Viewing the visual explain for a query
Viewing transaction locks
Viewing transactions
Views
Virtual machine options for HBase Shell
Virtual memory handling
Virtual Private Clusters and Cloudera SDX
Visualizing Spark Applications Using the Web Application UI
Volume and bucket management using ofs
Web User Interface for Debugging
WebHCat Server Health Tests
WebHCat Server Metrics
What is CDP Private Cloud?
What is Cloudera Search
What's new in 7.1.7
What's New in Cloudera Manager 7.11.3 Cumulative hotfix 4 (CDP Private Cloud Base 7.1.7 SP3)
What's New in Cloudera Manager 7.4.4
What's New in Cloudera Manager 7.6.1 (CDP Private Cloud Base 7.1.7 SP1)
What's New in Cloudera Manager 7.6.7 (CDP Private Cloud Base 7.1.7 SP2)
What's new in Cloudera Runtime 7.1.7 SP1
What's new in Cloudera Runtime 7.1.7 SP2
What's new in Cloudera Runtime 7.1.7 SP3
When Shuffles Do Not Occur
When to Add a Shuffle Transformation
When to use Atlas classifications for access control
Why HDFS data becomes unbalanced
Why one scheduler?
Wildcards and variables in resource-based policies
WINDOW
WITH clause
Work Preserving Recovery for YARN components
Working with Amazon S3
Working with Apache Hive Metastore
Working with Atlas classifications and labels
Working with Classifications and Labels
Working with Google Cloud Storage
Working with ofs
Working with Ozone File System (o3fs)
Working with S3 buckets in the same AWS region
Working with the ABFS Connector
Working with the Oozie server
Working with the Recon web user interface
Working with Third-party S3-compatible Object Stores
Working with versioned S3 buckets
Working with Zeppelin Notes
Write a few Events into the Topic
Write-ahead log garbage collection
Writes
Writing data in a Kerberos and TLS/SSL enabled cluster
Writing data in an unsecured cluster
Writing data through HWC
Writing data to HBase
Writing data to Kafka
Writing Kafka data to Ozone with Kafka Connect
Writing to multiple tablets
Writing transformed Hive data to Kafka
Writing UDFs
Writing user-defined aggregate functions (UDAFs)
YARN
YARN
YARN
YARN
YARN
YARN ACL rules
YARN ACL syntax
YARN ACL types
YARN Configuration Properties
YARN Features
YARN Health Tests
YARN Log Aggregation Overview
YARN Metrics
YARN Pool Metrics
YARN Pool User Metrics
YARN Properties in Cloudera Runtime 7.1.7
YARN Queue Manager Metrics
YARN Queue Manager Properties in Cloudera Runtime 7.1.7
YARN Queue Manager Store Health Tests
YARN Queue Manager Store Metrics
YARN Queue Manager Webapp Health Tests
YARN Queue Manager Webapp Metrics
YARN resource allocation of multiple resource-types
YARN ResourceManager High Availability
YARN ResourceManager high availability architecture
YARN services API examples
YARN Tab
YARN tuning overview
YARN, MRv1, and Linux OS Security
Zeppelin
Zeppelin
Zeppelin Health Tests
Zeppelin Metrics
Zeppelin Properties in Cloudera Runtime 7.1.7
Zeppelin Server Health Tests
Zeppelin Server Metrics
Zookeeper
ZooKeeper
ZooKeeper
ZooKeeper ACLs Best Practices
ZooKeeper ACLs Best Practices: Atlas
ZooKeeper ACLs Best Practices: Cruise Control
ZooKeeper ACLs Best Practices: HBase
ZooKeeper ACLs Best Practices: HDFS
ZooKeeper ACLs Best Practices: Kafka
ZooKeeper ACLs Best Practices: Oozie
ZooKeeper ACLs Best Practices: Ranger
ZooKeeper ACLs best practices: Search
ZooKeeper ACLs Best Practices: YARN
ZooKeeper ACLs Best Practices: ZooKeeper
ZooKeeper Authentication
Zookeeper Configurations
ZooKeeper Health Tests
ZooKeeper Metrics
ZooKeeper Properties in Cloudera Runtime 7.1.7
ZooKeeper Server Health Tests
zookeeper-security-migration
«
Filter topics
Known Issues in Apache Hadoop
Overview
▼
7.1.7 SP3
What's new in Cloudera Runtime 7.1.7 SP3
Cloudera Runtime 7.1.7 SP3 component versions
▶︎
Using the Cloudera Runtime Maven repository 7.1.7 SP3
Runtime 7.1.7.3000-77
Runtime 7.1.7.3008-2
Runtime 7.1.7.3010-1
Runtime 7.1.7.3011-1
Runtime 7.1.7.3013-1
Runtime 7.1.7.3014-1
Runtime 7.1.7.3016-1
▶︎
Fixed issues in Cloudera Runtime 7.1.7 SP3
Fixed Issues in Apache Atlas
Fixed Issues in Apache Avro
Fixed issues in Apache Calcite
Fixed issues in Cloud Connectors
Fixed issues in Cruise Control
Fixed issues in Data Analytics Studio
Fixed Issues in Apache Hadoop
Fixed Issues in Apache HDFS
Fixed Issues in Apache HBase
Fixed Issues in Apache Hive
Fixed Issues in Hue
Fixed Issues in Apache Impala
Fixed Issues in Apache Kafka
Fixed Issues in Kerberos
Fixed Issues in Apache Kudu
Fixed Issues in Apache Knox
Fixed Issues in Livy
Fixed Issues in MapReduce
Fixed Issues in Navigator Encrypt
Fixed Issues in Apache Oozie
Fixed issues in Apache Ozone
Fixed Issues in Apache Parquet
Fixed Issues in Apache Phoenix
Fixed Issues in Apache Ranger
Fixed Issues in Schema Registry
Fixed Issues in Cloudera Search
Fixed Issues in Apache Solr
Fixed Issues in Apache Spark
Fixed Issues in Apache Sqoop
Fixed Issues in Streams Replication Manager
Fixed Issues in Streams Messaging Manager
Fixed Issues in Apache Tez
Fixed Issues in Apache YARN and YARN Queue Manager
Fixed Issues in Zeppelin
Fixed Issues in Apache Zookeeper
▼
Known Issues in Cloudera Runtime 7.1.7 SP3
Known Issues in Apache Atlas
Known Issues in Apache Avro
Known Issues in Cruise Control
Known Issues in Apache Calcite
Known Issues in Data Analytics Studio
Known Issues in Apache Hadoop
Known Issues in Apache HBase
Known Issues in HDFS
Known Issues in Apache Hive
Known Issues in Hue
Known Issues in Apache Impala
Known Issues in Apache Kafka
Known Issues in Apache Knox
Known Issues in Apache Kudu
Known Issues in Navigator Encrypt
Known Issues in Apache Oozie
Known Issues in Apache Ozone
Known Issues in Apache Parquet
Known Issues in Apache Phoenix
Known Issues in Apache Ranger
Known Issues in Schema Registry
Known Issues in Cloudera Search
Known Issues in Apache Spark
Known Issues in Streams Replication Manager
Known Issues for Apache Sqoop
Known issues in Streams Messaging Manager
Known Issues in MapReduce and YARN
Known Issues in Apache Zeppelin
Known Issues in Apache ZooKeeper
▶︎
Behavioral changes in Cloudera Runtime 7.1.7 SP3
Behavioral changes in Apache Hive
▶︎
CDP Private Cloud Base API Modifications and Removals
▶︎
CDP 7.1.7 SP2 and 7.1.7 SP3 Components with API differences
API Compatibility changes in 7.1.7 SP3 for Spark
API Compatibility changes in 7.1.7 SP3 for Zookeeper
▶︎
Deprecation notices in Cloudera Runtime 7.1.7 SP3
Platform and OS
Fixed Common Vulnerabilities and Exposures 7.1.7 SP3
Documentation Errata in Cloudera Runtime 7.1.7 SP3
▶︎
Cumulative hotfixes
▶︎
Cumulative hotfix CDP Private Cloud Base 7.1.7.3016-1 (SP3 Cumulative hotfix6)
Behavioral changes in Cloudera Runtime 7.1.7 SP3 CHF6
▶︎
Cumulative hotfix CDP Private Cloud Base 7.1.7.3014-1 (SP3 Cumulative hotfix5)
Behavioral changes in Cloudera Runtime 7.1.7 SP3 CHF5
Cumulative hotfix CDP Private Cloud Base 7.1.7.3013-1 (SP3 Cumulative hotfix4)
Cumulative hotfix CDP Private Cloud Base 7.1.7.3011-1 (SP3 Cumulative hotfix3)
Cumulative hotfix CDP Private Cloud Base 7.1.7.3010-1 (SP3 Cumulative hotfix2)
Cumulative hotfix CDP Private Cloud Base 7.1.7.3008-2 (SP3 Cumulative hotfix1)
▶︎
7.1.7 SP2
What's new in Cloudera Runtime 7.1.7 SP2
Cloudera Runtime 7.1.7 SP2 component versions
▶︎
Using the Cloudera Runtime Maven repository 7.1.7 SP2
Runtime 7.1.7.2000-305
Runtime 7.1.7.2002-1
Runtime 7.1.7.2009-1
Runtime 7.1.7.2010-1
Runtime 7.1.7.2011-1
Runtime 7.1.7.2013-1
Runtime 7.1.7.2016-1
Runtime 7.1.7.2021-1
Runtime 7.1.7.2023-1
Runtime 7.1.7.2024-1
Runtime 7.1.7.2025-2
Runtime 7.1.7.2026-3
Runtime 7.1.7.2030-1
Runtime 7.1.7.2032-1
Runtime 7.1.7.2035-2
Runtime 7.1.7.2038-1
Runtime 7.1.7.2040-4
Runtime 7.1.7.2046-1
Runtime 7.1.7.2047-1
Runtime 7.1.7.2050-1
▶︎
Fixed issues in Cloudera Runtime 7.1.7 SP2
Fixed Issues in Apache Atlas
Fixed Issues in Apache Avro
Fixed issues in Apache Calcite
Fixed issues in Cloud Connectors
Fixed issues in Cruise Control
Fixed issues in Data Analytics Studio
Fixed Issues in Apache Hadoop
Fixed Issues in Apache HDFS
Fixed Issues in Apache HBase
Fixed Issues in Apache Hive
Fixed Issues in Hue
Fixed Issues in Apache Impala
Fixed Issues in Apache Kafka
Fixed Issues in Kerberos
Fixed Issues in Apache Kudu
Fixed Issues in Apache Knox
Fixed Issues in Livy
Fixed Issues in MapReduce
Fixed Issues in Navigator Encrypt
Fixed Issues in Apache Oozie
Fixed issues in Apache Ozone
Fixed Issues in Apache Parquet
Fixed Issues in Apache Phoenix
Fixed Issues in Apache Ranger
Fixed Issues in Schema Registry
Fixed Issues in Cloudera Search
Fixed Issues in Apache Solr
Fixed Issues in Apache Spark
Fixed Issues in Apache Sqoop
Fixed Issues in Streams Replication Manager
Fixed Issues in Streams Messaging Manager
Fixed Issues in Apache Tez
Fixed Issues in Apache YARN and YARN Queue Manager
Fixed Issues in Zeppelin
Fixed Issues in Apache Zookeeper
Hotfixes in Cloudera Runtime 7.1.7 SP2
▶︎
Known issues in Cloudera Runtime 7.1.7 SP2
Known Issues in Apache Atlas
Known Issues in Apache Avro
Known issues in Cruise Control
Known issues in Apache Calcite
Known Issues in Data Analytics Studio
Known Issues in Apache Hadoop
Known Issues in Apache HBase
Known Issues in HDFS
Known Issues in Apache Hive
Known Issues in Hue
Known Issues in Apache Impala
Known Issues in Apache Kafka
Known Issues in Apache Knox
Known Issues in Apache Kudu
Known Issues in Navigator Encrypt
Known Issues in Apache Oozie
Known Issues in Apache Ozone
Known Issues in Apache Parquet
Known Issues in Apache Phoenix
Known Issues in Apache Ranger
Known Issues in Schema Registry
Known Issues in Cloudera Search
Known Issues in Apache Spark
Known Issues in Streams Replication Manager
Known Issues for Apache Sqoop
Known issues in Streams Messaging Manager
Known Issues in MapReduce and YARN
Known Issues in Apache Zeppelin
Known Issues in Apache ZooKeeper
▶︎
Behavioral changes in Cloudera Runtime 7.1.7 SP2
Behavioral changes in Apache Hive
Behavioral Changes in Cloudera Search
Behavioral changes in Apache Impala
Fixed Common Vulnerabilities and Exposures 7.1.7 SP2
Documentation Errata in Cloudera Runtime 7.1.7 SP2
▶︎
Cumulative hotfixes
Cumulative hotfix CDP PvC Base 7.1.7.2050-1 (SP2 cumulative hotfix19)
Cumulative hotfix CDP PvC Base 7.1.7.2047-1 (SP2 cumulative hotfix18)
Cumulative hotfix CDP PvC Base 7.1.7.2046-1 (SP2 cumulative hotfix17)
Cumulative hotfix CDP PvC Base 7.1.7.2040-4 (SP2 cumulative hotfix16)
Cumulative hotfix CDP PvC Base 7.1.7.2038-1 (SP2 cumulative hotfix15)
Cumulative hotfix CDP PvC Base 7.1.7.2035-2 (SP2 cumulative hotfix14)
Cumulative hotfix CDP PvC Base 7.1.7.2032-1 (SP2 cumulative hotfix13)
Cumulative hotfix CDP PvC Base 7.1.7.2030-1 (SP2 cumulative hotfix12)
Cumulative hotfix CDP PvC Base 7.1.7.2026-3 (SP2 cumulative hotfix11)
Cumulative hotfix CDP PvC Base 7.1.7.2025-2 (SP2 cumulative hotfix10)
Cumulative hotfix CDP PvC Base 7.1.7.2024-1 (SP2 cumulative hotfix9)
Cumulative hotfix CDP PvC Base 7.1.7.2023-1 (SP2 cumulative hotfix8)
Cumulative hotfix CDP PvC Base 7.1.7.2021-1 (SP2 cumulative hotfix7)
Cumulative hotfix CDP PvC Base 7.1.7.2016-1 (SP2 cumulative hotfix6)
Cumulative hotfix CDP PvC Base 7.1.7.2013-1 (SP2 cumulative hotfix5)
Cumulative hotfix CDP PvC Base 7.1.7.2011-1 (SP2 cumulative hotfix4)
Cumulative hotfix CDP PvC Base 7.1.7.2010-1 (SP2 cumulative hotfix3)
Cumulative hotfix CDP PvC Base 7.1.7.2009-1 (SP2 cumulative hotfix2)
Cumulative hotfix CDP PvC Base 7.1.7.2002-1 (SP2 cumulative hotfix1)
▶︎
7.1.7 SP1
What's new in Cloudera Runtime 7.1.7 SP1
Cloudera Runtime 7.1.7 SP1 component versions
▶︎
Using the Cloudera Runtime Maven repository 7.1.7 SP1
Maven Artifacts for Cloudera Runtime 7.1.7 SP1
▶︎
Fixed issues in Cloudera Runtime 7.1.7 SP1
Fixed Issues in Apache Atlas
Fixed Issues in Apache Avro
Fixed issues in Cruise Control
Fixed issues in Data Analytics Studio
Fixed Issues in Apache Hadoop
Fixed Issues in Apache HDFS
Fixed Issues in Apache HBase
Fixed Issues in Apache Hive
Fixed Issues in Hue
Fixed Issues in Apache Impala
Fixed Issues in Apache Kafka
Fixed Issues in Apache Kudu
Fixed Issues in Apache Knox
Fixed Issues in Navigator Encrypt
Fixed Issues in Apache Oozie
Fixed issues in Apache Ozone
Fixed Issues in Apache Parquet
Fixed Issues in Phoenix
Fixed Issues in Apache Ranger
Fixed Issues in Schema Registry
Fixed Issues in Cloudera Search
Fixed Issues in Apache Spark
Fixed Issues in Apache Sqoop
Fixed Issues in Streams Replication Manager
Fixed Issues in Streams Messaging Manager
Fixed Issues in Apache Tez
Fixed Issues in Apache YARN
Fixed Issues in Zeppelin
Fixed Issues in Apache Zookeeper
Hotfixes in Cloudera Runtime 7.1.7 SP1
▶︎
Known issues in Cloudera Runtime 7.1.7 SP1
Known Issues in Apache Atlas
Known Issues in Apache Avro
Known issues in Cruise Control
Known Issues in Data Analytics Studio
Known Issues in Apache Hadoop
Known Issues in Apache HBase
Known Issues in HDFS
Known Issues in Apache Hive
Known Issues in Hue
Known Issues in Apache Impala
Known Issues in Apache Kafka
Known Issues in Kerberos
Known Issues in Apache Knox
Known Issues in Apache Kudu
Known Issues in Navigator Encrypt
Known Issues in Apache Oozie
Known Issues in Apache Ozone
Known Issues in Apache Parquet
Known Issues in Apache Phoenix
Known Issues in Apache Ranger
Known Issues in Schema Registry
Known Issues in Cloudera Search
Known Issues in Apache Spark
Known Issues in Streams Replication Manager
Known Issues for Apache Sqoop
Known issues in Streams Messaging Manager
Known Issues in MapReduce and YARN
Known Issues in Apache Zeppelin
Known Issues in Apache ZooKeeper
▶︎
Behavioral changes in Cloudera Runtime 7.1.7 SP1
Behavioral changes in Apache Hive
Behavioral Changes in Cloudera Search
Behavioral changes in Apache HBase
Fixed Common Vulnerabilities and Exposures 7.1.7 SP1
Documentation Errata in Cloudera Runtime 7.1.7 SP1
Cloudera Logging is now available in CDP Private Cloud Base 7.1.7 SP1
▶︎
Cumulative hotfixes
Cumulative hotfix 24 (CDP PvC Base 7.1.7.1069-3)
Cumulative hotfix 23 (CDP PvC Base 7.1.7.1067-1)
Cumulative hotfix 22 (CDP PvC Base 7.1.7.1065-1)
Cumulative hotfix 21 (CDP PvC Base 7.1.7.1064-2)
Cumulative hotfix 20 (CDP PvC Base 7.1.7.1063-8)
Cumulative hotfix 19 (CDP PvC Base 7.1.7.1061-1)
Cumulative hotfix 18 (CDP PvC Base 7.1.7.1060-1)
Cumulative hotfix 17 (CDP PvC Base 7.1.7.1059-1)
Cumulative hotfix 16 (CDP PvC Base 7.1.7.1058-1)
Cumulative hotfix 15 (CDP PvC Base 7.1.7.1057-2)
Cumulative hotfix 14 (CDP PvC Base 7.1.7.1054-1)
Cumulative hotfix 13 (CDP PvC Base 7.1.7.1050-1)
Cumulative hotfix 12 (CDP PvC Base 7.1.7.1046-1)
Cumulative hotfix 11 (CDP PvC Base 7.1.7.1044-1)
Cumulative hotfix 10 (CDP PvC Base 7.1.7.1041-1)
Cumulative hotfix 9 (CDP PvC Base 7.1.7.1039-2)
Cumulative hotfix 8 (CDP PvC Base 7.1.7.1037-2)
Cumulative hotfix 7 (CDP PvC Base 7.1.7.1035-1)
Cumulative hotfix 6 (CDP PvC Base 7.1.7.1032-1)
Cumulative hotfix 5 (CDP PvC Base 7.1.7.1029-4)
Cumulative hotfix 4 (CDP PvC Base 7.1.7.1024-4)
Cumulative hotfix 3 (CDP PvC Base 7.1.7.1022-1)
Cumulative hotfix 2 (CDP PvC Base 7.1.7.1014-4)
Cumulative hotfix 1 (CDP PvC Base 7.1.7.1003-5)
▶︎
7.1.7
▶︎
What's new in 7.1.7
Atlas
Cruise Control
Hive
Hue
Impala
Kafka
Kerberos
Kudu
Ozone
Ranger
Schema Registry
Search
Spark
Sqoop
Streams Replication Manager
Streams Messaging Manager
YARN
Unaffected Components in this release
Cloudera Runtime component versions
▶︎
Using the Cloudera Runtime Maven repository 7.1.7
Maven Artifacts for Cloudera Runtime 7.1.7.0
▶︎
Fixed issues in Cloudera Runtime 7.1.7
Atlas
Avro
Cruise Control
DAS
Hadoop
HDFS
HBase
Hive
Hue
Impala
Kafka
Kudu
Knox
Navigator Encrypt
Oozie
Ozone
Parquet
Phoenix
Ranger
Schema Registry
Search
Spark
Sqoop
Streams Replication Manager
Streams Messaging Manager
Tez
YARN
Zeppelin
Zookeeper
Hotfixes in Cloudera Runtime 7.1.7
▶︎
Known issues in Cloudera Runtime 7.1.7
Atlas
Avro
Cruise Control
DAS
Hadoop
HBase
HDFS
Hive
Hue
Impala
Kafka
Kerberos
Knox
Kudu
Navigator Encrypt
Oozie
Ozone
Parquet
Phoenix
Ranger
Schema Registry
Search
Spark
Streams Replication Manager
Sqoop
Streams Messaging Manager
YARN
Zeppelin
ZooKeeper
▶︎
Behavioral changes in Cloudera Runtime 7.1.7
Cruise Control
Hive
Kafka
Navigator Encrypt
Phoenix
Search
Impala
Streams Replication Manager
YARN
▶︎
Deprecation notices in Cloudera Runtime 7.1.7
Kudu
Kafka
HBase
HDFS
▶︎
CDP Private Cloud Base service groups and component reference
CDP PVC Base - Data Warehouse
CDP PVC Base - Data Engineering
CDP PVC Base - Operational Database
CDP PVC Base - Enterprise Essentials
»
Cloudera Runtime Release Notes
Known Issues in Apache Hadoop
There are no known issues for Hadoop in Cloudera Runtime 7.1.7 SP3.
Parent topic:
Known Issues in Cloudera Runtime 7.1.7 SP3
7.3.1
7.1
7.1.9
7.1.8
7.1.7
7.1.6
7.1.5
7.1.4
7.1.3
7.1.2
7.1.1
7.0.3
This site uses cookies and related technologies, as described in our
privacy policy
, for purposes that may include site operation, analytics, enhanced user experience, or advertising. You may choose to consent to our use of these technologies, or
manage your own preferences.
Accept all