Managing Clusters
Accessing the Cloudera Manager Admin Console
Accessing the Cloudera Manager for Data Lake clusters using workload credentials
Starting, Stopping, Refreshing, and Restarting a Cluster
Pausing a Cluster in AWS
Shutting Down and Starting Up the Cluster
Managing Hosts
Viewing Host Status
Viewing Host Role Assignments
Hosts Disks Overview
Stopping All the Roles on a Host
Starting All the Roles on a Host
Performing Maintenance on a Cluster Host
Decommissioning Hosts
Recommissioning Hosts
Tuning and Troubleshooting Host Decommissioning
Tuning HDFS Prior to Decommissioning DataNodes
Tuning HBase Prior to Decommissioning DataNodes
Performance Considerations
Troubleshooting Performance of Decommissioning
Maintenance Mode
Viewing the Maintenance Mode Status of a Cluster
Managing Roles
Role Instances
Starting, Stopping, and Restarting Role Instances
Decommissioning Role Instances
Recommissioning Role Instances
Configuring Roles to Use a Custom Garbage Collection Parameter
Role Groups
Creating a Role Group
Managing Role Groups
Default User Roles
Managing Cloudera Runtime Services
Comparing Configurations for a Service Between Clusters
Starting a Cloudera Runtime Service on All Hosts
Stopping a Cloudera Runtime Service on All Hosts
Restarting a Cloudera Runtime Service
Rolling Restart
Aborting a Pending Command
Configuring Maximum File Descriptors
Core Configuration Service
Cloudera Manager API
Exporting and Importing Cloudera Manager Configuration
Backing Up and Restoring the Cloudera Manager Configuration
Using Tags in Cloudera Manager
Performance Management
Optimizing Performance in Cloudera Runtime
Disabling Transparent Hugepages (THP)
Setting the vm.swappiness Linux Kernel Parameter
Improving Performance in Shuffle Handler and IFile Reader
Tips and Best Practices for Jobs
Decrease Reserve Space
Choosing and Configuring Data Compression
Resource Management
Static Service Pools
Data Storage for Monitoring Data
Configuring Service Monitor Data Storage
Configuring Host Monitor Data Storage
Viewing Host and Service Monitor Data Storage
Data Granularity and Time-Series Metric Data
Moving Monitoring Data on an Active Cluster
Host Monitor and Service Monitor Memory Configuration
Configuring Memory Allocations
Accessing Storage Using Amazon S3
Referencing S3 Credentials for YARN, MapReduce, or Spark Clients
Referencing Amazon S3 in URIs
Using Fast Upload with Amazon S3
Enabling Fast Upload using Cloudera Manager
How to Configure a MapReduce Job to Access S3 with an HDFS Credstore
Importing Data into Amazon S3 Using Sqoop
Authentication
Using a Credential Provider to Secure S3 Credentials
Sqoop Import into Amazon S3
Import Data from RDBMS into an S3 Bucket
Import Data into S3 Bucket in Incremental Mode
Import Data into an External Hive Table Backed by S3
Accessing Storage Using Microsoft ADLS
Configuring OAuth in Data Hub
Configuring OAuth with core-site.xml
Configuring OAuth with the Hadoop CredentialProvider
Configuring Native TLS Acceleration
Importing Data into Microsoft Azure Data Lake Store (Gen1 and Gen2) Using Sqoop
Prerequisites
Authentication
Sqoop Import into ADLS