Managing Clusters
Accessing the Cloudera Manager Admin Console from Data Hub clusters
Accessing the Cloudera Manager for Data Lake clusters using workload credentials
Starting, Stopping, Refreshing, and Restarting a Cluster
Pausing a Cluster in AWS
Shutting Down and Starting Up the Cluster
Renaming a Cluster
Managing Hosts
Viewing Host Status
Viewing Host Role Assignments
Hosts Disks Overview
Deleting Hosts
Deleting a Host from Cloudera Manager
Removing a Host From a Cluster
Stopping All the Roles on a Host
Starting All the Roles on a Host
Configuring Upgrade Domains
Changing the Upgrade Domain for hosts
Putting all Hosts in an Upgrade Domain group into Maintenance Mode
Performing Maintenance on a Cluster Host
Decommissioning Hosts
Recommissioning Hosts
Tuning and Troubleshooting Host Decommissioning
Tuning HDFS Prior to Decommissioning DataNodes
Tuning HBase Prior to Decommissioning DataNodes
Performance Considerations
Troubleshooting Performance of Decommissioning
Maintenance Mode
Viewing the Maintenance Mode Status of a Cluster
Managing Roles
Role Instances
Adding a Role Instance
Starting, Stopping, and Restarting Role Instances
Decommissioning Role Instances
Recommissioning Role Instances
Deleting Role Instances
Configuring Roles to Use a Custom Garbage Collection Parameter
Role Groups
Creating a Role Group
Managing Role Groups
Default User Roles
Managing Cloudera Runtime Services
Starting a Cloudera Runtime Service on All Hosts
Stopping a Cloudera Runtime Service on All Hosts
Restarting a Cloudera Runtime Service
Rolling Restart
Aborting a Pending Command
Performance Management
Optimizing Performance in Cloudera Runtime
Disable the tuned Service
Disabling Transparent Hugepages (THP)
Setting the vm.swappiness Linux Kernel Parameter
Improving Performance in Shuffle Handler and IFile Reader
Tips and Best Practices for Jobs
Decrease Reserve Space
Choosing and Configuring Data Compression
Resource Management
Static Service Pools
Enabling and Configuring Static Service Pools
Disabling Static Service Pools
Linux Control Groups (cgroups)
Enabling Resource Management with Control Groups
Configuring Resource Parameters
Configuring Custom Cgroups
Data Storage for Monitoring Data
Configuring Service Monitor Data Storage
Configuring Host Monitor Data Storage
Viewing Host and Service Monitor Data Storage
Data Granularity and Time-Series Metric Data
Moving Monitoring Data on an Active Cluster
Host Monitor and Service Monitor Memory Configuration
Configuring Memory Allocations
Accessing Storage Using Amazon S3
Referencing S3 Credentials for YARN, MapReduce, or Spark Clients
Referencing Amazon S3 in URIs
Using Fast Upload with Amazon S3
Enabling Fast Upload using Cloudera Manager
How to Configure a MapReduce Job to Access S3 with an HDFS Credstore
Importing Data into Amazon S3 Using Sqoop
Authentication
Using a Credential Provider to Secure S3 Credentials
Sqoop Import into Amazon S3
Import Data from RDBMS into an S3 Bucket
Import Data into S3 Bucket in Incremental Mode
Import Data into an External Hive Table Backed by S3
Accessing Storage Using Microsoft ADLS
Configuring OAuth in Data Hub
Configuring OAuth with core-site.xml
Configuring OAuth with the Hadoop CredentialProvider
Configuring Built-in TLS Acceleration
Importing Data into Microsoft Azure Data Lake Store (Gen1 and Gen2) Using Sqoop
Prerequisites
Authentication
Sqoop Import into ADLS