Managing Data Hub Clusters
Accessing the Cloudera Manager Admin Console from Data Hub clusters
Accessing the Cloudera Manager for Data Lake clusters using workload credentials
Starting, Stopping, Refreshing, and Restarting a Cluster
Pausing a Cluster in AWS
Shutting Down and Starting Up the Cluster
Renaming a Cluster
Managing Hosts
Status
Configuration
Roles
Host Templates
Stopping All the Roles on a Host
Starting All the Roles on a Host
Specifying Racks for Hosts
Host Templates
Creating a Host Template
Editing a Host Template
Applying a Host Template to a Host
Performing Maintenance on a Cluster Host
Decommissioning Hosts
Recommissioning Hosts
Tuning and Troubleshooting Host Decommissioning
Tuning HDFS Prior to Decommissioning DataNodes
Tuning HBase Prior to Decommissioning DataNodes
Performance Considerations
Troubleshooting Performance of Decommissioning
Maintenance Mode
Viewing the Maintenance Mode Status of a Cluster
Changing Hostnames
Deleting Hosts
Deleting a Host from Cloudera Manager
Removing a Host From a Cluster
Moving a Host Between Clusters
Managing Roles
Role Instances
Starting, Stopping, and Restarting Role Instances
Decommissioning Role Instances
Recommissioning Role Instances
Configuring Roles to Use a Custom Garbage Collection Parameter
Role Groups
Creating a Role Group
Managing Role Groups
Managing Cloudera Runtime Services
Starting a Cloudera Runtime Service on All Hosts
Stopping a Cloudera Runtime Service on All Hosts
Restarting a Cloudera Runtime Service
Rolling Restart
Aborting a Pending Command
Managing Cloudera Manager
Automatic Logout
Starting, Stopping, and Restarting the Cloudera Manager Server
Configuring Cloudera Manager Server Ports
Moving the Cloudera Manager Server to a New Host
Migrating Embedded PostgreSQL Database to External PostgreSQL Database
Step 1: Identify Roles that Use the Embedded Database Server
Step 2: Migrate Databases from the Embedded Database Server to the External PostgreSQL Database Server
Migrating from PostgreSQL Database Server to MySQL/Oracle Database Server
Migrate from the Cloudera Manager External PostgreSQL Database Server to a MySQL/Oracle Database Server
Managing Cloudera Manager Server Logs
Configuring Cloudera Manager
Cloudera Manager Agents
Starting, Stopping, and Restarting Cloudera Manager Agents
Managing the Cloudera Manager Agent Logs
Default User Roles
Exporting and Importing Cloudera Manager Configuration
Other Tasks and Settings
Cloudera Management Service
Starting the Cloudera Management Service
Stopping the Cloudera Management Service
Restarting the Cloudera Management Service
Starting and Stopping Cloudera Management Service Roles
Configuring Management Service Database Limits
Performance Management
Optimizing Performance in Cloudera Runtime
Disable the tuned Service
Disabling Transparent Hugepages (THP)
Setting the vm.swappiness Linux Kernel Parameter
Improving Performance in Shuffle Handler and IFile Reader
Tips and Best Practices for Jobs
Decrease Reserve Space
Choosing and Configuring Data Compression
Resource Management
Static Service Pools
Enabling and Configuring Static Service Pools
Disabling Static Service Pools
Linux Control Groups (cgroups)
Resource Management with Control Groups
Configuring Resource Parameters
Data Storage for Monitoring Data
Configuring Service Monitor Data Storage
Configuring Host Monitor Data Storage
Viewing Host and Service Monitor Data Storage
Data Granularity and Time-Series Metric Data
Moving Monitoring Data on an Active Cluster
Host Monitor and Service Monitor Memory Configuration
Configuring Memory Allocations
Accessing Storage Using Amazon S3
Referencing S3 Credentials for YARN, MapReduce, or Spark Clients
Referencing Amazon S3 in URIs
Using Fast Upload with Amazon S3
Enabling Fast Upload using Cloudera Manager
How to Configure a MapReduce Job to Access S3 with an HDFS Credstore
Importing Data into Amazon S3 Using Sqoop
Authentication
Using a Credential Provider to Secure S3 Credentials
Sqoop Import into Amazon S3
Import Data from RDBMS into an S3 Bucket
Import Data into S3 Bucket in Incremental Mode
Import Data into an External Hive Table Backed by S3