HDFS Administration
Also available as:
PDF
loading table of contents...

Contents

1. ACLs on HDFS
Configuring ACLs on HDFS
Using CLI Commands to Create and List ACLs
ACL Examples
ACLS on HDFS Features
Use Cases for ACLs on HDFS
2. Archival Storage
Introduction
HDFS Storage Types
Storage Policies: Hot, Warm, and Cold
Configuring Archival Storage
3. Centralized Cache Management in HDFS
Overview
Caching Use Cases
Caching Architecture
Caching Terminology
Configuring Centralized Caching
Using Cache Pools and Directives
4. Configuring HDFS Compression
5. Configuring Rack Awareness On HDP
Create a Rack Topology Script
Add the Topology Script Property to core-site.xml
Restart HDFS and MapReduce
Verify Rack Awareness
6. Customizing HDFS
Customize the HDFS Home Directory
Set the Size of the NameNode Edits Directory
7. Hadoop Archives
Introduction
Hadoop Archive Components
Creating a Hadoop Archive
Looking Up Files in Hadoop Archives
Hadoop Archives and MapReduce
8. JMX Metrics APIs for HDFS Daemons
9. Memory as Storage (Technical Preview)
Introduction
HDFS Storage Types
The LAZY_PERSIST Memory Storage Policy
Configuring Memory as Storage
10. Running DataNodes as Non-Root
Introduction
Configuring DataNode SASL
11. Short Circuit Local Reads On HDFS
Prerequisites
Configuring Short-Circuit Local Reads on HDFS
12. Accidental Deletion Protection
Preventing Accidental Deletion of Files
13. WebHDFS Administrator Guide
14. Backing Up HDFS Metadata
Introduction to HDFS Metadata Files and Directories
Files and Directories
HDFS Commands
Backing Up HDFS Metadata
Get Ready to Backup the HDFS Metadata
Perform a Backup of the HDFS Metadata
15. Balancing in HDFS
Overview of the HDFS Balancer
Why HDFS Data Becomes Unbalanced
Configurations and CLI Options
Configuring the Balancer
Using the Balancer CLI Commands
Recommended Configurations
Cluster Balancing Algorithm
Step 1: Storage Group Classification
Step 2: Storage Group Pairing
Step 3: Block Move Scheduling
Step 4: Block Move Execution
Exit Status