5. Hadoop Archives and MapReduce

To use Hadoop Archives with MapReduce, you must reference files slightly differently than with the default file system. If you have a Hadoop Archive stored in HDFS in /user/ zoo/foo.har, you must specify the input directory as har:///user/zoo/foo.har to use it as a MapReduce input. Since Hadoop Archives are exposed as a file system, MapReduce is able to use all of the logical input files in Hadoop Archives as input.

Legal notices

Contents
Search

1. HDFS Administration
2. Archival Storage
3. Centralized Cache Management in HDFS
4. Configuring HDFS Compression
5. Configuring Rack Awareness On HDP
6. Hadoop Archives
7. JMX Metrics APIs for HDFS Daemons
8. Memory as Storage (Technical Preview)
9. Running DataNodes as Non-Root
- 1. Introduction
- 2. Configuring DataNode SASL
10. Short Circuit Local Reads On HDFS
11. WebHDFS Administrator Guide
12. About HDP

Search Highlighter (On/Off)