This is the documentation for CDH 5.1.x. Documentation for other versions is available at Cloudera Documentation.

Using Snappy compression in Sqoop 1 and Sqoop 2 Imports

  • For Sqoop 1:

    On the command line, use the following option to enable Snappy compression:

    --compression-codec org.apache.hadoop.io.compress.SnappyCodec

    It is a good idea to use the --as-sequencefile option with this compression option.

  • For Sqoop 2:

    When you create a job (sqoop:000> create job), choose 7 (SNAPPY) as the compression format.

Page generated September 3, 2015.