Configuring Services to Use the GPL Extras Parcel
After you install the GPL Extras parcel, reconfigure and restart the services that need LZO functionality. Services that do not use LZO do not need to be reconfigured.
HDFS and MapReduce
- Go to the HDFS service.
- Click the Configuration tab.
- Search for the io.compression.codecs property.
- In the Compression Codecs property, click in the field, then click the + sign to open a new value field.
- Add the following two codecs:
- com.hadoop.compression.lzo.LzoCodec
- com.hadoop.compression.lzo.LzopCodec
- Save your configuration changes.
- Restart HDFS.
- Redeploy the HDFS client configuration.
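After the client configuration is redeployed, you may want to confirm that both codecs are present in the effective value of io.compression.codecs. The following is a minimal sketch, not part of the product: has_lzo_codecs is a hypothetical helper name, and it assumes the property value is a comma-separated list of class names.

```shell
# has_lzo_codecs is a hypothetical helper: it succeeds only if both LZO codec
# class names appear in the comma-separated codec list passed as $1.
has_lzo_codecs() {
    # Strip whitespace so "a, b" and "a,b" are treated the same way.
    codecs=$(printf '%s' "$1" | tr -d ' \n\t')
    for codec in com.hadoop.compression.lzo.LzoCodec \
                 com.hadoop.compression.lzo.LzopCodec; do
        case ",$codecs," in
            *",$codec,"*) ;;   # codec found, check the next one
            *) echo "missing codec: $codec" >&2; return 1 ;;
        esac
    done
}

# Example on a host with the redeployed client configuration (uncomment to run):
# has_lzo_codecs "$(hdfs getconf -confKey io.compression.codecs)" \
#     && echo "LZO codecs configured"
```

The commented invocation reads the value with hdfs getconf, so it reflects the client configuration on that host rather than what is shown in the Configuration tab.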
Oozie
- On each Oozie server, go to /var/lib/oozie and create a symlink to the Hadoop LZO JAR, even if the LZO JAR is already present there:
- CDH 5 - /opt/cloudera/parcels/GPLEXTRAS/lib/hadoop/lib/hadoop-lzo.jar
- CDH 4 - /opt/cloudera/parcels/HADOOP_LZO/lib/hadoop/lib/hadoop-lzo.jar
- Restart Oozie.
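The symlink step for Oozie can be sketched as a small shell function. This is an illustration under stated assumptions: link_lzo_jar is a hypothetical helper name, the paths are the ones listed in the steps, and ln -sf replaces any existing entry named hadoop-lzo.jar in the target directory, which may or may not be what you want on your servers.

```shell
# link_lzo_jar is a hypothetical helper: it links the JAR given as $1 into
# the directory given as $2, replacing an existing entry of the same name.
link_lzo_jar() {
    jar_path="$1"     # e.g. /opt/cloudera/parcels/GPLEXTRAS/lib/hadoop/lib/hadoop-lzo.jar
    target_dir="$2"   # e.g. /var/lib/oozie
    if [ ! -e "$jar_path" ]; then
        echo "JAR not found: $jar_path" >&2
        return 1
    fi
    # -s creates a symbolic link; -f removes an existing destination file first.
    ln -sf "$jar_path" "$target_dir/"
}

# Example invocation on a CDH 5 Oozie server (uncomment to run):
# link_lzo_jar /opt/cloudera/parcels/GPLEXTRAS/lib/hadoop/lib/hadoop-lzo.jar /var/lib/oozie
```

Pass an absolute JAR path, as in the example, so that the link resolves regardless of the directory it is read from. For CDH 4, substitute the HADOOP_LZO parcel path from the list above.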
HBase
Restart HBase.
Impala
Restart Impala.
Hive
Restart the Hive server.
Sqoop 1
- Add the following entries to the Sqoop 1 Client Client Advanced Configuration Snippet (Safety Valve):
HADOOP_CLASSPATH=$HADOOP_CLASSPATH:/opt/cloudera/parcels/GPLEXTRAS/lib/hadoop/lib/
JAVA_LIBRARY_PATH=$JAVA_LIBRARY_PATH:/opt/cloudera/parcels/GPLEXTRAS/lib/hadoop/lib/native
- Redeploy the client configuration.