Changing the HBase column family compression type

Learn how to change the compression type of the HBase column families that Atlas uses. Depending on your use case and operational requirements, you might need to change the default Snappy algorithm to Gzip.

  1. Add the atlas.graph.storage.hbase.compression-algorithm property in Atlas Server Advanced Configuration Snippet for conf/atlas-application.properties.
  2. Set the property to the desired compression type. For example:
    atlas.graph.storage.hbase.compression-algorithm=GZ
  3. In Cloudera Manager, select the Master (Active) role type of the HBase instance. Then, log in to that host by using SSH in a terminal.
  4. Run kinit -kt ***PATH TO HBASE.KEYTAB*** hbase to authenticate into HBase.
  5. Enter the HBase shell.
    1. Run list to see the available HBase tables.
  6. Run desc 'atlas_janus' to see the descriptions of all column families.
  7. Change the compression type for each required column family:
    alter 'atlas_janus', {NAME => 'e' , COMPRESSION => 'GZ'}
    alter 'atlas_janus', {NAME => 'f' , COMPRESSION => 'GZ'}
    alter 'atlas_janus', {NAME => 'g' , COMPRESSION => 'GZ'}
    alter 'atlas_janus', {NAME => 'h' , COMPRESSION => 'GZ'}
    alter 'atlas_janus', {NAME => 'i' , COMPRESSION => 'GZ'}
    alter 'atlas_janus', {NAME => 'l' , COMPRESSION => 'GZ'}
    alter 'atlas_janus', {NAME => 'm' , COMPRESSION => 'GZ'}
    alter 'atlas_janus', {NAME => 's' , COMPRESSION => 'GZ'}
    alter 'atlas_janus', {NAME => 't' , COMPRESSION => 'GZ'}
  8. Run major_compact 'atlas_janus' in the HBase shell to start a major compaction of the table, so that existing data is rewritten with the new compression type.
Your HBase column families now use the newly set compression type.
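
The alter and major_compact statements in steps 7 and 8 follow a fixed pattern, so they can be generated rather than typed one at a time. The following is a minimal sketch, not part of the documented procedure: the function name gen_alter_commands and its default arguments are hypothetical, and the column family list is copied from step 7. It only prints the statements; nothing is sent to HBase.

```shell
#!/usr/bin/env bash
# Sketch: print the HBase shell statements from steps 7 and 8.
# Assumptions (not from the procedure itself): the function name and the
# default argument values. The function only prints text; it does not
# contact HBase.

gen_alter_commands() {
  local table="${1:-atlas_janus}"   # table name used in the steps above
  local codec="${2:-GZ}"            # compression type chosen in step 2
  local cf
  # Column families listed in step 7.
  for cf in e f g h i l m s t; do
    printf "alter '%s', {NAME => '%s', COMPRESSION => '%s'}\n" "$table" "$cf" "$codec"
  done
  # Step 8: major compaction of the table.
  printf "major_compact '%s'\n" "$table"
}
```

After reviewing the printed statements, one possible way to run them, assuming you are already authenticated as in step 4, is to pipe them into a non-interactive HBase shell: gen_alter_commands atlas_janus GZ | hbase shell -n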