Compactor properties
You check and change a number of Apache Hive properties to configure the compaction of delta files that accumulate during data ingestion. You need to know the defaults and valid values.
Basic compactor properties
- hive.compactor.initiator.on
- Default=false
- hive.compactor.cleaner.on
- Default=false
- hive.compactor.worker.threads
- Default=0
- hive.metastore.runworker.in
- Default=HS2
- hive.compactor.abortedtxn.threshold
- Default=1000 aborts
- hive.compactor.aborted.txn.time.threshold
- Default=12 hours
Advanced compactor properties
- hive.compactor.worker.timeout
- Default=86400s
- hive.compactor.check.interval
- Default=300s
- hive.compactor.delta.num.threshold
- Default=10
- hive.compactor.delta.pct.threshold
- Default=0.1
- hive.compactor.max.num.delta
- Default=500
- hive.compactor.wait.timeout
- Default=300000
- hive.compactor.initiator.failed.compacts.threshold
- Default=2
- hive.compactor.cleaner.run.interval
- Default=5000ms
- hive.compactor.job.queue
- Specifies the Hadoop queue name to which compaction jobs are submitted. If the value is an empty string, Hadoop chooses the default queue to submit compaction jobs.
- hive.compactor.compact.insert.only
- Default=true
- hive.compactor.crud.query.based
- Default=false
- hive.split.grouping.mode
- Default=query
- hive.compactor.history.retention.succeeded
- Default=3
- hive.compactor.history.retention.failed
- Default=3
- hive.compactor.history.retention.attempted
- Default=2
- hive.compactor.history.reaper.interval
- Default=2m