Known Issues in Apache HBase
Learn about the known issues in HBase, the impact or changes to the functionality, and the workaround.
- OpDB Data Hub cluster fails to initialize if you are reusing a cloud storage location that was used by an older OpDB Data Hub cluster
- Stop HBase using Cloudera Manager before deleting an Operational Database Data Hub cluster.
- IntegrationTestReplication fails if replication does not finish before the verify phase begins
- During IntegrationTestReplication, if the verify phase starts before the replication phase finishes, the test fails because the target cluster does not contain all of the data. If the HBase services in the target cluster do not have enough memory, long garbage-collection pauses might occur.
- Bulk load is not supported when the source is the local HDFS
- The bulk load feature (the completebulkload command) is not supported when the source is the local HDFS and the target is an object store, such as S3/ABFS.
- Snappy compression with /tmp directory mounted with noexec option
- Using HBase client applications, such as hbase hfile, on the cluster with Snappy compression could result in UnsatisfiedLinkError.
- HBASE-28450: BucketCache.evictBlocksByHfileName does not work after a cache recovery from a file
- When the persistent cache is recovered after a region server crashes or restarts, blocks for closed regions or compacted files are not evicted, so the cache keeps filling. Eventually the cache reaches its capacity and read performance degrades.
- HBASE-28458: BucketCache.notifyFileCachingCompleted might incorrectly consider a file fully cached
- This behavior causes some blocks to be wrongly skipped and never cached. It also skews the region caching ratio metrics, which the cache-aware balancer uses to track how much of each region is cached on individual region servers. The balancer becomes less effective, and read performance can degrade once regions are moved to region servers that hold fewer cached blocks for those regions.
- HBASE-28804: Bucket cache retrieval from a persistent file is not asynchronous
- When running with a cache close to capacity (1.6 TB) and around 30M blocks in the cache, recovering the persistent cache after a crash or restart can stretch region server initialization to about four minutes. That causes rolling restarts or upgrades to fail.
- HBASE-28805: Unable to perform chunked persistence of backing map for persistent bucket cache
- The existing HBase cache persistence flushes the whole cache index into the persistent cache file as a single protocol buffer message. When cache usage is high, with more than 24M blocks in the cache, the persistence thread gets an error and aborts. After aborting, the thread does not persist the cache again, leading to a huge cache loss during a crash or restart.
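
The HBASE-28805 description above contrasts writing the whole cache index as one message with persisting it in chunks. The following Python sketch illustrates that general idea only and is not HBase's implementation. The function names (`iter_chunks`, `persist_in_chunks`, `load_chunks`), the JSON encoding, the length-prefixed record layout, and the chunk size are all assumptions made for illustration. HBase itself serializes the index with protocol buffers.

```python
import io
import json
import struct
from itertools import islice


def iter_chunks(index, chunk_size):
    """Yield successive dicts holding at most chunk_size entries of index."""
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    items = iter(index.items())
    while True:
        chunk = dict(islice(items, chunk_size))
        if not chunk:
            return
        yield chunk


def persist_in_chunks(index, stream, chunk_size):
    """Write index to stream as length-prefixed records, one per chunk.

    Returns the number of records written.
    """
    count = 0
    for chunk in iter_chunks(index, chunk_size):
        payload = json.dumps(chunk).encode("utf-8")
        # Hypothetical record layout: 4-byte big-endian length, then payload.
        stream.write(struct.pack(">I", len(payload)))
        stream.write(payload)
        count += 1
    return count


def load_chunks(stream):
    """Rebuild the index by reading length-prefixed records until EOF."""
    index = {}
    while True:
        header = stream.read(4)
        if len(header) < 4:
            return index
        (length,) = struct.unpack(">I", header)
        index.update(json.loads(stream.read(length).decode("utf-8")))


# Usage: a toy index of 10 entries, persisted 4 entries per record.
toy_index = {f"block-{i}": i * 64 for i in range(10)}
buffer = io.BytesIO()
records_written = persist_in_chunks(toy_index, buffer, chunk_size=4)
buffer.seek(0)
restored = load_chunks(buffer)
```

In this sketch each record is bounded by the chunk size, so no single serialized message grows with the total number of cached blocks.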