Hortonworks Docs
»
Data Platform 3.1.0
»
Using Apache HBase to store and access data
Using Apache HBase to store and access data
Also available as:
What's New in Apache HBase
Overview of Apache HBase
Apache HBase installation
Installing HBase through Ambari
HBase cluster capacity planning
Configuring HBase cluster for the first time
Increase the request handler thread count
Configure the size and number of WAL files
Configure compactions
Considerations for splitting tables
Tune JVM garbage collection in RegionServers
Node count and JVM configuration
Physical size of the data
Read-Write Throughput
Options to increase HBase Region count and size
Increasing MemStore size for RegionServer
Increasing the size of Region
Enable multitenancy with namespaces
Default HBase namespace actions
Define and drop namespaces
Security features that are available
Managing Apache HBase clusters
Monitoring Apache HBase clusters through Grafana-based dashboard
Optimizing Apache HBase I/O
HBase I/O components
Configuring BlockCache
Configure On-Heap BlockCache
Guidelines for configuring On-Heap BlockCache (LruBlockCache)
Prerequisites to configure Off-Heap Memory (BucketCache)
Configure BucketCache
BlockCache compression
Enable BlockCache compression
BlockCache-and-MemStore-Properties
Import data into HBase with Bulk load
Using Snapshots in HBase
Configure a Snapshot
Take a Snapshot
List Snapshots
Delete Snapshots
Clone a table from a Snapshot
Restore a Snapshot
Snapshot Operations and ACLs
Export data to another cluster
Backing up and restoring Apache HBase datasets
Planning a backup-and-restore Strategy for your environment
Backup within a Cluster
Backup to the dedicated HDFS archive cluster
Backup to the Cloud or a Storage vendor
Best practices for backup-and-restore
Running the backup-and-restore utility
Create and maintain a complete backup image
Command for creating HBase backup image
Monitor backup progress
Using backup sets
Restore a backup image
Administering and deleting backup images
HBase backup commands
Incremental backup-and-restore
Example scenario: Safeguarding application datasets on Amazon S3
Medium Object (MOB) storage support in Apache HBase
Methods to enable MOB storage support
Method 1:Enable MOB Storage support using configure options in the command line
Method 2: Invoke MOB support parameters in a Java API
Test the MOB storage support configuration
MOB storage cache properties
Method 1: Enter property settings using Ambari
Method 2: Enter property settings directly in the hbase-site.xml file
MOB cache properties
HBase quota management
Setting up quotas
General Quota Syntax
Throttle quotas
Throttle quota examples
Space quotas
Quota enforcement
Quota violation policies
Impact of quota violation policy
Live Write Access
Bulk Write Access
Read Access
Metrics and Insight
Examples of overlapping quota policies
Number-of-Tables Quotas
Number-of-Regions Quotas
Understanding Apache HBase Hive integration
Prerequisites
Configuring HBase and Hive
Using HBase Hive integration
HBase Hive integration example
Using Hive to access an existing HBase table example
Understanding Bulk Loading
Understanding HBase Snapshots
HBase Best Practices
Managing Apache HBase clusters
You will get an understanding on how to manage your HBase clusters.
Monitoring Apache HBase clusters through Grafana-based dashboard
If you have an Ambari-managed HBase cluster, you can monitor the cluster performance with Grafana-based dashboards.
Optimizing Apache HBase I/O
You can optimize HBase I/O using several ways. Two HBase key concepts that helps you in the process are BlockCache and MemStore tuning.
Import data into HBase with Bulk load
You can importa data with a bulk load operation to bypass the HBase API and writes content, properly formatted as HBase data files (HFiles), directly to the file system. It uses fewer CPU and network resources than using the HBase API for similar work.
Using Snapshots in HBase
HBase snapshot support enables you to take a snapshot of a table without much impact on RegionServers, because snapshot, clone, and restore operations do not involve data copying. In addition, exporting a snapshot to another cluster has no impact on RegionServers.
© 2012–2019, Hortonworks, Inc.
Document licensed under the
Creative Commons Attribution ShareAlike 4.0 License
.
Hortonworks.com
|
Documentation
|
Support
|
Community