Indexing Data Using Cloudera Search
Indexing Data
Near Real Time Indexing
Lily HBase Near Real Time Indexing for Cloudera Search
Enabling Cluster-wide HBase Replication
Starting the Lily HBase NRT Indexer Service
Using the Lily HBase NRT Indexer Service
Enabling Replication on HBase Column Families
Creating a Collection in Cloudera Search
Creating a Lily HBase Indexer Configuration File
Creating a Morphline Configuration File
Understanding the extractHBaseCells Morphline Command
Registering a Lily HBase Indexer Configuration with the Lily HBase Indexer Service
Verifying that Indexing Works
Using the Indexer HTTP Interface
Configuring Lily HBase Indexer Security
Configure Lily HBase Indexer to Use TLS/SSL
Configure Lily HBase Indexer Service to Use Kerberos Authentication
Batch Indexing
Spark Indexing
MapReduce Indexing
MapReduceIndexerTool
MapReduceIndexerTool Input Splits
MapReduceIndexerTool Metadata
MapReduceIndexerTool Usage Syntax
Lily HBase Batch Indexing for Cloudera Search
Populating an HBase Table
Creating a Collection in Cloudera Search
Creating a Lily HBase Indexer Configuration File
Creating a Morphline Configuration File
Understanding the extractHBaseCells Morphline Command
Running HBaseMapReduceIndexerTool
HBaseMapReduceIndexerTool Command Line Reference
Using --go-live with SSL or Kerberos
Understanding --go-live and HDFS ACLs