Using secondary indexing in Apache Phoenix
Apache Phoenix uses a secondary index to serve queries. An index table is an Apache Phoenix table that stores the reference copy of some or all the data in the main table.
You can use a secondary index to access data from its primary data access path. When you use a secondary index, the indexed column qualifiers or rows form a unique row key that allows you to do point lookups and range scans.
Secondary index types
- Use global indexes for read-heavy use cases. Use the covered-global index to save on read-time overheads. Global indexes are used to co-locate related information.
- Use local indexes for write-heavy use cases. Use the functional-local index on arbitrary expressions to query specific combinations of index queries. A local index is an in-partition index that is optimized for writes but requires more data to be read to answer a query.
The following tables list the index type and index scope with a description and example for each index type:
Include the data that you want to access from the primary table in the index rows. The query does not have to access the primary table once the index entry is found.
Benefits: Save read-time overhead by only accessing the index entry. In the following example, column v3 is included in the index to avoid the query to access the primary table to retrieve this information.
The following command creates indexes on the v1 and v2 columns and include the v3 column as well:
Create an index on arbitrary expressions. When your query uses the expression, the index is used to retrieve the results instead of the data table.
Benefits: Useful for certain combinations of index queries.
Run the following command to create a functional index so that you can perform case insensitive searches on the combined first name and last name of a person:
Search on the combined first name and last name using the following command:
You can use this when you have read-heavy use cases. Each global index is stored in its own table, and therefore it is not co-located with the data table.
A Global index is a covered index. It is used for queries only when all columns in that query are included in that index.
Run the following command to create a global index:
|Local||You can use this for write-heavy use cases. Each local index is
stored within the data table.
Run the following command to create a local index: