Configuring intermediate results caching
Learn about the configurations required to enable the intermediate results cache for Impala queries.
To use the intermediate results cache, you must configure the following settings. By default, these features are disabled.
If the intermediate results cache storage is shared with other
elements, such as the data cache or scratch space, you might need to adjust existing
quotas (for example, the --data_cache startup flag) to provide
sufficient space.
Impala now stores intermediate query results in the specified local directory. Subsequent queries with matching plan fragments can retrieve data from the cache, which reduces execution time and resource consumption.
You can monitor cache hits and performance by checking the Impala Query Profile. The profile displays metrics for tuple cache hits under the relevant plan nodes.
