On-demand Metadata
With the on-demand metadata feature, the Impala coordinators pull
metadata as needed from catalogd
and cache it locally. The
cached metadata gets evicted automatically under memory pressure.
The granularity of on-demand metadata fetches is at the partition level between the
coordinator and catalogd
. Common use cases like add/drop partitions do
not trigger unnecessary serialization/deserialization of large metadata.
The feature can be used in either of the following modes.
- Metadata on-demand mode
- In this mode, all coordinators use the metadata on-demand.
- Mixed mode
- In this mode, only some coordinators are enabled to use the metadata on-demand.
HDFS caching is not supported in On-demand
metadata mode coordinators.
INVALIDATE METADATA
Usage Notes:
Through "automatic invalidation" or "HMS event polling" support, Impala automatically
picks up most changes in metadata from the underlying systems. However there are some
scenarios where you might need to run
INVALIDATE METADATA
or
REFRESH
. - when HMS event polling does not detect changes,
- if you override manually to disable "HMS event polling",
- for the list of other cases when a Global
INVALIDATE METADATA
is recommended.