COMPUTE_STATS_MIN_SAMPLE_SIZE Query Option

The COMPUTE_STATS_MIN_SAMPLE_SIZE query option specifies the minimum number of bytes that COMPUTE STATS TABLESAMPLE scans, regardless of the user-supplied sampling percent. The option serves two purposes: it prevents sampling on very small tables, where accurate stats can be obtained cheaply from a full scan, and it ensures that any sample is large enough to produce meaningful stats.

Type: integer

Default: 1GB (1073741824 bytes)

Added in: CDH 5.15 / Impala 2.12

Usage notes:
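
The following is a minimal sketch of how the option might be combined with a sampled COMPUTE STATS statement. The table name sales_db.web_logs and the chosen values are hypothetical and used only for illustration. The value is given in bytes, as described above.

```sql
-- Lower the minimum sample size to 100 MB (100 * 1024 * 1024 bytes)
-- for the current session. The value here is an illustrative assumption.
SET COMPUTE_STATS_MIN_SAMPLE_SIZE=104857600;

-- Request a 10 percent sample of a hypothetical table.
-- If 10 percent of the table's data is smaller than the minimum set above,
-- at least the minimum number of bytes is scanned instead.
COMPUTE STATS sales_db.web_logs TABLESAMPLE SYSTEM(10);
```

With the default of 1GB, the same statement on a table of only a few hundred megabytes would be expected to scan the whole table, because the requested sample would fall below the minimum.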