Analyzing Tables
When working with data in cloud object stores, the steps for analyzing tables are the same as when working with data in HDFS.
Table statistics can be gathered automatically by setting hive.stats.autogather=true, or manually by running the analyze table test compute statistics command. For example:
ANALYZE TABLE Table1 PARTITION(ds='2008-04-09', hr=11) COMPUTE STATISTICS;
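The two approaches can be sketched together as follows. This is a minimal illustration, not a required procedure: it assumes the Table1 partition from the example above already exists, and it uses DESCRIBE FORMATTED only as one way to check that basic statistics were recorded.

```sql
-- Enable automatic gathering of table statistics for this session.
-- Statistics are then collected as part of subsequent write operations.
SET hive.stats.autogather=true;

-- Alternatively, gather statistics explicitly for one partition.
ANALYZE TABLE Table1 PARTITION(ds='2008-04-09', hr=11) COMPUTE STATISTICS;

-- Inspect the partition metadata. Basic statistics such as numRows and
-- totalSize are listed among the partition parameters.
DESCRIBE FORMATTED Table1 PARTITION(ds='2008-04-09', hr=11);
```

Setting the property with SET affects only the current session. To apply it more broadly, it would need to be set in the Hive configuration.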
Column statistics are not automatically created. You must manually gather column statistics by running the analyze table test compute statistics for columns command. For example:
ANALYZE TABLE Table1 PARTITION(ds='2008-04-09', hr=11) COMPUTE STATISTICS FOR COLUMNS;
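If statistics are needed for only some columns, the command also accepts a column list, which can reduce the work done on wide tables. The sketch below assumes Table1 has columns named userid and page_url; these names are hypothetical and are used only for illustration.

```sql
-- Gather column statistics for selected columns only.
-- userid and page_url are hypothetical column names.
ANALYZE TABLE Table1 PARTITION(ds='2008-04-09', hr=11)
  COMPUTE STATISTICS FOR COLUMNS userid, page_url;
```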