Accessing Cloud Data
Also available as:
PDF
loading table of contents...

Analyzing Tables

When working with data in cloud object stores, the steps for analyzing tables are the same as when working with data in HDFS.

Table statistics can be gathered automatically by setting hive.stats.autogather=true or by running analyze table test compute statistics command. For example:

ANALYZE TABLE Table1 PARTITION(ds='2008-04-09', hr=11) COMPUTE STATISTICS;

Column statistics are not automatically created. You must manually gather column statistics by running analyze table test compute statistics for columns command. For example:

ANALYZE TABLE Table1 PARTITION(ds='2008-04-09', hr=11) COMPUTE STATISTICS FOR COLUMNS;