Cloud Data Access
Also available as:
PDF
loading table of contents...

Analyzing Tables

When working with data in S3, ADLS or WASB, the steps for analyzing tables are the same as when working with data in HDFS.

Table statistics can be gathered automatically by setting hive.stats.autogather=true or by running analyze table test compute statistics command. For example:

ANALYZE TABLE Table1 PARTITION(ds='2008-04-09', hr=11) COMPUTE STATISTICS;

Column statistics are not automatically created. You must manually gather column statistics by running analyze table test compute statistics for columns command. For example:

ANALYZE TABLE Table1 PARTITION(ds='2008-04-09', hr=11) COMPUTE STATISTICS FOR COLUMNS;