Data Access

Queries on Data Stored in Remote Clusters

Some environments upgrade HDP Hive on one cluster but leave an older version on another. A common case is a compute-only cluster running a recent version of Hive alongside a data-lake cluster that hosts an older Hive version. In this scenario, you might want to use the resources of the newer HDP Hive installation to run queries against the data maintained by the older Hive installation.

Hive as distributed in HDP version 2.6.3 lets you query data on remote clusters that run the Hive distributed in HDP version 2.4.3. This functionality covers both READ and WRITE access to the remote data.