This is the documentation for CDH 5.0.x. Documentation for other versions is available at Cloudera Documentation.

About Hive

Apache Hive is a data warehousing application built on top of Hadoop; it lets you query your data using HiveQL, a language that is similar to SQL.

Note:

As of CDH 5, Hive includes HCatalog, but you still need to install HCatalog separately if you want to use it; see Installing and Using HCatalog.

Install Hive on the client machine(s) from which you submit jobs; you do not need to install it on the nodes in your Hadoop cluster.

HiveServer2

You need to deploy HiveServer2, an improved version of HiveServer. It supports a Thrift API tailored for JDBC and ODBC clients, Kerberos authentication, and multi-client concurrency. The CLI for HiveServer2 is Beeline.

Important:

The original HiveServer and command-line interface (CLI) are no longer supported; use HiveServer2 and Beeline.
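As an illustration of how a client reaches HiveServer2 through Beeline, the following is a minimal sketch in Python that assembles a Beeline command line. The helper names, the host hs2.example.com, the user alice, and the query are hypothetical; the sketch also assumes that HiveServer2 listens on port 10000 and that the target database is default, so adjust these values for your deployment. It only builds and prints the command; it does not contact a server.

```python
import shlex


def hiveserver2_jdbc_url(host, port=10000, database="default"):
    """Build a JDBC URL for HiveServer2 (hypothetical helper).

    Assumes the server listens on port 10000 unless told otherwise.
    """
    return f"jdbc:hive2://{host}:{port}/{database}"


def beeline_command(host, user, query, port=10000, database="default"):
    """Return the argument list for a one-shot Beeline invocation.

    -u gives the JDBC URL, -n the user name, and -e the query to run.
    """
    url = hiveserver2_jdbc_url(host, port, database)
    return ["beeline", "-u", url, "-n", user, "-e", query]


if __name__ == "__main__":
    # Hypothetical host, user, and query, for illustration only.
    cmd = beeline_command("hs2.example.com", "alice", "SHOW TABLES;")
    # Print a shell-quoted form that could be pasted into a terminal.
    print(shlex.join(cmd))
```

The command is kept as a list of arguments so that it can be passed to subprocess.run without shell quoting problems. A cluster secured with Kerberos typically needs additional connection parameters in the URL, which this sketch leaves out.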

Page generated September 3, 2015.