Apache Hive overview

Apache Hive content roadmap

The content roadmap provides links to the available content resources for Apache Hive.

Table 1. Apache Hive Content roadmap
| Task | Resources | Source | Description |
| --- | --- | --- | --- |
| Understanding | Presentations and Papers about Hive | Apache wiki | Contains meeting notes, presentations, and whitepapers from the Apache community. |
| Getting Started | Hive Tutorial | Apache wiki | Provides a basic overview of Apache Hive and contains some examples on working with tables, loading data, and querying and inserting data. |
| Installing and Upgrading | Ambari Install Guide | Hortonworks | Describes Ambari, an end-to-end management and monitoring solution for your HDP cluster. Using the Ambari Web UI and REST APIs, you can deploy, operate, manage configuration changes, and monitor services for all nodes in your cluster from a central point. |
| | Ambari Upgrade Guide | Hortonworks | Covers how Ambari and the HDP Stack being managed by Ambari can be upgraded independently, getting ready to upgrade Ambari and HDP, upgrading Ambari, and upgrading HDP. |
| | Installing Hive | Apache wiki | Describes how to install Apache Hive separate from the HDP environment. |
| | Configuring Hive | Apache wiki | Describes how to configure Apache Hive separate from the HDP environment and troubleshoot Hive in HDP. |
| Administering | Setting Up the Metastore | Apache wiki | Describes the metastore parameters. |
| | Setting Up Hive Server | Apache wiki | Describes how to set up the server. How to use a client with this server is described in the HiveServer2 Clients document. |
| Developing | Materialized Views | Apache wiki | Covers accelerating query processing in data warehouses by pre-computing summaries using materialized views. |
| | Hive transactions | Apache wiki | Describes ACID operations in Hive. |
| | Hive Streaming API | Apache wiki | Explains how to use an API for pumping data continuously into Hive using clients such as NiFi and Flume. |
| | Hive Operators and Functions | Apache wiki | Describes the Language Manual UDF. |
| | Beeline: HiveServer2 Client | Apache wiki | Describes how to use the Beeline client. |
| Interactive Queries with Apache Hive LLAP | Setting up Hive LLAP; Hive LLAP on Your Cluster | Hortonworks | Apache Hive enables interactive and sub-second SQL through low-latency analytical processing (LLAP), which makes Hive faster by using persistent query infrastructure and optimized data caching. |
| Hive-Spark Integration | Integrating Apache Hive with Apache Spark - Hive Warehouse Connector | Hortonworks Community Connection | Describes how to read and write data between Spark and Hive. |
| Hive-HBase Integration | HBaseIntegration wiki | Apache wiki | Describes how to integrate the two data access components so that Hive statements can access HBase tables for both read (SELECT) and write (INSERT) operations. |
| Reference | SQL Language Manual | Apache wiki | Language reference documentation available in the Apache wiki. |
| Contributing | Hive Developer FAQ | Apache wiki | Resources available if you want to contribute to the Apache community. |
| | How to Contribute | Apache wiki | |
| | Hive Developer Guide | Apache wiki | |
| | Plug-in Developer Kit | Apache wiki | |
| | Unit Test Parallel Execution | Apache wiki | |
| | Hive Architecture Overview | Apache wiki | |
| | Hive Design Docs | Apache wiki | |
| | Project Bylaws | Apache wiki | |
| Other resources | Hive Mailing Lists | Apache wiki | Additional resources available. |
| | Hive on Amazon Web Services | Apache wiki | |
| | Hive on Amazon Elastic MapReduce | Apache wiki | |