Data Access
Also available as:
PDF
loading table of contents...

Content Roadmap

This roadmap provides links to the available content resources for Apache Hive.

Table 2.1. Hive Content roadmap

TaskResourcesSourceDescription
Understanding

Presentations and Papers about Hive

Apache wikiContains meeting notes, presentations, and whitepapers from the Apache community.
Getting Started

Hive Tutorial

Apache wikiProvides a basic overview of Apache Hive and contains some examples on working with tables, loading data, and querying and inserting data.
Hive Tutorial HortonworksUses the Ambari HDFS file view to store massive data files of statistics. Implements implement Hive queries to analyze, process, and filter that data.
Installing and Upgrading

Ambari Automated Install Guide

HortonworksAmbari provides an end-to-end management and monitoring solution for your HDP cluster. Using the Ambari Web UI and REST APIs, you can deploy, operate, manage configuration changes, and monitor services for all nodes in your cluster from a central point.
Non-Ambari Cluster Installation Guide Hortonworks

Describes the information and materials you need to get ready to install the Hortonworks Data Platform (HDP) manually.

Ambari Upgrade GuideHortonworksAmbari and the HDP Stack being managed by Ambari can be upgraded independently. This guide provides information on: Getting ready to upgrade Ambari and HDP, Upgrading Ambari, and Upgrading HDP.
Non-Ambari Cluster Upgrade Guide HortonworksThese instructions cover the upgrade between two minor releases. If you need to upgrade between two maintenance releases, follow the upgrade instructions in the HDP Release Notes.
Installing Hive Apache wikiDescribes how to install Apache Hive separate from the HDP environment.
Configuring Hive Apache wikiDescribes how to configure Apache Hive separate from the HDP environment. Also useful for troubleshooting Hive in HDP.
Administering

Creating your Hive View instance

HortonworksDescribes how to set up and create the Hive 1.0 or Hive 1.5 instance in Ambari.
Setting Up the Metastore Apache wikiDescribes the metastore parameters.
Setting Up Hive Web Interface Apache wikiDescribes the Hive Web Interface, an alternative to using the Hive CLI, its features, configuration and some tips and tricks for using.
Setting Up Hive ServerApache wikiDescribes how to set up the server. How to use a client with this server is described in the HiveServer2 Clients document.
Developing

Using Hive View 2.0

HortonworksShows how to use the Hive view to browse databases, write and execute queries, and manage jobs and history.
Moving data into Hive HortonworksShows the multiple ways to move data into Hive.
Hive Operators and Functions Apache wikiDescribes the Language Manual UDF.
Beeline: HiveServer2 Client Apache wikiDescribes how to use the Beeline client.
Security

Hadoop Security Guide

HortonworksProvides details of the security features implemented in the Hortonworks Data Platform (HDP).
Scheduling WorkflowUsing HDP for Workflow and Scheduling with OozieHortonworksOozie is a server-based workflow engine specialized in running workflow jobs with actions that execute Hadoop jobs, such as MapReduce, Pig, Hive, Sqoop, HDFS operations, and sub-workflows.
High Availability

High Availability for Hive Metastore

HortonworksProvides details for system administrators who need to configure the Hive Metastore service for High Availability.
Performance Tuning

Hive Performance Tuning Guide

HortonworksLists best practices for an HDP Hive cluster, both for users who run interactive queries and for users who analyze a Hive EDW with batch processing.
Interactive Queries with Apache Hive LLAP

Setting up Hive LLAP

Hive LLAP on Your Cluster

YouTube video: Enable Hive LLAP on HDP 2.6 for Interactive SQL

HortonworksApache Hive enables interactive and sub-second SQL through low-latency analytical processing (LLAP)., a new component introduced in Hive 2.0 that makes Hive faster by using persistent query infrastructure and optimized data caching.
Hive-HBase Integration

HBaseIntegration wiki

Apache wiki

Describes how to integrate the two data access components so that HiveQL statements can access HBase tables for both read (SELECT) and write (INSERT) operations.

Reference

Javadocs

Apache wikiLanguage reference documentation available in the Apache wiki.
SQL Language Manual Apache wiki
Contributing

Hive Developer FAQ

Apache wikiResources available if you want to contribute to the Apache community.
How to Contribute Apache wiki
Hive Developer Guide Apache wiki
Plug-in Developer Kit Apache wiki
Unit Test Parallel ExecutionApache wiki
Hive Architecture OverviewApache wiki
Hive Design Docs Apache wiki
Full-Text Search over All Hive Resources Apache wiki

Project Bylaws

Apache wiki
Other resources

Hive Mailing Lists

Apache wikiAdditional resources available.
Hive on Amazon Web ServicesApache wiki
Hive on Amazon Elastic MapReduce Apache wiki