Data Access
Also available as:
PDF
loading table of contents...

Content Roadmap

This roadmap provides links to the available content resources for Apache Hive.

Table 2.1. Hive Content roadmap

TaskResourcesSourceDescription
Understanding

Presentations and Papers about Hive

Apache wikiContains meeting notes, presentations, and whitepapers from the Apache community.
Getting Started

Hive Tutorial

Apache wikiProvides a basic overview of Apache Hive and contains some examples on working with tables, loading data, and querying and inserting data.
Hive Tutorial HortonworksUses the Ambari HDFS file view to store massive data files of baseball statistics. Implements implement Hive queries to analyze, process and filter that data.
Installing and Upgrading

Ambari Automated Install Guide

HortonworksAmbari provides an end-to-end management and monitoring solution for your HDP cluster. Using the Ambari Web UI and REST APIs, you can deploy, operate, manage configuration changes, and monitor services for all nodes in your cluster from a central point.
Non-Ambari Cluster Installation Guide Hortonworks

Describes the information and materials you need to get ready to install the Hortonworks Data Platform (HDP) manually.

Ambari Upgrade GuideHortonworksAmbari and the HDP Stack being managed by Ambari can be upgraded independently. This guide provides information on: Getting ready to upgrade Ambari and HDP, Upgrading Ambari, and Upgrading HDP.
Non-Ambari Cluster Upgrade Guide HortonworksThese instructions cover the upgrade between two minor releases. If you need to upgrade between two maintenance releases, follow the upgrade instructions in the HDP Release Notes.
Installing Hive Apache wikiDescribes how to install Apache Hive separate from the HDP environment.
Configuring Hive Apache wikiDescribes how to configure Apache Hive separate from the HDP environment. Also useful for troubleshooting Hive in HDP.
Administering

Creating your Hive View instance

HortonworksDescribes how to set up and create the Hive 1.0 or Hive 1.5 instance in Ambari.
Setting Up the Metastore Apache wikiDescribes the metastore parameters.
Setting Up Hive Web Interface Apache wikiDescribes the Hive Web Interface, an alternative to using the Hive CLI, its features, configuration and some tips and tricks for using.
Setting Up Hive ServerApache wikiDescribes how to set up the server. How to use a client with this server is described in the HiveServer2 Clients document.
Developing

Using the Hive View

HortonworksShows how to use the Hive view to browse databases, write and execute queries, and manage jobs and history.
Moving data into Hive HortonworksShows the multiple ways to move data into Hive.
Hive Operators and Functions Apache wikiDescribes the Language Manual UDF.
Beeline: HiveServer2 Client Apache wikiDescribes how to use the Beeline client.
Security

Hadoop Security Guide

HortonworksProvides details of the security features implemented in the Hortonworks Data Platform (HDP).
Scheduling WorkflowUsing HDP for Workflow and Scheduling with OozieHortonworksOozie is a server-based workflow engine specialized in running workflow jobs with actions that execute Hadoop jobs, such as MapReduce, Pig, Hive, Sqoop, HDFS operations, and sub-workflows.
High Availability

High Availability for Hive Metastore

HortonworksProvides details for system administrators who need to configure the Hive Metastore service for High Availability.
Performance Tuning

Hive Performance Tuning Guide

HortonworksProvides advanced users architectural overviews and best practices to fine tune their performance.
Interactive Query with Apache Hive LLAP HortonworksApache Hive enables interactive and sub-second SQL through LLAP, a new component introduced in Hive 2.0 that makes Hive faster by using persistent query infrastructure and optimized data caching.
Troubleshooting Hive Queries Running on Tez

How to Analyze or Debug Hive Queries

Hortonworks Community ConnectionA short how-to article and links to demonstration videos about how you can use Tez View to monitor and troubleshoot Hive queries running on Tez.
Using the Tez View in the Ambari Views GuideHortonworksDocuments how you can use the Ambari Tez View GUI to analyze Hive query execution.
Hive-HBase Integration

HBaseIntegration wiki

Apache wiki

Describes how to integrate the two data access components so that HiveQL statements can access HBase tables for both read (SELECT) and write (INSERT) operations.

Reference

Javadocs

Apache wikiLanguage reference documentation available in the Apache wiki.
SQL Language Manual Apache wiki
Contributing

Hive Developer FAQ

Apache wikiResources available if you want to contribute to the Apache community.
How to Contribute Apache wiki
Hive Developer Guide Apache wiki
Plug-in Developer Kit Apache wiki
Unit Test Parallel ExecutionApache wiki
Hive Architecture OverviewApache wiki
Hive Design Docs Apache wiki
Full-Text Search over All Hive Resources Apache wiki

Project Bylaws

Apache wiki
Other resources

Hive Mailing Lists

Apache wikiAdditional resources available.
Hive on Amazon Web ServicesApache wiki
Hive on Amazon Elastic MapReduce Apache wiki