Content Roadmap
This roadmap provides links to the available content resources for Apache Hive.
Table 2.1. Hive Content roadmap
Task | Resources | Source | Description |
---|---|---|---|
Understanding | Apache wiki | Contains meeting notes, presentations, and whitepapers from the Apache community. | |
Getting Started | Apache wiki | Provides a basic overview of Apache Hive and contains some examples on working with tables, loading data, and querying and inserting data. | |
Hive Tutorial | Hortonworks | Uses the Ambari HDFS file view to store massive data files of baseball statistics. Implements implement Hive queries to analyze, process and filter that data. | |
Installing and Upgrading | Hortonworks | Ambari provides an end-to-end management and monitoring solution for your HDP cluster. Using the Ambari Web UI and REST APIs, you can deploy, operate, manage configuration changes, and monitor services for all nodes in your cluster from a central point. | |
Non-Ambari Cluster Installation Guide | Hortonworks |
Describes the information and materials you need to get ready to install the Hortonworks Data Platform (HDP) manually. | |
Ambari Upgrade Guide | Hortonworks | Ambari and the HDP Stack being managed by Ambari can be upgraded independently. This guide provides information on: Getting ready to upgrade Ambari and HDP, Upgrading Ambari, and Upgrading HDP. | |
Non-Ambari Cluster Upgrade Guide | Hortonworks | These instructions cover the upgrade between two minor releases. If you need to upgrade between two maintenance releases, follow the upgrade instructions in the HDP Release Notes. | |
Installing Hive | Apache wiki | Describes how to install Apache Hive separate from the HDP environment. | |
Configuring Hive | Apache wiki | Describes how to configure Apache Hive separate from the HDP environment. Also useful for troubleshooting Hive in HDP. | |
Administering | Hortonworks | Describes how to set up and create the Hive 1.0 or Hive 1.5 instance in Ambari. | |
Setting Up the Metastore | Apache wiki | Describes the metastore parameters. | |
Setting Up Hive Web Interface | Apache wiki | Describes the Hive Web Interface, an alternative to using the Hive CLI, its features, configuration and some tips and tricks for using. | |
Setting Up Hive Server | Apache wiki | Describes how to set up the server. How to use a client with this server is described in the HiveServer2 Clients document. | |
Developing | Hortonworks | Shows how to use the Hive view to browse databases, write and execute queries, and manage jobs and history. | |
Moving data into Hive | Hortonworks | Shows the multiple ways to move data into Hive. | |
Hive Operators and Functions | Apache wiki | Describes the Language Manual UDF. | |
Beeline: HiveServer2 Client | Apache wiki | Describes how to use the Beeline client. | |
Security | Hortonworks | Provides details of the security features implemented in the Hortonworks Data Platform (HDP). | |
Scheduling Workflow | Using HDP for Workflow and Scheduling with Oozie | Hortonworks | Oozie is a server-based workflow engine specialized in running workflow jobs with actions that execute Hadoop jobs, such as MapReduce, Pig, Hive, Sqoop, HDFS operations, and sub-workflows. |
High Availability | Hortonworks | Provides details for system administrators who need to configure the Hive Metastore service for High Availability. | |
Performance Tuning | Hortonworks | Provides advanced users architectural overviews and best practices to fine tune their performance. | |
Interactive Query with Apache Hive LLAP | Hortonworks | Apache Hive enables interactive and sub-second SQL through LLAP, a new component introduced in Hive 2.0 that makes Hive faster by using persistent query infrastructure and optimized data caching. | |
Troubleshooting Hive Queries Running on Tez | Hortonworks Community Connection | A short how-to article and links to demonstration videos about how you can use Tez View to monitor and troubleshoot Hive queries running on Tez. | |
Using the Tez View in the Ambari Views Guide | Hortonworks | Documents how you can use the Ambari Tez View GUI to analyze Hive query execution. | |
Hive-HBase Integration |
Apache wiki |
Describes how to integrate the two data access components so that HiveQL statements can access HBase tables for both read (SELECT) and write (INSERT) operations. | |
Reference | Apache wiki | Language reference documentation available in the Apache wiki. | |
SQL Language Manual | Apache wiki | ||
Contributing | Apache wiki | Resources available if you want to contribute to the Apache community. | |
How to Contribute | Apache wiki | ||
Hive Developer Guide | Apache wiki | ||
Plug-in Developer Kit | Apache wiki | ||
Unit Test Parallel Execution | Apache wiki | ||
Hive Architecture Overview | Apache wiki | ||
Hive Design Docs | Apache wiki | ||
Full-Text Search over All Hive Resources | Apache wiki | ||
Apache wiki | |||
Other resources | Apache wiki | Additional resources available. | |
Hive on Amazon Web Services | Apache wiki | ||
Hive on Amazon Elastic MapReduce | Apache wiki |