Hive JDBC and ODBC Drivers
Hortonworks provides Hive JDBC and ODBC drivers that let you connect to popular Business Intelligence (BI) tools to query, analyze and visualize data stored within the Hortonworks Data Platform.
For information about using the Hive ODBC drivers and to download a driver, on the Hortonworks Downloads page, click Addons.
Current Hive JDBC client jars can be found on one of the edge nodes in your cluster at
/usr/hdp/current/hive-client/lib/hive-jdbc.jar
after you have installed HDP,
or you can download them from Hive JDBC driver archive.
JDBC URLs have the following format:
jdbc:hive2://<host>:<port>/<dbName>;<sessionConfs>?<hiveConfs>#<hiveVars>
JDBC Parameter | Description |
---|---|
|
The cluster node hosting HiveServer2. |
|
The port number to which HiveServer2 listens. |
|
The name of the Hive database to run the query against. |
|
Optional configuration parameters for the JDBC/ODBC driver in the following
format: |
|
Optional configuration parameters for Hive on the server in the following
format: The configurations last for the duration of the user session. |
|
Optional configuration parameters for Hive variables in the following format:
The configurations last for the duration of the user session. |
The specific JDBC connection URL for a HiveServer2 client depends on several factors:
How is HiveServer2 deployed on the cluster?
What type of transport does HiveServer2 use?
Does HiveServer2 use transport-layer security?
Is HiveServer2 configured to authenticate users?
The rest of this section describes how to use session configuration variables to format the JDBC connection URLs for all these scenarios.
Note | |
---|---|
Some HiveServer2 clients may need to run on a host outside of the Hadoop cluster.
These clients require access to the following .jar files to successfully use the Hive JDBC
driver in both HTTP and HTTPS modes:
|
Embedded and Remote Modes
In embedded mode, HiveServer2 runs within the Hive client rather than in a separate process. No host or port number is necessary for the JDBC connection. In remote mode, HiveServer2 runs as a separate daemon on a specified host and port, and the JDBC client and HiveServer2 interact using remote procedure calls with the Thrift protocol.
Embedded Mode
jdbc:hive2://
Remote Mode
jdbc:hive2://<host>:<port>/<dbName>;<sessionConfs>?<hiveConfs>#<hiveVars>
Note | |
---|---|
The rest of the example JDBC connection URLs in this topic are valid only for HiveServer2 configured in remote mode. |
TCP and HTTP Transport
The JDBC client and HiveServer2 can use either HTTP or TCP-based transport to exchange
RPC messages. Specify the transport used by HiveServer2 with the
transportMode
and httpPath
session configuration
variables. The default transport is TCP.
transportMode Variable Value | Description |
---|---|
|
Connect to HiveServer2 using HTTP transport. |
|
Connect to HiveServer2 using TCP transport. |
HTTP Transport
jdbc:hive2://<host>:<port>/<dbName>;transportMode=http;httpPath=<http_endpoint>;<otherSessionConfs>?<hiveConfs>#<hiveVars>
Note | |
---|---|
The JDBC driver assumes a value of |
TCP Transport
jdbc:hive2://<host>:<port>/<dbName>;<otherSessionConfs>?<hiveConfs>#<hiveVars>
Because the default transport is TCP, there is no need to specify
transportMode=binary
if TCP transport is desired.
User Authentication
HiveServer2 supports Kerberos, LDAP, Pluggable Authentication Modules (PAM), and custom plugins for authenticating the JDBC user connecting to HiveServer2. The format of the JDBC connection URL for authentication with Kerberos differs from the format for other authentication models.
User Authentication Variable | Description |
---|---|
|
A string that uniquely identifies a Kerberos user. |
|
Quality of protection for the SASL framework. The level of quality is negotiated between the client and server during authentication. Used by Kerberos authentication with TCP transport. |
|
Username for non-Kerberos authentication model. |
|
Password for non-Kerberos authentication model. |
Kerberos Authentication
jdbc:hive2://<host>:<port>/<dbName>;principal=<HiveServer2_kerberos_principal>;<otherSessionConfs>?<hiveConfs>#<hiveVars>
Kerberos Authentication with Sasl QOP
jdbc:hive2://<host>:<port>/<dbName>;principal=<HiveServer2_kerberos_principal>;saslQop=<qop_value>;<otherSessionConfs>?<hiveConfs>#<hiveVars>
Non-Kerberos Authentication
jdbc:hive2://<host>:<port>/<dbName>;user=<username>;password=<password>;<otherSessionConfs>?<hiveConfs>#<hiveVars>
Transport Layer Security
HiveServer2 supports SSL and Sasl QOP for transport-layer security. The format of the JDBC connection URL for SSL differs from the format used by Sasl QOP.
SSL Variable | Description |
---|---|
|
Specifies whether to use SSL |
|
The path to the SSL TrustStore. |
|
The password to the SSL TrustStore. |
jdbc:hive2://<host>:<port>/<dbName>;ssl=true;sslTrustStore=<ssl_truststore_path>;trustStorePassword=<truststore_password>;<otherSessionConfs>?<hiveConfs>#<hiveVars>
When using TCP for transport and Kerberos for security, HiveServer2 uses Sasl QOP for encryption rather than SSL.
Sasl QOP Variable | Description |
---|---|
|
A string that uniquely identifies a Kerberos user. |
|
The level of protection desired. For authentication, checksum, and encryption,
specify |
jdbc:hive2://<host>:<port>/<dbName>;principal=<HiveServer2_kerberos_principal>;saslQop=auth-conf;<otherSessionConfs>?<hiveConfs>#<hiveVars>