Apache Hive

Learn how to set up and configure Apache Hive ODBC connections, including driver installation, DSN creation, and metadata verification.

  1. Download and Install Hive ODBC Driver.
    1. Download the ODBC driver for Hive from the Hive ODBC Driver Download website.
    2. Ensure that you choose the correct driver version, either 32-bit or 64-bit, based on your environment and client tool.
  2. Configure the DSN (Data Source Name).
    1. Open the ODBC Data Source Administrator.
      1. Search for ODBC Data Source in the Start menu.
      2. Select either 32-bit or 64-bit, depending on the installed driver.
    2. Create a New System DSN.
      1. Provide a friendly name, for example Hive_ODBC.
      2. (Optional) Add details, for example Hive ODBC connection.
      3. Enter the hostname or IP address of the Hive service.
      4. Set the port. The default port is 10000 for Hive but you must confirm the value in your cluster setup.
      5. Specify a default database, for example default.
      6. Use the User Name and Password authentication method.
      7. If required, configure SSL settings for a secure connection.
  3. Verify the extracted metadata file. Access the Cloudera Octopai Target Folder (TGT) and troubleshoot issues as needed.
    1. Navigate to the TGT Folder on the server where the Cloudera Octopai Client is installed.
      The default location is C:\Program Files (x86)\Octopai\Service\TGT.
    2. Locate the ZIP file with the Hive Connector name.
      For example, Hive_Metadata_Export.zip.
    3. Verify the file content by checking the quantity and quality of the included files.

If an error occurred during the extraction, perform the following troubleshooting steps:

  1. Check permissions on the Hive server and ODBC connection.
  2. Verify the DSN configuration, including the correct hostname, port, and authentication.
  3. Send the log file with the connector name and number to Cloudera Support.

    You can find the log at C:\Program Files (x86)\Octopai\Service\log.