Submit a Hive Warehouse Connector Python app

You can submit a Python app based on the HiveWarehouseConnector library by following the steps for submitting a Scala or Java application and then adding a Python package.

  1. Locate the hive-warehouse-connector-assembly jar in the /hive_warehouse_connector/ directory.
  2. Add the connector jar to the app submission using the --jars option.
    spark-shell --jars <path to jars>/hive_warehouse_connector/hive-warehouse-connector-assembly-<version>.jar
  3. Locate the pyspark_hwc zip package in the /hive_warehouse_connector/ directory.
  4. Add the Python package for the connector to the app submission using the --py-files option, alongside the connector jar.
    pyspark --jars <path to jars>/hive_warehouse_connector/hive-warehouse-connector-assembly-<version>.jar --py-files /usr/hdp/current/hive_warehouse_connector/pyspark_hwc-<version>.zip
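
The final pyspark command can also be assembled by a small script, which is handy when the connector version changes between clusters. The following is a minimal sketch, not part of the connector: the helper name build_hwc_command is hypothetical, and it assumes that the jar and the zip package sit in the same directory and share one version string. The directory and version in the usage lines are example values only; substitute the ones from your installation.

```python
import shlex


def build_hwc_command(hwc_dir, version, launcher="pyspark"):
    """Return the argument list that launches a Python app with the connector.

    hwc_dir  -- directory holding the connector jar and the pyspark_hwc zip
                (assumed to be the same directory for both files)
    version  -- version string shared by both file names (an assumption)
    launcher -- launcher executable; "pyspark" mirrors the command in the steps
    """
    base = hwc_dir.rstrip("/")
    jar = f"{base}/hive-warehouse-connector-assembly-{version}.jar"
    py_zip = f"{base}/pyspark_hwc-{version}.zip"
    # --jars adds the connector jar; --py-files adds the Python package.
    return [launcher, "--jars", jar, "--py-files", py_zip]


if __name__ == "__main__":
    # Example values only; replace with your own directory and version.
    args = build_hwc_command("/usr/hdp/current/hive_warehouse_connector", "1.0.0")
    print(shlex.join(args))
```

Returning a list rather than a single string lets the result be passed directly to subprocess.run without shell quoting problems; shlex.join is used only to print a copy-pasteable form.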