Configuring Cloud Data Access

Configure ADLS storage locations

After configuring access to ADLS, you can optionally use that ADLS storage account as a base storage location. This storage location is used mainly for the Hive Warehouse Directory, which stores the table data for managed tables.

Steps

  1. When creating a cluster, on the Cloud Storage page in the advanced cluster wizard view, select Use existing ADLS storage and enter information related to your ADLS account, as described in the instructions for configuring access to ADLS.
  2. Under Storage Locations, enable Configure Storage Locations by clicking the button.
  3. Provide your existing directory name under Base Storage Location.
    Note

    Make sure that the directory already exists within the account.

  4. Under the Path for Hive Warehouse Directory property (hive.metastore.warehouse.dir), Cloudbreak automatically suggests a location within the directory. For example, if the directory that you specified is my-test-dir, the suggested location is my-test-adls-account.azuredatalakestore.net/my-test-dir/apps/hive/warehouse. You can optionally update this path or select Do not configure.
    Note

    This directory structure will be created in your specified directory upon the first activity in Hive.
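
The way the suggested location in step 4 is composed can be illustrated with a short sketch. This is not Cloudbreak code; the function name and its parameters are hypothetical. It only reproduces the pattern visible in the example above, assuming the suggestion is always the account host name, followed by the base directory, followed by apps/hive/warehouse.

```python
def suggested_hive_warehouse_path(account_name: str, base_dir: str) -> str:
    """Build a path in the shape of the example from step 4.

    Hypothetical helper: the layout (<account>.azuredatalakestore.net/
    <base-dir>/apps/hive/warehouse) is inferred from the documented
    example and is an assumption, not a Cloudbreak API.
    """
    # Drop leading and trailing slashes so the joined path has no "//".
    base = base_dir.strip("/")
    return f"{account_name}.azuredatalakestore.net/{base}/apps/hive/warehouse"


# Values taken from the example in step 4.
print(suggested_hive_warehouse_path("my-test-adls-account", "my-test-dir"))
# prints: my-test-adls-account.azuredatalakestore.net/my-test-dir/apps/hive/warehouse
```

If you edit the suggested path in the wizard, a helper like this can serve as a quick check that the value you enter still points inside the base directory you specified.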