Cloud Data Access
Also available as:
PDF
loading table of contents...

Chapter 5. Getting Started with WASB

Windows Azure Storage Blob (WASB) is a general-purpose object store.

The features of WASB include:

  • Object store with flat namespace.

  • Storage account consists of containers, which in turn have data in the form of blobs.

  • Authentication based on shared secrets - Account Access Keys (for account-level authorization) and Shared Access Signature Keys (for account, container, or blob authorization).

Overview of Configuring and Using WASB with HDP

The following table provides an overview of tasks related to configuring and using HDP with WASB. Click on the linked topics to get more information about specific tasks.

TaskDescription
Meet the prerequisites

To use Azure Blob Storage, you must have:

  1. An Azure subscription for Storage Blobs.

  2. A storage account. For instructions on how to create one, refer to Microsoft Azure documentation.

Configure authentication

In order to access data in an WASB account, you must configure authentication with the WASB account by providing the access key.

Configure optional features:

You can optionally configure these additional features.

Work with WASB data:

Once you've configured authentication with your blob storage account, you can access data stored in this account from Hive (via external tables) and Spark, and perform other related tasks such as copying data between HDFS and WASB when needed.