Deployment Guide for Azure IaaS

Gathering Your Deployment Information

Deployment of the HDP Standard offering requires that you provide information about the type of cluster you want to configure.

Before you deploy the HDP Standard offering, be sure to gather the cluster layout information specified in the following table:

Table 1.3. Cluster Layout Information for Your Hortonworks Data Platform Standard

Information | Description
Cluster name

Name of the HDP cluster you want to create.

The cluster name must be between 3 and 24 characters long and can contain only numbers and lowercase letters.

After the cluster is created, you can search for this name in the Azure Portal dashboard.
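
The cluster name rule can be checked before you start the deployment. The following is a minimal Python sketch, not part of any HDP or Azure tool: the function name `is_valid_cluster_name` is hypothetical, and the sketch assumes that "letters" means the ASCII letters a to z.

```python
import re

# Assumption: 3 to 24 characters, ASCII digits and lowercase letters only.
_CLUSTER_NAME = re.compile(r"[a-z0-9]{3,24}")


def is_valid_cluster_name(name: str) -> bool:
    """Return True if name satisfies the cluster name rule from the table."""
    return _CLUSTER_NAME.fullmatch(name) is not None


print(is_valid_cluster_name("hdpprod01"))  # valid: lowercase letters and digits
print(is_valid_cluster_name("HDP-Prod"))   # invalid: upper case and a hyphen
```
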

Cluster admin Username

User name for the virtual machine user login and the Ambari administrator.

Authentication Type

Type of authentication to use.

The following authentication types are supported:

  • Password authentication.

    The authentication password must be between 6 and 72 characters long and must meet at least three of the following requirements:

    • One lower case character

    • One upper case character

    • One number

    • One special character

  • OpenSSH public key.

    Generate an OpenSSH public key with ssh-keygen or PuTTYgen, depending on your operating system.
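
Before pasting a public key into the deployment form, it can help to confirm that the text has the single-line OpenSSH shape (`key-type base64-blob optional-comment`). The following Python sketch is one way to do that. The function name `looks_like_openssh_public_key` is hypothetical, and the check only inspects the format of the line: it does not tell you which key types or key sizes the deployment accepts.

```python
import base64
import binascii
import struct


def looks_like_openssh_public_key(line: str) -> bool:
    """Check the 'type base64-blob [comment]' shape of an OpenSSH public key line."""
    fields = line.strip().split(None, 2)
    if len(fields) < 2:
        return False
    key_type, blob_b64 = fields[0], fields[1]
    try:
        blob = base64.b64decode(blob_b64, validate=True)
    except (binascii.Error, ValueError):
        return False
    if len(blob) < 4:
        return False
    # The decoded blob starts with a 4-byte big-endian length, followed by
    # the key type name, which must match the first field of the line.
    (name_len,) = struct.unpack(">I", blob[:4])
    return blob[4:4 + name_len] == key_type.encode("utf-8")
```

You would typically pass in the contents of the `.pub` file that ssh-keygen writes next to the private key.
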

Ambari Password

Password to authenticate the Ambari administrator. If you selected password authentication, this password can be the same as the authentication password. The same length and character requirements apply.

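
The password rules listed under Authentication Type, which also apply to the Ambari password, can be sketched in Python as follows. This is an illustration under stated assumptions, not the check that Azure or Ambari performs: the function name `password_meets_requirements` is hypothetical, "three of the following" is read as "at least three", letters are taken to be ASCII, and "special character" is taken to mean ASCII punctuation.

```python
import string


def password_meets_requirements(password: str) -> bool:
    """Return True for 6 to 72 characters with at least three character classes."""
    if not 6 <= len(password) <= 72:
        return False
    classes = [
        any(c in string.ascii_lowercase for c in password),
        any(c in string.ascii_uppercase for c in password),
        any(c in string.digits for c in password),
        # Assumption: "special character" means ASCII punctuation.
        any(c in string.punctuation for c in password),
    ]
    return sum(classes) >= 3


print(password_meets_requirements("Hadoop2016"))  # lower, upper, and digit
print(password_meets_requirements("hadoop"))      # only one character class
```
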
Do you want to create a High Availability (HA) cluster?

High Availability (HA) clusters run the storage and processing services in redundant mode, so that your tasks can complete even if a service they depend on fails. HA is recommended for production clusters. HA clusters are deployed with five master nodes, and non-HA clusters are deployed with three master nodes.

Master Node Type

In a Hadoop cluster, a master node oversees storage, processing, and management services. You can select from the following machine types, which differ in size, performance, and price:

  • A-Series - A4, A7, A10

  • D-Series - DS4, DS13, DS14, D13_V2, D14_V2, D4_V2, D4, D13

  • G-Series - G4, G5, GS3, GS4, GS5

For more information regarding machine pricing on Azure, see Virtual Machines Pricing.

Number of Worker Nodes

In a Hadoop cluster, worker nodes make up the majority of virtual machines and perform data storage and processing jobs.

When you create an HDP cluster in Azure, you can select either three or five worker nodes.

Worker Node Type

The available machine types are the same as those listed for Master Node Type.
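
One way to record the node choices from this table and check them before deployment is sketched below in Python. The `ClusterLayout` class and its field names are hypothetical and exist only for this illustration. The machine types, the master node counts for HA and non-HA clusters, and the allowed worker node counts are taken from the table.

```python
from dataclasses import dataclass
from typing import List

# Machine types copied from the Master Node Type list in the table.
NODE_TYPES = {
    "A4", "A7", "A10",
    "DS4", "DS13", "DS14", "D13_V2", "D14_V2", "D4_V2", "D4", "D13",
    "G4", "G5", "GS3", "GS4", "GS5",
}


@dataclass(frozen=True)
class ClusterLayout:
    high_availability: bool
    master_node_type: str
    worker_node_count: int
    worker_node_type: str

    @property
    def master_node_count(self) -> int:
        # HA clusters get five master nodes, non-HA clusters get three.
        return 5 if self.high_availability else 3

    def problems(self) -> List[str]:
        """Return a list of human-readable problems; empty if the layout is valid."""
        found = []
        if self.master_node_type not in NODE_TYPES:
            found.append(f"unknown master node type: {self.master_node_type}")
        if self.worker_node_type not in NODE_TYPES:
            found.append(f"unknown worker node type: {self.worker_node_type}")
        if self.worker_node_count not in (3, 5):
            found.append("worker node count must be 3 or 5")
        return found


layout = ClusterLayout(True, "DS13", 5, "D4_V2")
print(layout.master_node_count, layout.problems())  # prints: 5 []
```
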