Step 6: Install CDH and Other Software

Step 1: Configure a Repository Step 2: Install Java Step 3: Install Cloudera Manager Server Step 4: Install Databases Step 5: Set Up the Cloudera Manager Database Step 6: Install CDH and Other Software Step 7: Set Up a Cluster

After setting up the Cloudera Manager database, start Cloudera Manager Server, and log in to the Cloudera Manager Admin Console:

  1. Start Cloudera Manager Server:
    • RHEL 7 compatible, Ubuntu, SLES:
      sudo systemctl start cloudera-scm-server
    • RHEL 6 compatible:
      sudo service cloudera-scm-server start
  2. Wait several minutes for the Cloudera Manager Server to start. To observe the startup process, run the following on the Cloudera Manager Server host:
    sudo tail -f /var/log/cloudera-scm-server/cloudera-scm-server.log
    When you see this log entry, the Cloudera Manager Admin Console is ready:
    INFO WebServerImpl:com.cloudera.server.cmf.WebServerImpl: Started Jetty server.

    If the Cloudera Manager Server does not start, see Troubleshooting Installation Problems.

  3. In a web browser, go to http://<server_host>:7180, where <server_host> is the FQDN or IP address of the host where the Cloudera Manager Server is running.
  4. Log into Cloudera Manager Admin Console. The default credentials are:

    Username: admin

    Password: admin

  5. After you log in, the End User License Terms and Conditions page displays. Read the terms and conditions and then check the box labeled Yes, I accept the End User License Terms and Conditions to accept them. Click Continue, and the installation wizard launches.

Welcome

The Welcome page provides a brief overview of Cloudera Manager, and links to the release notes for the version you are installing. Click Continue to proceed with the installation.

Accept License

The Accept License page provides the End User License Terms and Conditions. Read the license agreement and click the checkbox labeled Yes, I accept the End User License Terms and Conditions if you accept the terms and conditions of the license agreement.

Select Edition

On the Select Edition page, you can select the edition of Cloudera Manager to install and, optionally, install a license:

  1. Choose which edition to install:
    • Cloudera Express, which does not require a license, but provides a limited set of features.
    • Cloudera Enterprise Cloudera Enterprise Trial, which does not require a license, but expires after 60 days and cannot be renewed.
    • Cloudera Enterprise with one of the following license types:
      • Essentials Edition
      • Data Science and Engineering Edition
      • Operational Database Edition
      • Data Warehouse Edition
      • Enterprise Data Hub Edition
    If you choose Cloudera Express or Cloudera Enterprise Cloudera Enterprise Trial, you can upgrade the license at a later time. See Managing Licenses.
  2. If you select Cloudera Enterprise, install a license:
    1. Click the Select License File field.
    2. Browse to the location of your license file, click the file, and click Open.
    3. Click Upload.
  3. Information is displayed indicating what the CDH installation includes. At this point, you can click the Support drop-down menu to access online Help or the Support Portal.
  4. Click Continue to proceed with the installation.

Welcome (Add Cluster - Installation)

The Welcome page of the Add Cluster - Installation wizard provides a brief overview of the installation and configuration procedure, as well as some links to relevant documentation. Click Continue to proceed with the installation.

Setup Auto-TLS

The Setup Auto-TLS page provides instructions for initializing the certificate manager for auto-TLS if you have not done so already. If you already initialized the certificate manager in Step 3: Install Cloudera Manager Server, the wizard displays a message indicating that auto-TLS has been initialized. Click Continue to proceed with the installation.

If you have not already initialized the certificate manager, and you want to enable auto-TLS, follow the instructions provided on the page before continuing. For more information, see Configuring TLS Encryption for Cloudera Manager and CDH Using Auto-TLS.

Specify Hosts

Choose which hosts will run CDH and other managed services

  1. To enable Cloudera Manager to automatically discover hosts on which to install CDH and managed services, enter the cluster hostnames or IP addresses. You can also specify hostname and IP address ranges. For example:
    Range Definition Matching Hosts
    10.1.1.[1-4] 10.1.1.1, 10.1.1.2, 10.1.1.3, 10.1.1.4
    host[1-3].example.com host1.example.com, host2.example.com, host3.example.com
    host[07-10].company.com host07.example.com, host08.example.com, host09.example.com, host10.example.com

    You can specify multiple addresses and address ranges by separating them with commas, semicolons, tabs, or blank spaces, or by placing them on separate lines. Use this technique to make more specific searches instead of searching overly wide ranges. Only scans that reach hosts running SSH will be selected for inclusion in your cluster by default. You can enter an address range that spans over unused addresses and then clear the nonexistent hosts later in the procedure, but wider ranges require more time to scan.

  2. Click Search. If there are a large number of hosts on your cluster, wait a few moments to allow them to be discovered and shown in the wizard. If the search is taking too long, you can stop the scan by clicking Abort Scan. You can modify the search pattern and repeat the search as many times as you need until you see all of the expected hosts.
  3. Verify that the number of hosts shown matches the number of hosts where you want to install services. Clear host entries that do not exist or where you do not want to install services.
  4. Click Continue.

    The Select Repository screen displays.

Select Repository

The Select Repository page allows you to specify repositories for Cloudera Manager Agent and CDH and other software.

In the Cloudera Manager Agent section:

  1. Select either Public Cloudera Repository or Custom Repository for the Cloudera Manager Agent software.
  2. If you select Custom Repository, do not include the operating system-specific paths in the URL. For instructions on setting up a custom repository, see Configuring a Local Package Repository.

In the CDH and other software section:

  1. Select the repository type to use for the installation. In the Install Method section select one of the following:
    • Use Parcels (Recommended)

      A parcel is a binary distribution format containing the program files, along with additional metadata used by Cloudera Manager. Parcels are required for rolling upgrades. For more information, see Parcels.

    • Use Packages

      A package is a standard binary distribution format that contains compiled code and meta-information such as a package description, version, and dependencies. Packages are installed using your operating system package manager.

  2. Select the version of CDH to install.
    1. If you selected Use Parcels and you do not see the version you want to install, click the More Options button to add the repository URL for your version. Repository URLs for CDH 6 version are documented in CDH 6 Download Information. After adding the repository, click Save Changes and wait a few seconds for the version to appear. If your Cloudera Manager host uses an HTTP proxy, click the Proxy Settings button to configure your proxy.
    2. If you selected Use Packages, and the version you want to install is not listed, you can select Custom Repository to specify a repository that contains the desired version. Repository URLs for CDH 6 version are documented in CDH 6 Download Information.
  3. If you selected Use Parcels, specify any Additional Parcels you want to install. If you are installing CDH 6, do not select the KAFKA, KUDU, or SPARK parcels, because they are included in CDH 6.
  4. Click Continue.

The Accept JDK License page displays.

Accept JDK License

To allow Cloudera Manager to automatically install the Oracle JDK on cluster hosts, read the JDK license and check the box labeled Install Oracle Java SE Development Kit (JDK) if you accept the terms. If you installed your own Oracle JDK version in Step 2: Install Java Development Kit, leave the box unchecked.

If you allow Cloudera Manager to install the JDK, a second checkbox appears, labeled Install Java Unlimited Strength Encryption Policy Files. These policy files are required to enable AES-256 encryption in JDK versions lower than 1.8u161. JDK 1.8u161 and higher enable unlimited strength encryption by default, and do not require policy files.

After reading the license terms and checking the applicable boxes, click Continue.

Enter Login Credentials

  1. Select root for the root account, or select Another user and enter the username for an account that has password-less sudo privileges.
  2. Select an authentication method:
    • If you choose password authentication, enter and confirm the password.
    • If you choose public-key authentication, provide a passphrase and path to the required key files.

    You can modify the default SSH port if necessary.

  3. Specify the maximum number of host installations to run at once. The default and recommended value is 10. You can adjust this based on your network capacity.
  4. Click Continue.

The Install Agents page displays.

Install Agents

The Install Agents page displays the progress of the installation. You can click on the Details link for any host to view the installation log. If the installation is stalled, you can click the Abort Installation button to cancel the installation and then view the installation logs to troubleshoot the problem.

If the installation fails on any hosts, you can click the Retry Failed Hosts to retry all failed hosts, or you can click the Retry link on a specific host.

If you selected the option to manually install agents, see Manually Install Cloudera Manager Agent Packages for the procedure and then continue with the next steps on this page.

After installing the Cloudera Manager Agent on all hosts, click Continue.

If you are using parcels, the Install Parcels page displays. If you chose to install using packages, the Inspect Hosts page displays.

Install Parcels

If you selected parcels for the installation method, the Install Parcels page reports the installation progress of the parcels you selected earlier. After the parcels are downloaded, progress bars appear representing each cluster host. You can click on an individual progress bar for details about that host.

After the installation is complete, click Continue.

The Inspect Hosts page displays.

Inspect Hosts

The Inspect Hosts page runs the Host Inspector to search for common configuration problems. View the results and address any identified problems. Click the Run Again button to update the results after making any changes.

After addressing identified problems, click Finish.

This completes the Cluster Installation wizard and launches the Cluster Setup wizard.

Continue to Step 7: Set Up a Cluster Using the Wizard.