This is the documentation for CDH 5.0.x. Documentation for other versions is available at Cloudera Documentation.

Installing and Upgrading the HCatalog RPM or Debian Packages

Installing the HCatalog RPM or Debian packages is more convenient than installing the HCatalog tarball because the packages:

  • Handle dependencies
  • Provide for easy upgrades
  • Automatically install resources to conventional locations

HCatalog comprises the following packages:

  • hive-hcatalog - HCatalog wrapper for accessing the Hive metastore, libraries for MapReduce and Pig, and a command-line program
  • hive-webhcat - A REST API server for HCatalog
  • hive-webhcat-server - Installs hive-webhcat and a server init script
  Important:

If you have not already done so, install Cloudera's yum, zypper/YaST or apt repository before using the following commands to install HCatalog. For instructions, see CDH 5 Installation.

Upgrading HCatalog from CDH 4 to CDH 5

To upgrade HCatalog from CDH 4 to CDH 5, proceed as follows.

  Note:

If you have already performed the steps to uninstall CDH 4 and all components, as described under Upgrading to CDH 5 from CDH 4, you can skip Step 1 below and proceed with installing the new CDH 5 version of HCatalog.

Step 1: Remove the CDH 4 version of HCatalog

To remove HCatalog on a Red Hat-compatible system:

$ sudo yum remove webhcat-server hcatalog 

To remove HCatalog on an Ubuntu or other Debian system:

$ sudo apt-get remove webhcat-server hcatalog 

To remove HCatalog on a SLES system:

$ sudo zypper remove webhcat-server hcatalog

Step 2: Install the new version of WebHCat and HCatalog

Follow instructions under Installing the WebHCat REST Server and Installing HCatalog for Use with Pig and MapReduce.

  Important: Configuration files
  • If you install a newer version of a package that is already on the system, configuration files that you have modified will remain intact.
  • If you uninstall a package, the package manager renames any configuration files you have modified from <file> to <file>.rpmsave. If you then re-install the package (probably to install a new version) the package manager creates a new <file> with applicable defaults. You are responsible for applying any changes captured in the original configuration file to the new configuration file. In the case of Ubuntu and Debian upgrades, you will be prompted if you have made changes to a file for which there is a new version; for details, see Automatic handling of configuration files by dpkg.

The upgrade is now complete.

Upgrading HCatalog from an Earlier CDH 5 Release

Follow instructions under Installing the WebHCat REST Server and Installing HCatalog for Use with Pig and MapReduce.

  Important: Configuration files
  • If you install a newer version of a package that is already on the system, configuration files that you have modified will remain intact.
  • If you uninstall a package, the package manager renames any configuration files you have modified from <file> to <file>.rpmsave. If you then re-install the package (probably to install a new version) the package manager creates a new <file> with applicable defaults. You are responsible for applying any changes captured in the original configuration file to the new configuration file. In the case of Ubuntu and Debian upgrades, you will be prompted if you have made changes to a file for which there is a new version; for details, see Automatic handling of configuration files by dpkg.

The upgrade is now complete.

Installing the WebHCat REST Server

  Note:

It is not necessary to install WebHCat if you will not be using the REST API. Pig and MapReduce do not need it.

To install the WebHCat REST server on a Red Hat system:

$ sudo yum install hive-webhcat-server

To install the WebHCat REST server components on an Ubuntu or other Debian system:

$ sudo apt-get install hive-webhcat-server

To install the WebHCat REST server components on a SLES system:

$ sudo zypper install hive-webhcat-server
  Note:
  • You can change the default port 50111 by creating or editing the following file and restarting WebHCat:
    /etc/webhcat/conf/webhcat-site.xml
    The property to change is:
    <configuration>
      <property>
        <name>templeton.port</name>
        <value>50111</value>
        <description>The HTTP port for the main server.</description>
      </property>
    </configuration>
  • To uninstall WebHCat you must remove two packages: hive-webhcat-server and hive-webhcat.

Installing HCatalog for Use with Pig and MapReduce

On hosts that will be used to launch Pig scripts or MapReduce applications using table information, install HCatalog as follows:

To install the HCatalog client components on a Red Hat system:

$ sudo yum install hive-hcatalog

To install the HCatalog client components on an Ubuntu or other Debian system:

$ sudo apt-get install hive-hcatalog

To install the HCatalog client components on a SLES system:

$ sudo zypper install hive-hcatalog
Page generated September 3, 2015.