5. Configure Hive on HDFS

To use Hive, in HDFS create the Hive warehouse directory, the Hive and WebHcat user directories directory, and the WebHCat application folder. And then set permissions on the directory to allow all users access:

  1. Open the command prompt with the Hadoop user account:

    runas /user:hadoop cmd
  2. Make a user directory for hive and the hive warehouse directory as follows:

    %HADOOP_HOME%\bin\hadoop.cmd dfs -mkdir -p /user/hive /hive/warehouse
  3. Make a user and application directory for WebHcat as follows:

    %HADOOP_HOME%\bin\hadoop.cmd dfs -mkdir -p /user/hcat   
    %HADOOP_HOME%\bin\hadoop.cmd dfs -mkdir -p /apps/webhcat   
  4. Change the owner and permissions as follows:

    %HADOOP_HOME%\bin\hadoop.cmd dfs -chown hadoop:hadoop /user/hive
    %HADOOP_HOME%\bin\hadoop.cmd dfs -chmod -R 755 /user/hive
    %HADOOP_HOME%\bin\hadoop.cmd dfs -chown -R hadoop:users  /hive/warehouse
    %HADOOP_HOME%\bin\hadoop.cmd dfs -chown -R hadoop:hadoop /user/hcat   
    %HADOOP_HOME%\bin\hadoop.cmd dfs -chmod -R 777 /hive/warehouse
    %HADOOP_HOME%\bin\hadoop.cmd dfs -chown -R hadoop:users /apps/webhcat 
    %HADOOP_HOME%\bin\hadoop.cmd dfs -chmod -R 755 /apps/webhcat


loading table of contents...