Set up the development environment

You can create a Hive UDF in a development environment using IntelliJ, for example, and build the UDF with Hive and Hadoop JARS that you download from your Cloudera cluster.

On your cluster, locate the hadoop-common-<version>.jar and hive-exec-<version>.jar.
For example:
```
ls /opt/cloudera/parcels/CDH-7.0.0-* |grep -v test
/opt/cloudera/parcels/CDH-7. . ..jar 
```
Download the JARs to your development computer to add to your IntelliJ project later.
Open IntelliJ and create a new Maven-based project. Click Create New Project. Select Maven and the supported Java version as the Project SDK. Click Next.
Add archetype information.
For example:
- GroupId: com.mycompany.hiveudf
- ArtifactId: hiveudf
Click Next and Finish.
The generated pom.xml appears in sample-hiveudf.

To the pom.xml, add properties to facilitate versioning.

For example:

<properties>
   <hadoop.version>TBD</hadoop.version>
   <hive.version>TBD</hive.version>
</properties>

In the pom.xml, define the repositories.
Use internal repositories if you do not have internet access.

Define dependencies.

For example:

<dependencies>
  <dependency>
       <groupId>org.apache.hive</groupId>
       <artifactId>hive-exec</artifactId>
       <version>${hive.version}</version>
    </dependency>
    <dependency>
       <groupId>org.apache.hadoop</groupId>
       <artifactId>hadoop-common</artifactId>
       <version>${hadoop.version}</version>
    </dependency>
</dependencies>

Select File > Project Structure. Click Modules. On the Dependencies tab, click + to add JARS or directories. Browse to and select the JARs you downloaded in step 1.

Set up the development environment

We want your opinion

How can we improve this page?