Configuring Sqoop 2
This section explains how to configure the Sqoop 2 server.
Configuring which Hadoop Version to Use
The Sqoop 2 client does not interact directly with Hadoop MapReduce, and so it does not require any MapReduce configuration.
The Sqoop 2 server can work with either MRv1 or YARN. It cannot work with both simultaneously.
- To use
YARN:
alternatives --set sqoop2-tomcat-conf /etc/sqoop2/tomcat-conf.dist
- To use
MRv1:
alternatives --set sqoop2-tomcat-conf /etc/sqoop2/tomcat-conf.mr1
In earlier releases, the mechanism for setting the MapReduce version was the CATALINA_BASEvariable in the /etc/defaults/sqoop2-server file. This does not work as of CDH 5 Beta 2, and in fact could cause problems. Check your /etc/defaults/sqoop2-server file and make sure CATALINA_BASE is not set.
Installing the JDBC Drivers
Sqoop 2 does not ship with third party JDBC drivers. You must download them separately and save them to the /var/lib/sqoop2/ directory. The following sections show how to install the most common JDBC Drivers.
Installing the MySQL JDBC Driver
- Download the MySQL JDBC driver onto the Sqoop 2 server from http://www.mysql.com/downloads/connector/j/5.1.html. You will need to sign up for an account if you don't already have one, and log in, before you can download it.
- Copy the JAR file to the /var/lib/sqoop2/ directory on the Sqoop 2 server. For example:
$ sudo cp mysql-connector-java-version/mysql-connector-java-version-bin.jar /var/lib/sqoop2/
At the time of publication, version was 5.1.31, but the version may have changed by the time you read this.
Important: Make sure you have at least version 5.1.31. Some systems ship with an earlier version that may not work correctly with Sqoop.
- Restart the Sqoop 2 server to load the driver.
Installing the Oracle JDBC Driver
- Download the JDBC Driver from the Oracle website, for example http://www.oracle.com/technetwork/database/enterprise-edition/jdbc-112010-090769.html. You must accept the license agreement before you can download the driver. Download the ojdbc6.jar file.
- Copy the JAR file to /var/lib/sqoop2/ directory:
$ sudo cp ojdbc6.jar /var/lib/sqoop2/
- Restart the Sqoop 2 server to load the driver.
Installing the Microsoft SQL Server JDBC Driver
- Download the Microsoft SQL Server JDBC driver from http://www.microsoft.com/en-us/download/details.aspx?displaylang=en&id=11774
- Copy the JAR file to the /var/lib/sqoop2/ directory. For example:
$ curl -L 'http://download.microsoft.com/download/0/2/A/02AAE597-3865-456C-AE7F-613F99F850A8/sqljdbc_4.0.2206.100_enu.tar.gz' | tar xz $ sudo cp sqljdbc_4.0/enu/sqljdbc4.jar /var/lib/sqoop2/
- Restart the Sqoop 2 server to load the driver.
Installing the PostgreSQL JDBC Driver
- Download the PostgreSQL JDBC driver from http://jdbc.postgresql.org/download.html
- Copy the JAR file to the /var/lib/sqoop2/ directory. For example:
$ curl -L 'http://jdbc.postgresql.org/download/postgresql-9.2-1002.jdbc4.jar' -o postgresql-9.2-1002.jdbc4.jar $ sudo cp postgresql-9.2-1002.jdbc4.jar /var/lib/sqoop2/
- Restart the Sqoop 2 server to load the driver.
<< Installing Sqoop 2 | Starting, Stopping, and Accessing the Sqoop 2 Server >> | |