Sqoop 2 Installation

Sqoop 2 is a server-based tool designed to transfer data between Hadoop and relational databases. You can use Sqoop 2 to import data from a relational database management system (RDBMS), such as MySQL or Oracle, into the Hadoop Distributed File System (HDFS), transform the data with Hadoop MapReduce, and then export it back into an RDBMS.

Sqoop 2 has three packaging options for installation:

  • Tarball (.tgz) that contains both the Sqoop 2 server and the client.
  • Separate RPM packages for Sqoop 2 server (sqoop2-server) and client (sqoop2-client)
  • Separate Debian packages for Sqoop 2 server (sqoop2-server) and client (sqoop2-client)