1. Apache Sqoop Connectors

Sqoop uses a connector-based architecture which supports plugins that provide connectivity to external systems. Using specialized connectors that work with JDBC drivers, Sqoop connects with external systems that have optimized import and export facilities. Connectors also enable Sqoop to connect with external systems that do not support native JDBC. Connectors are plugin components based on the Sqoop extension framework and can be added to any existing Sqoop installation.

It might be necessary to download and install JDBC drivers for some connectors to function correctly. You can obtain JDBC drivers from the client distribution for your external system's operating system or from the manufacturer of the external system. To install, copy the JDBC driver to the $SQOOP_HOME/lib directory of your Sqoop installation.

Hortonworks provides the following connectors for Sqoop:

  • MySQL connector: This connector is included in the HDP 2 distribution; the instructions for invoking this connector are available here.

  • Netezza connector: This connector is included in the HDP 2 distribution and installs with Sqoop; see here and below for more information.

  • Oracle JDBC connector: The instructions on using this connector are available here.

  • PostgreSQL connector: This connector is included in the HDP 2 distribution and installs with Sqoop; the instructions for invoking this connector are here.

  • Microsoft SQL Server connector: Instructions for using this connector are here.

A Sqoop connector for Teradata is available from the Hortonworks Add-ons page:

  • Teradata connector: The connector and its documentation can be downloaded from here.