What's New in Sqoop

Learn about the new features of Sqoop in Cloudera Runtime 7.1.9.

To access the latest Sqoop documentation on Cloudera's documention web site, go to Sqoop Documentation 1.4.7.7.1.6.0.

Sqoop enhancements to the Hive import process

This release introduces several Sqoop enhancements that enable you to configure how Sqoop imports data from relational databases into Hive. With these enhancements, you can now specify custom Beeline arguments, define custom Hive JDBC arguments, choose how tables are created in Hive using custom CREATE TABLE statements, and configure custom Hive table properties. The changes allow users to control the imported data according to their specific requirements. For more information, see Sqoop enhancements to the Hive import process.

Sqoop Teradata Connector support for ORC file format

A new version of Cloudera Connector Powered by Teradata version 1.8.5.1c7 is released which includes ORC support in the Sqoop-Connector-Teradata component. You can use Teradata Manager to import data from the Teradata server to Hive in ORC format. For more information, see Cloudera Connector Powered by Teradata Release Notes

Discontinued maintenance of direct mode

The Sqoop direct mode feature is no longer maintained. This feature was primarily designed to import data from an abandoned database, which is no longer updated. Using direct mode has several drawbacks:
  • Imports can cause an intermittent and overlapping input split.
  • Imports can generate duplicate data.
  • Many problems, such as intermittent failures, can occur.
  • Additional configuration is required.

Do not use the --direct option in Sqoop import or export commands.

Sqoop direct mode is disabled by default. However, if you still want to use it, enable it by either setting the sqoop.enable.deprecated.direct property globally in Cloudera Manager for Sqoop or by specifying it in the command-line through -Dsqoop.enable.deprecated.direct=true.