Import Data from RDBMS into an S3 Bucket
The --target-dir
option must be set to the target location in the S3
bucket to import data from RDBMS into an S3 bucket.
Example command: Import data into a target directory in an Amazon S3 bucket.
sqoop import --connect $CONN --username $USER --password $PWD --table $TABLENAME --target-dir s3a://example-bucket/target-directory
Data from RDBMS can be imported into S3 as Sequence or Avro file format too.
Parquet
import into S3 is also supported if the Parquet Hadoop API based implementation is used,
meaning that the --parquet-configurator-implementation
option is set to hadoop
.
Example command: Import data into a target directory in an Amazon S3 bucket as Parquet file.
sqoop import --connect $CONN --username $USER --password $PWD --table $TABLENAME --target-dir s3a://example-bucket/target-directory --as-parquetfile --parquet-configurator-implementation hadoop