Setting up Amazon S3 data connection

Amazon S3 object store connection is automatically created for CML Workspaces to make it easier to connect to the data stored within the same environment. Other Data Connections can be configured to other S3 locations manually. You can also set up a connection manually.

Amazon S3 data connections are available only on AWS workspaces where RAZ (Ranger Authorization Service) is enabled to authorize connections to the environment’s S3 buckets.
  1. In the Cloudera Data Platform (CDP) console, click the Machine Learning tile.
    The Home page displays.
  2. Click Site Administration in the left navigation menu.
    The Site Administration page displays.
  3. In the Data Connections page, click New Connection.
    The New Data Connection window is displayed.
  4. In the Name field, enter a name for the connection.
  5. In the Type drop-down list, select the type as S3 Object Store.
  6. Click Create.

    This data connection only supports per-bucket operations. For information on using the S3 data connection, and connection wrapper for the S3 boto client, see Amazon Boto documentation.

    The data connection is available to users by default. To change availability, click the Available toggle. This switch determines if the data connection is displayed in Projects created within the workspace.