Setting up a data lake
Setting up a data lake involves meeting the prerequisites, registering external resources in Cloudbreak, and creating a data lake. Once a data lake is running, you can create workload clusters attached to the data lake.
Refer to the following table to learn more about the data lakes prerequisites:
Step | Where to perform |
---|---|
Review available data lake blueprints and select one that you would like to use. | Documentation or Cloudbreak web UI |
Meet the prerequisites:
|
You must create these resources on your own, outside of Cloudbreak. You may use one database instance and create two databases. |
Register the two databases and LDAP | In the Cloudbreak web UI > External Sources |
Create a data lake | In the Cloudbreak web UI > Create cluster |
Create clusters attached to the data lake | In the Cloudbreak web UI > Create cluster |