Loading CSV Data into an Impala Table
For this demonstration, we will be using the tips.csv dataset.
- Create a new Cloudera Data Science workbench project.
-
Create a folder called
data
and upload tips.csv to this folder. - The next steps require access to services on the CDH cluster. If Kerberos has been enabled on the cluster, enter your credentials (username, password/keytab) in Cloudera Data Science Workbench to enable access.
- Navigate back to the project Overview page and click Open Workbench.
- Launch a new session (Python or R).
-
Open the Terminal.