How to manage Unknown Data Objects (UNK)

Encountering Unknown Data Types (UNK) for certain Data Objects in Cloudera Octopai is a common issue that can occur when an object has not been extracted and uploaded to Cloudera Octopai, but appears in the lineage due to its association with a script.

When viewing the lineage in Cloudera Octopai, you might come across Data Objects that are displayed as Unknown Data Type (UNK). This happens when Cloudera Octopai lacks sufficient information to determine the type of the Data Object, even though it is directly related to a Report or ETL. Cloudera Octopai fails to recognize the associated Database Name and Schema through their keys, resulting in the UNK classification.

  1. Click the Information i button in the object radial button, and copy the object name from the Properties section.
  2. Paste the object name in the Cloudera Octopai Discovery space.
    Figure 1. Discovery space
    You can identify that the Unknown Data Object is part of SSIS scripts but was not physically uploaded to Cloudera Octopai during the Metadata extraction process.
    To resolve this issue, perform the following steps:
    1. Verify the permissions on the Database associated with the object, based on the tool prerequisites.
    2. Take note of the path of the Report or ETL shown in the Data Object Properties section and paste it to the Admin Console > Connection Parameters tab. The example is valid for ETL.
      Figure 2. Path in the Properties section
      Figure 3. Switch to Admin Console
      Figure 4. Admin Console
      Figure 5. Add the connection parameters
    3. An Cloudera Octopai night job will handle the rest.
  3. Edit bulk connection parameters.
    1. Export the list to Excel.
    2. Complete the Reports - Database and Schemaand ETL - Database missing data in the Excel file.
    3. Upload the modified file to Cloudera Octopai.
    4. An Cloudera Octopai night job will handle the rest.

If you still encounter UNK objects, contact Cloudera Support.