Apache Atlas Reference

Apache Atlas provides comprehensive metadata management capabilities for Cloudera Runtime. Atlas collects, catalogs, and governs metadata from various data sources including Hive, HBase, Kafka, Spark, NiFi, and Schema Registry. This reference documentation covers search capabilities, attribute definitions, metadata collection methods, and migration information.

Search and Query

Apache Atlas Advanced Search Language Reference

Provides reference information for the Atlas domain-specific search language with SQL-like syntax.

Metadata and Attributes

Apache Atlas Statistics Reference

Provides reference information for entity and server statistics collected by Atlas.

Apache Atlas Metadata Attributes

Provides reference information for attribute types including technical, system, business metadata, and classification attributes.

Defining Apache Atlas Enumerations

Provides reference information for creating and using enumerations as attribute values.

Configuration and Management

Dynamic Handling of Failure in Updating Index

Provides reference information for JanusGraph transaction recovery and write-ahead configuration.

Purging Deleted Entities

Provides reference information for permanently removing deleted entities using REST API calls.

Migration

Apache Atlas Technical Metadata Migration Reference

Provides comprehensive reference for migrating Cloudera Navigator metadata to Atlas entities.

Metadata Collection

NiFi Metadata Collection

Provides reference information for the ReportLineageToAtlas reporting task and NiFi flow lineage.

HiveServer Metadata Collection

Provides reference information for Atlas metadata collection from HiveServer including queries and data assets.

HBase Metadata Collection

Provides reference information for Atlas metadata collection from HBase data assets.

Schema Registry Metadata Collection

Provides reference information for integrating Schema Registry with Atlas to persist and view schemas.

Impala Metadata Collection

Provides reference information for Atlas metadata collection from Impala queries and Hive Metastore.

Kafka Metadata Collection

Provides reference information for Atlas metadata collection from Kafka using metadata namespaces.

Spark Metadata Collection

Provides reference information for the Spark Atlas Connector and metadata collection from Spark operations.

Atlas REST API

Apache Atlas REST API Reference

Provides Apache Atlas REST API reference information.