Fixed Issues in Atlas

Review the list of Apache Atlas issues that are resolved in Cloudera Runtime 7.2.17.

CDPD-54852: Backward compatibility for check provided for AttributeName in Parent and Child TypeDef
This patch provides backward compatibility for 2 changes mentioned: https://issues.apache.org/jira/browse/ATLAS-3872 Restrict typedef creation when a child type attribute conflicts with parent type attribute of same name https://issues.apache.org/jira/browse/ATLAS-4522 Updating typedef with new supertype should be allowed only if attributes are unique compared to other existing supertypes
CDPD-53717: Failed to extract ADLS Meta Data in Azure cdp_atlas_nat run
Fixed failed to extract ADLS Meta Data in Azure cdp_atlas_nat run
CDPD-50914: Atlas - Upgrade reactor-netty to 1.0.24+ due to CVE-2022-31684
Upgrade reactor-netty to 1.0.24+ due to CVE-2022-31684
CDPD-49792: Dynamic Index Recovery issues and improvements
Merged in CDH-7.2.17.x
CDPD-48720: The attributes of __AtlasAuditEntry type where not getting indexed in the Solr, because the entities of __AtlasAuditEntry gets created before the attributes gets indexed (TypeDef gets created).
This issue is resolved.
CDPD-49053: Atlas - Upgrade Tinkerpop to 3.5.4
Upgrade Tinkerpop to 3.5.4
CDPD-48166: Atlas - Upgrade snakeyaml due to CVE-2022-1471
Upgrade snakeyaml due to CVE-2022-1471
CDPD-48090: Atlas - Upgrade icu4j to 66.1+ due to CVE-2020-21913
Upgrade icu4j to 66.1+ due to CVE-2020-21913
CDPD-48034: Atlas - Upgrade jettison to 1.5.2 due to CVE-2022-45685 and CVE-2022-45693
Upgrade jettison to 1.5.2 due to CVE-2022-45685 and CVE-2022-45693
CDPD-48026: Atlas - Upgrade Woodstox to 5.4.0/6.4.0 due to multiple CVEs
Upgrade Woodstox to 5.4.0/6.4.0 due to multiple CVEs
CDPD-47988: Atlas - Upgrade Netty to 4.1.86.Final due to CVE-2022-41881, CVE-2022-41915
Upgrade Netty to 4.1.86.Final due to CVE-2022-41881, CVE-2022-41915
CDPD-47912: Atlas - Upgrade moment.js to 2.29.4 due to CVE-2022-24785, CVE-2022-31129
Updated moment.js version to 2.29.4
CDPD-46519: aws-s3-extractor.sh fails with ClassNotFoundException: com.fasterxml.jackson.core.util.JacksonFeature
Fixed aws-s3-extractor.sh Atlas script fails in 7.2.16 AWS test runs
CDPD-46409: Atlas - Upgrade Spring-security to 5.6.9/ 5.7.5 due to CVE-2022-31692, CVE-2022-31690
Upgrade Spring-security to 5.7.5 due to CVE-2022-31692, CVE-2022-31690
CDPD-45097: Atlas - Prevent re-indexing of deleted relationship edges to improve entity-update execution process
Implementing this fix along with fix in CDPD-36992 together would ensure that deleted relationship edges are not reattempted for deletion and re-indexing is not done for such edges. As a result, execution time for messages piling on the Atlas hook (Kafka lag) would be reduced drastically leading to significant performance improvement.
CDPD-44811: [PbC] Atlas - Replace log4j 1.x with log4j2
Merge to CDH 7.2.17 (7.2.17.x), pre-cdpd-master (7.2.17) branch,
CDPD-44104: Every hive insert generates an Atlas audit event
As per the current architecture, Atlas maintains audit data for all DML events also. By introducing "DML audit filter" feature, Atlas can skip all the DML event processing which improves performance by reducing unnecessary processing efforts from Atlas. Eventually, there will not be a need to store any such audit events which will reduce use of storage. The DML audit filter will be enabled by default with the below explained configuration. If the user wants to have audit data for DML events, it can be configured using the same configuration. Filtering DML events will be handled by Atlas hook, DML audit filter configuration should be enabled at Hive hook. Note that DML Audit filter is enabled by default, set to false to disable it Hive configuration Property : Hive Service Advanced Configuration Snippet (Safety Valve) for atlas-application.properties Configuration value : atlas.hook.hive.skip.dml.messages=true
CDPD-40822: Atlas - Upgrade Spring Framework to 5.3.20 due to CVE-2022-22971, CVE-2022-22970
Upgrade Spring Framework to 5.3.20 due to CVE-2022-22971, CVE-2022-22970
CDPD-36991: Backward compatibility for check provided for AttributeName in Parent and Child TypeDef
This patch provides backward compatibility for 2 changes mentioned: https://issues.apache.org/jira/browse/ATLAS-3872 Restrict typedef creation when a child type attribute conflicts with parent type attribute of same name https://issues.apache.org/jira/browse/ATLAS-4522 Updating typedef with new supertype should be allowed only if attributes are unique compared to other existing supertypes
CDPD-35438: When classification is created with multiple super types having same attributes , Atlas doesn't throw an exception
This patch restricts classification typedef creation when a child type attribute conflicts with parent type attribute of same name.
CDPD-31728: Audits : For db create , there are 2 update audits instead of 1 create 1 update
This patch will check for the Shell entities created and then assign the Entity status to the Audit entry.
CDPD-29307: Kafka producer entity stays in incomplete state in Atlas
The Kafka-Atlas plugin now fully creates Producer and Consumer entities and won't generate incomplete ones.

Apache patch information

  • ATLAS-4746
  • ATLAS-4576
  • ATLAS-4743
  • ATLAS-4736
  • ATLAS-4733
  • ATLAS-4727
  • ATLAS-4719
  • ATLAS-4622
  • ATLAS-4735
  • ATLAS-4679
  • ATLAS-4655
  • ATLAS-4668
  • ATLAS-4335
  • ATLAS-4666
  • ATLAS-4464
  • CDPD-56588 : [Atlas] test_purge_entity_api failed
  • CDPD-56198: Regression: Atlas logs are not generated
  • CDPD-55533: [Regression] hive_process and hive_process_execution (lineage) being generated for simple DML UPDATE queries run via hive
  • CDPD-53364: Reinstate Atlas Docker images for 7.2.17
  • CDPD-55282: Atlas - 7.2.17 - Update NOTICE files
  • CDPD-53717 : Failed to extract ADLS Meta Data in Azure cdp_atlas_nat run
  • CDPD-49792: Dynamic Index Recovery Improvments
  • CDPD-44811: Atlas - Replace log4j 1.x with log4j2
  • CDPD-50412: ATLAS-4733 : Download Basic and DSL search results
  • CDPD-49302 : [PbC] Expose Iceberg tables in Atlas
  • CDPD-49495: Fixed OOM Issue with DSL search caching
  • CDPD-44104 : Every hive insert generates an Atlas audit event
  • CDPD-54630 : ATLAS-4743: UI:Remove unused libraries from build package
  • CDPD-53904: UI-Sync dashboardV2/Header.js with Apache master
  • CDPD-50682: Fixed the Regression where Data Migration root URI was not getting redirected to the migration status page
  • CDPD-48936: Fixed default value for atlas.entity.audit.differential to true
  • CDPD-51876: ATLAS-4736 : Audit tab reporting a date of 1970 in a few properties.
  • CDPD-50639 : Messages are sent to ATLAS_HOOK_UNSORTED but not seen in ATLAS_HOOK
  • CDPD-48788 : Provide HA support for REST notification server
  • CDPD-48789 : Provide Image support for REST notification server This commit does not contain secrets
  • CDPD-48790 : PAM authentication not supported by REST notification server. This commit does not contain secrets
  • CDPD-46359: HiveProcess - Output entity creation is ignored in case of same entity exists both inputs and outputs
  • CDPD-48026 : Upgrade Woodstox to 5.4.0
  • CDPD-47912 : (UI)Upgrade moment.js to 2.29.4 due to CVE-2022-24785, CVE-2022-31129 in 7.2.16.x
  • CDPD-49053 : ATLAS-4728 - Upgrade Tinkerpop to 3.5.4
  • CDPD-49393 : Atlas is failing in P2 phase for CDH 7.2.16.1
  • CDPD-47599: import-hbase.sh script fails with UriBuilder Exception
  • CDPD-47908 : UI:Upgrade Swagger to the latest version(Upgrade DOMPurify to the latest version to avoid Security issues): This commit does not contain secrets.
  • CDPD-36991 : ATLAS-4576 Backward compatibility for check provided for AttributeName in Parent and Child TypeDef
  • CDPD-48720: ATLAS-4727: admin/audits , admin/purge APIs fail with '[__AtlasAuditEntry.startTime] is not indexed in the targeted index [vertex_index]'
  • CDPD-48684: Lineage is missing for CTAS tables created via impala-shell/beeline
  • CDPD-47603: Added Atlas Preprocessor at SerialEntityProcessor to handle S3 V2 directory objectPrefix
  • CDPD-46688 : Provide HA support for REST notification server
  • CDPD-46687 : Provide Image support for REST notification server This commit does not contain secrets
  • CDPD-48166 : ATLAS-4723 - Upgrade snakeyaml to 1.33
  • CDPD-48034 : Atlas - Upgrade jettison to 1.5.2
  • CDPD-48122 : admin/audits , admin/purge fail with '[__AtlasAuditEntry.startTime] is not indexed in the targeted index [vertex_index]'
  • CDPD-47988 : Upgrade Netty to 4.1.86.Final
  • CDPD-47173: Log4j migration: Replace DailyRollingFileAppender with RollingFileAppender
  • CDPD-47813 : Atlas - Do the fix for CVE-2022-34271
  • CDPD-46339 : ATLAS-4695:UI: Clicking on term assigned to deleted entity in entity details page results in error.
  • CDPD-40822: Atlas - Upgrade Spring Framework in REST Notification module
  • CDPD-47435 : ATLAS-4715 :(UI) Option to delete assigned term from deleted entity is visible
  • CDPD-25084: Implemented concurrent ingest
  • CDPD-46686 : PAM authentication not supported by REST notification server. This commit does not contain secrets
  • CDPD-47307 : Term assigned to historical entity is not visible in Search results page
  • CDPD-47356: REST Notification - messages from same source with same creation time are getting ignored
  • CDPD-42014 : Upgrade jackson-databind to 2.12.7.1
  • CDPD-46292 : Updating NOTICE file for cdpd-master
  • CDPD-46519 : aws-s3-extractor.sh fails with ClassNotFoundException: com.fasterxml.jackson.core.util.JacksonFeature
  • CDPD-46569 : Atlas - Upgrade Apache Ivy to 2.5.1 due to CVE-2022-37865, CVE-2022-37866
  • CDPD-46492 : ATLAS-4710 : (UI)Relationship Attribute Filter button keeps loading
  • CDPD-45331 : Embedded Server to host HTTP REST endpoint to handle notification messages This commit does not contain secrets
  • CDPD-46409 : Upgrade Spring-security to 5.7.5
  • CDPD-46089 : ATLAS-4677 - Atlas client lib throws - NoClassDefFoundError: org/apache/commons/configuration2/Configuration
  • CDPD-43109 : ATLAS-4693,ATLAS-4661,ATLAS-4613 : (UI)Switching within Terms page does not land you in the same tab
  • CDPD-43820 : Upgrade azure-storage-blob to 12.18.0
  • CDPD-31728 : ATLAS-4666 : Intermittently, the audits for creation of hive_db registered are different than expected
  • CDPD-46074: ATLAS-4689:UI: Basic Search: Invalid attributes passed in request
  • CDPD-34249: REST Notification HOOK side - various hook topics patch
  • CDPD-45763 : Atlas - Upgrade Apache Commons Text to 1.10.0 due to CVE-2022-42889
  • CDPD-45671 :ATLAS-4691: Discrepancy in the atlas debug metrics between the active and the not active servers
  • CDPD-45375 : Upgrade snakeyaml to 1.32
  • CDPD-44695 : Regression : adls-extraction fails with ClassNotFoundException: com.fasterxml.jackson.core.util.JacksonFeature
  • CDPD-35438 : ATLAS-4668 : When classification is created with multiple super types having same attributes , Atlas doesn't throw an exception
  • CDPD-44508: ATLAS-4674: Regression: Classification tagging is not sending appropriate notification to ATLAS_ENTITIES
  • CDPD-45576 : Upgrade jettison to 1.5.1
  • CDPD-45221: ATLAS-4681: Relationship Search : SortBy is not working
  • CDPD-45097 : Atlas kafka topic lag is very high and is not decreasing
  • CDPD-35212: ATLAS:4558: Fix AtlasRepairIndex tool when tls is enabled