Cloudera Runtime Release Notes

Impala

You can review the list of reported issues and their fixes for Impala in 7.3.1.100.

CDPD-78366: Session management for Hue logins with Impala
When logging into the Hue web UI, users occasionally encountered the following error:
The user authorized on the connection 'hue/gateway0.xyz.site@XYZ.SITE' does not match the session username 'hue/gateway1.xyz.site@XYZ.SITE'

This issue arose when a Hue user’s request was routed to a Hue backend other than the one that initiated the original HS2 session with Impala. With Kerberos authentication, Impala restricted session reuse across different hosts or realms, even if the user remained the same.

To address this, proxy clients like Hue can now reuse HS2 sessions across multiple hosts or realms. This is allowed because these proxy clients are trusted. The change improves session management and gives a smoother user experience when using Hue.

Apache Jira: IMPALA-11298

CDPD-78368: Executor crash during runtime filter generation
Enabling the MIN_MAX runtime filter in nested loop joins caused out-of-bounds access for certain queries. The resulting executor crashes prevented any queries from running and disrupted production.
The issue is resolved by ensuring ScalarExprEvaluator properly calls the Open() function, preventing out-of-bounds access and stabilizing query execution.

Apache Jira: IMPALA-12582

CDPD-78364: Impalad crash due to inconsistent tuple IDs during query execution
Impalad could crash in RowDescriptor::InitTupleIdxMap() when a query carried inconsistent tuple IDs. The fix avoids the crash by verifying the tuple IDs in the descriptor table received on the executor side.

Apache Jira: IMPALA-13378

CDPD-78170: Boolean literals in OR conditions are not simplified as expected
Impala failed to simplify expressions such as "id = 0 OR false" to "id = 0". The root cause was that the CompoundPredicate generated by NormalizeExprsRule was not analyzed, which prevented SimplifyConditionalsRule from applying the expected rewrite.
This fix addresses the issue by ensuring that the rewritten CompoundPredicate is analyzed properly, allowing boolean literals in AND/OR conditions to be simplified as expected.
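The rewrite that this fix re-enables can be illustrated with a minimal sketch. This is not Impala's implementation: the `Pred`, `Or`, and `And` classes and the `simplify` function are hypothetical names introduced only to show how boolean literals drop out of AND/OR conditions.

```python
from dataclasses import dataclass
from typing import Union


@dataclass(frozen=True)
class Pred:
    """Opaque predicate such as "id = 0" (hypothetical helper type)."""
    text: str


@dataclass(frozen=True)
class Or:
    left: "Expr"
    right: "Expr"


@dataclass(frozen=True)
class And:
    left: "Expr"
    right: "Expr"


Expr = Union[bool, Pred, Or, And]


def simplify(expr):
    """Remove boolean literals from AND/OR conditions, bottom-up."""
    if isinstance(expr, Or):
        left, right = simplify(expr.left), simplify(expr.right)
        if left is True or right is True:
            return True          # x OR true  -> true
        if left is False:
            return right         # false OR x -> x
        if right is False:
            return left          # x OR false -> x
        return Or(left, right)
    if isinstance(expr, And):
        left, right = simplify(expr.left), simplify(expr.right)
        if left is False or right is False:
            return False         # x AND false -> false
        if left is True:
            return right         # true AND x  -> x
        if right is True:
            return left          # x AND true  -> x
        return And(left, right)
    return expr


# "id = 0 OR false" reduces to "id = 0".
print(simplify(Or(Pred("id = 0"), False)))
```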

Apache Jira: IMPALA-13203

CDPD-78177: Conjunct registration issue during query rewrite in Impala
This issue, introduced in Cloudera Base on premises 7.1.9, caused some queries with expressions that could be rewritten to false to fail with an IllegalStateException: Illegal reference to non-materialized slot.
The issue was addressed by reverting an optimization that skipped registering certain conjuncts, ensuring proper registration and assignment during query analysis.

Apache Jira: IMPALA-13302

CDPD-78353: Failed table loads not retried after metastore recovery
Tables failed to load when the metastore was down, and queries continued to fail even after the metastore was back up, requiring manual invalidation.
The system now automatically retries loading failed tables when a query is executed, ensuring successful loads once the metastore is up, without manual intervention.

Apache Jira: IMPALA-13120

CDPD-78175: Incorrect results due to predicate pushdown in join condition
Impala could incorrectly push predicates to scan nodes, causing wrong results in some join scenarios. For example, a query that should return 0 rows could return rows when specific predicates were used.
The fix removes analytic predicates with self-referencing TupleIds during query execution, which prevents the incorrect pushdown.

Apache Jira: IMPALA-13262

CDPD-78351: Support x5c Parameter in JSON Web Keys (JWK)
The Impala coordinator failed to parse a JSON Web Key (JWK) set that used the x5c parameter, and as a result the coordinator failed to start. This issue is now resolved.

Apache Jira: IMPALA-12559

CDPD-78564: Partition value mismatch during INSERT event processing
Impala failed to process some INSERT events due to partition value mismatches. Hive encodes partition strings in file paths using URL encoding, but partition strings in HMS events are not encoded. While processing HMS events, Impala decoded these unencoded partition strings, which produced mismatched partitions and caused event processing to fail.
The fix decodes partition strings only when they are derived from file paths, not when they come from HMS events, to ensure accurate partition mapping.
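The mismatch is easy to reproduce in isolation. The sketch below is illustrative only and assumes Python's `urllib.parse` quoting as a stand-in for Hive's path escaping, which may escape a different character set; the function names and the sample value are hypothetical.

```python
from urllib.parse import quote, unquote


def value_from_path(segment: str) -> str:
    """Partition value taken from a path segment like "p=10%2525": decode it."""
    _, _, encoded = segment.partition("=")
    return unquote(encoded)


def value_from_event(value: str) -> str:
    """Partition value taken from an HMS event: already raw, use it as-is."""
    return value


raw = "10%25"                           # literal partition value containing "%25"
path_segment = "p=" + quote(raw, safe="")

# Correct handling: both sources agree on the raw value.
assert value_from_path(path_segment) == raw
assert value_from_event(raw) == raw

# The bug pattern: decoding an event value that was never encoded.
print(unquote(raw))                     # no longer equal to the raw value
```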

Apache Jira: IMPALA-13691

CDPD-78559: Prometheus metric name incompatibility with JDK11 and JDK17
Impala metrics derived from JVM MemoryPoolMXBean names contain apostrophe characters in JDK11 and JDK17. Since Prometheus does not support apostrophes in metric names, these metrics cannot be consumed by Prometheus.
To address the issue, apostrophes in Prometheus metric names are now automatically translated to underscores, ensuring compatibility with Prometheus requirements.
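As a rough illustration, the translation can be sketched as follows. This is not Impala's code: `to_prometheus_name` is a hypothetical helper, and it is deliberately broader than the fix described above, replacing every character outside the set Prometheus allows in metric names (`[a-zA-Z_:][a-zA-Z0-9_:]*`), not only apostrophes. The sample pool name is one that JDK 11 reports for the segmented code cache.

```python
import re

_INVALID_CHARS = re.compile(r"[^a-zA-Z0-9_:]")


def to_prometheus_name(name: str) -> str:
    """Map an arbitrary metric name onto the Prometheus metric-name alphabet."""
    cleaned = _INVALID_CHARS.sub("_", name)
    # A metric name must not start with a digit.
    if cleaned and cleaned[0].isdigit():
        cleaned = "_" + cleaned
    return cleaned


# Apostrophes, spaces, and hyphens all become underscores.
print(to_prometheus_name("CodeHeap 'non-nmethods'"))
```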

Apache Jira: IMPALA-13638

CDPD-78161: Database deletion during metadata fetch
Running SHOW DATABASES in Impala while simultaneously dropping a database in Hive caused an InconsistentMetadataFetchException due to missing database metadata.
Exception handling is improved for operations that list databases and tables. If a database is dropped during a metadata fetch, the operation now handles the missing database.

Apache Jira: IMPALA-13170

CDPD-78164: Prevent Impala executor crash on restart
Impala crashed when executors received queries with zero fragment instances due to network issues.
Executors now ignore queries with zero fragment instances, and coordinators ensure they do not send such queries.

Apache Jira: IMPALA-13107

CDPD-78363: DDL hangs with SYNC_DDL when Catalogd switches to standby
When Catalogd changed from active to standby while waiting for the SYNC_DDL version, it stopped receiving catalog topic updates from the statestore. This caused DDL queries to hang indefinitely.
To address the issue, Catalogd now regenerates its service ID when changing to standby and throws an exception if the service ID changes while waiting for the SYNC_DDL version.

Apache Jira: IMPALA-13134

CDPD-78365: Queries canceled after statestore failover
Queries were canceled after a statestore failover due to temporary inconsistencies. While a grace period was already in place for statestore restarts, failovers were not handled the same way.
The fix applies the existing post-recovery grace period to statestore failovers, preventing query cancellations caused by temporary disruptions.

Apache Jira: IMPALA-13159

CDPD-78563: Configurable TCP keepalive for client connections
Client connections could be lost without an explicit close because of machine resets, network disruptions, or load balancer idle timeouts, leading to resource leaks in Impala.
To address this, new startup options allow you to enable and configure TCP keepalive settings for client connections. These options help detect and close dead connections, preventing premature disconnections.
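For readers unfamiliar with TCP keepalive, the sketch below shows the underlying socket options in general terms. It does not show Impala's startup options, which are not named here; the `enable_keepalive` helper and its default values are assumptions chosen for illustration, and the per-socket tuning constants are only available on some platforms.

```python
import socket


def enable_keepalive(sock, idle_s=60, interval_s=10, probes=5):
    """Turn on TCP keepalive and, where supported, tune the probe timing."""
    sock.setsockopt(socket.SOL_SOCKET, socket.SO_KEEPALIVE, 1)
    tuning = (
        ("TCP_KEEPIDLE", idle_s),       # idle seconds before the first probe
        ("TCP_KEEPINTVL", interval_s),  # seconds between unanswered probes
        ("TCP_KEEPCNT", probes),        # failed probes before the peer is declared dead
    )
    for name, value in tuning:
        option = getattr(socket, name, None)  # not defined on every platform
        if option is None:
            continue
        try:
            sock.setsockopt(socket.IPPROTO_TCP, option, value)
        except OSError:
            pass  # the platform defines the constant but rejects the option


sock = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
enable_keepalive(sock)
keepalive_on = sock.getsockopt(socket.SOL_SOCKET, socket.SO_KEEPALIVE) != 0
sock.close()
print(keepalive_on)
```

With keepalive enabled, a peer that vanished without closing the connection stops answering probes, so the server side can detect the dead connection and release its resources.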

Apache Jira: IMPALA-13253

CDPD-78162: Handling invalid column stats in table loading
Tables with illegal column stats failed to load, preventing operations like DROP STATS or DROP TABLE.
Impala now normalizes invalid stats to -1 and logs warnings, allowing table recovery.
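A minimal sketch of this kind of normalization is shown below. It assumes that -1 stands for "unknown" and that a value is invalid when it is below -1; the release note does not spell out the exact validity rule, and `normalize_stat` is a hypothetical helper, not Impala's code.

```python
import logging

log = logging.getLogger("column_stats")

UNKNOWN = -1  # assumed sentinel for "statistic not available"


def normalize_stat(name: str, value: int) -> int:
    """Return the value unchanged if valid, otherwise log a warning and return -1."""
    if value < UNKNOWN:
        log.warning("Invalid column stat %s=%d, normalizing to %d", name, value, UNKNOWN)
        return UNKNOWN
    return value


stats = {"num_nulls": -100, "num_distinct_values": 42, "max_size": -1}
normalized = {name: normalize_stat(name, value) for name, value in stats.items()}
print(normalized)
```

Because the table still loads with the normalized values, statements such as DROP STATS or DROP TABLE can then be used to clean up.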

Apache Jira: IMPALA-13102