Known Issues in Apache Hive
Learn about the known issues in Hive, the impact or changes to the functionality, and the workaround.
- TSB-732 2024: Incorrect results are generated by Hive JOIN when bloom filter is activated
- The bloom filter implemented in HIVE-23880 was designed to enhance performance for queries with JOIN statements, where one small table and another significantly larger table is joined on partition keys. However, the bloom filter introduced an issue in Apache Hive (Hive), when dynamic semijoin redaction is involved that generates incorrect query results. This issue is corrected in HIVE-26655.
- Upstream JIRA
- Hive-23880(cause)HIVE-26655(fix)
- Knowledge article
- For the latest update on this issue see the corresponding Knowledge article: TSB 2024-732: Incorrect results are generated by Hive JOIN when bloom filter is activated
- CDPD-40730: Parquet change can cause incompatibility
- Parquet files written by the parquet-mr library in CDP 7.2.16, where the schema contains
a timestamp with no UTC conversion will not be compatible with older versions of Parquet
readers. The effect is that the older versions will still consider these timestamps as
they would require UTC conversions and will thus end up with a wrong result. You can
encounter this problem only when you write Parquet-based tables using Hive, and tables
have the non-default configuration
hive.parquet.write.int64.timestamp=true
. - CDPD-60770: Beeline Authentication Issue with Special Characters in Passwords
- When LDAP is enabled, users cannot authenticate with Beeline if
the password contains a special character. For example, the following string
fails:
beeline -u jdbc:hive2://<host>:<port>/<dbName>;user=user@XXX;password='R3G#xpXyoy1MOJb1'
- CDPD-15518: ACID tables you write using the Hive Warehouse Connector cannot be read from an Impala virtual warehouse.
- Read the tables from a Hive virtual warehouse or using Impala queries in Data Hub.
- CDPD-10848: HiveServer Web UI displays incorrect data
- If you enabled auto-TLS for TLS encryption, the HiveServer2 Web UI does not display the correct data in the following tables: Active Sessions, Open Queries, Last Max n Closed Queries