Changes after upgrading
To locate and use your Apache Hive 3.x tables after an upgrade, you need to understand the changes that occur during the upgrade process. Changes to the location of tables, permissions to HDFS directories, table types, ACID-compliance occur, and other changes occur.
Hive changes to table references using dot notation
Upgrading to CDP includes the Hive-16907 bug fix, which rejects
`db.table`
in SQL queries. The dot (.) is not allowed in table names. To
reference the database and table in a table name, both must be enclosed in backticks as
follows: `db`.`table`
.
Hive changes to ACID properties
HDP Hive 2.x and 3.x can have transactional and non-transactional tables. Transactional tables have atomic, consistent, isolation, and durable (ACID) properties. In Hive 2.x, the initial version of ACID transaction processing is ACID v1. In Hive 3.x, the mature version of ACID is ACID v2, which is the default table type in HDP 3.0.
Native and non-native storage formats
Storage formats are a factor in upgrade changes to table types. Hive 2.x and 3.x supports the following Hadoop native and non-native storage formats:
- Native: Tables with built-in support in Hive, such as those in the following file
formats:
- Text
- Sequence File
- RC File
- AVRO File
- ORC File
- Parquet File
- Non-native: Tables that use a storage handler, such as the DruidStorageHandler or HBaseStorageHandler
Upgrade changes to table types
HDP 2.x | HDP 3.x | ||||
---|---|---|---|---|---|
Table Type | ACID v1 | Format | Owner (user) of Hive Table File | Table Type | ACID v2 |
External | No | Native or non-native | hive or non-hive | External | No |
Managed | Yes | ORC | hive or non-hive | Managed, updatable | Yes |
Managed | No | ORC | hive | Managed, updatable | Yes |
non-hive | External, with data delete | No | |||
Managed | No | Native (but non-ORC) | hive | Managed, insert only | Yes |
non-hive | External, with data delete | No | |||
Managed | No | Non-native | hive or non-hive | External, with data delete | No |
Removal of Hive View and Tez View
CDP does not include Hive View or Tez View. In lieu of these capabilities, users who upgrade to CDP can use Data Analytics Studio.