Behavioral Changes
Behavioral changes denote a marked change in behavior from the previously released version of the software to this version. In HDP 2.5.0, behavioral changes affect the following Hadoop components.
Table 1.15. Behavioral Changes
| Hortonworks Bug ID | Apache Component | Apache JIRA | Summary | Details |
|---|---|---|---|---|
| N/A | Falcon | N/A | Berkeley DB JAR file must be downloaded | Starting with the HDP 2.5.0 release, customers must obtain the Berkeley DB JAR file (available under an open source license from Oracle) as part of a Falcon installation or upgrade. See the Data Movement and Integration guide for more information. |
| BUG-59164 | Hive | N/A | Data type conversions are different between Hive 1 and Hive 2 | Component Affected: Hive. Scenario: In Hive 2, the table column data type conversion performed with ALTER TABLE CHANGE COLUMNS and ALTER TABLE REPLACE COLUMNS was changed. Previous Behavior: Hive 1 was very permissive and allowed changing from any primitive data type to any other primitive data type with the DDL ALTER statement. New Behavior: Hive 2 is more restrictive. By default, it allows only a small set of safe conversions, where the data type being converted into is larger and can handle more values. For example, changing a table column data type from INT to BIGINT is safe because BIGINT can represent more values than INT. In Hive 2, when the configuration property hive.metastore.disallow.incompatible.col.type.changes is set to false, all conversions are allowed; the default for hive.metastore.disallow.incompatible.col.type.changes is true, which permits only the safe conversions. These are the safe conversions: STRING_GROUP to DOUBLE; STRING_GROUP to DECIMAL; DATE_GROUP to STRING_GROUP; NUMERIC_GROUP to STRING_GROUP; STRING_GROUP to STRING_GROUP; and SMALLINT -> INT -> BIGINT -> FLOAT -> DOUBLE -> DECIMAL. The last chain shows the increasing value ranges of the numeric types: for example, INT to FLOAT is a valid progression, but FLOAT to INT is not. STRING_GROUP is STRING, CHAR(n), or VARCHAR(n); NUMERIC_GROUP is SMALLINT, INT, BIGINT, FLOAT, DOUBLE, and DECIMAL; DATE_GROUP is DATE and TIMESTAMP. A hedged JDBC sketch of this behavior follows this table. |
| BUG-60199 | Ranger | N/A | Ranger API change of behavior for HDP 2.4.0 | Component Affected: Ranger admin. Scenario: The search filter did not work as expected when the expected result lay beyond the first page. For example, with a page size of 1, http://localhost:6080/service/public/api/policy?repositoryName=Sandbox_hadoop&resourceName=test&pageSize=1 did not return the matching policy. Previous Behavior: A search for a policy by name was limited by the page size: filtering happened after retrieving the first n policies, where n is the pageSize, so the search returned no records if the matching policy was farther down the list. Results were as expected only if pageSize was large enough to include the policies being searched for. New Behavior: The search runs across all policies. The new pagination implementation fetches all matching results first and applies the pageSize limit afterward. A hedged Java sketch of this REST call follows this table. |
| BUG-60495 | Hive | HIVE-14022 | Left semi join should throw SemanticException if WHERE clause contains column name from right table | Scenario: A left semi join provides predictable results but exposes columns from only one table. Previous Behavior: You could specify a left semi join while at the same time accessing columns of the "right" table outside the join condition. New Behavior: Such queries now throw a SemanticException; referencing right-table columns in the WHERE clause is no longer possible. You might experience this as a regression, and affected queries must be changed to move forward. A hedged rewrite example follows this table. |
| BUG-61629 | Zeppelin | N/A | Interpreters are now available in the top right corner, as a dropdown from the user Login button | Component Affected: Zeppelin JDBC interpreters. Scenario: Configuration of the JDBC (generic), Spark, Livy, and Shell interpreters. Previous Behavior: The interpreter configuration was under the Notebook drop-down, and the MySQL, PgSQL, Hive, and Phoenix interpreters had to be configured separately. New Behavior: The interpreter configuration has moved to the top right of the page, and the JDBC interpreters described above share a generic JDBC interpreter configuration, which additionally requires the user to provide the driver class (for example, org.apache.hadoop.hive.jdbc.HiveDriver) as part of the interpreter properties. |
| BUG-62267 | Storm | N/A | Migrate APIs to org.apache.storm, but try to provide backwards compatibility as a bridge | Component Affected: Storm core / Trident APIs. Scenario: The package name changed from backtype.storm to org.apache.storm. Previous Behavior: You depended on storm-core to build topologies and imported the relevant classes in your code, for example: import backtype.storm.topology.BasicOutputCollector; New Behavior: With Apache Storm 1.0, all of the core and Trident classes moved from backtype.storm to org.apache.storm, so you can import the same storm-core and Trident API classes by using org.apache.storm instead of backtype.storm: import org.apache.storm.topology.BasicOutputCollector; Existing topologies can be deployed without code changes by adding the following configuration to storm.yaml: client.jartransformer.class: org.apache.storm.hack.StormShadeTransformer. A hedged example of the new imports follows this table. |
| BUG-63146 | Storm | N/A | Parameter type change in org.apache.storm.spout.Scheme | Component Affected: Storm. Scenario: Any user who implements the Scheme interface from Storm. Previous Behavior: A byte[] parameter was passed to the Scheme interface. New Behavior: A ByteBuffer is passed to the Scheme interface instead of a byte[]. See the following link for a code example: https://github.com/apache/storm/blob/1.x-branch/external/storm-kafka/src/jvm/org/apache/storm/kafka/StringScheme.java. A hedged sketch also follows this table. |
| RMP-4106 | Falcon | FALCON-1107 | Server Side extension infrastructure | Scenario: Mirroring jobs executed from the command line. Previous Behavior: The Falcon recipe tool was the client interface for executing mirroring jobs. New Behavior: The extension support in the Falcon CLI is used to execute mirroring jobs. See the upgrade documentation. |
| RMP-4486 | Atlas | N/A | HBase Integration | Component Affected: HBase. Scenario: HBase is now available as the default Atlas storage backend via Ambari. Previous Behavior: Using HBase as the storage backend was an undocumented feature. New Behavior: Atlas can be configured via Ambari to use either an Ambari-managed HBase instance or a custom HBase instance as its storage backend. |
| RMP-5035 | Ranger | N/A | Remove option to store audits in DB | Scenario: In Ranger audits, Audit to DB is no longer available; users currently using Audit to DB must migrate to Audit to Solr. See "Migrating Audit Logs from DB to Solr in Ambari Clusters" in the HDP Security Guide. Previous Behavior: Ranger audit could be configured to go to any of the following destinations: DB, Solr, and HDFS. New Behavior: Ranger audit can no longer be configured to the DB destination; only Solr and HDFS are available. During the upgrade to HDP 2.5, if you have not enabled Ranger audit to Solr, you must configure audit to Solr post-upgrade; otherwise, you will not see audit activity in the Ranger UI. You can use either an externally managed Solr or an Ambari-managed Solr; for details on configuring these, refer to the Solr audit configuration section of the installation guide. |
| RMP-5421 | Atlas | N/A | Hive Hook Phase II | Component Affected: Hive. Scenario: Added support for capturing metadata changes for a table, database, or column. Previous Behavior: Metadata changes were ignored or were known to have issues. New Behavior: Metadata changes made by the supported commands are now preserved; dataset lineage is captured in additional cases; and metadata is captured when tables and databases are dropped (DROP TABLE, DROP DATABASE). Support for the hive_partition entity is deprecated, and no lineage is captured for partitions of Hive tables. A few data model changes in the Hive metadata deprecate unused Hive types and normalize the data types for consistent metadata capture. |
| RMP-5498 | Atlas | ATLAS-491 | Business Taxonomy (Catalog) | Component Affected: Atlas. Scenario: Enhanced search and data management. New Behavior: Atlas provides a business taxonomy (catalog) capability in support of enhanced search and data management. |
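
For BUG-59164, here is a minimal JDBC sketch of the new Hive 2 behavior. The connection URL, credentials, and the demo_widen table are hypothetical; it assumes a reachable HiveServer2 with the hive-jdbc driver on the classpath and hive.metastore.disallow.incompatible.col.type.changes left at its default of true.

```java
import java.sql.Connection;
import java.sql.DriverManager;
import java.sql.Statement;

public class ColumnTypeChangeExample {
    public static void main(String[] args) throws Exception {
        // Hypothetical HiveServer2 endpoint; adjust host, port, and credentials.
        String url = "jdbc:hive2://localhost:10000/default";
        Class.forName("org.apache.hive.jdbc.HiveDriver");
        try (Connection conn = DriverManager.getConnection(url, "hive", "");
             Statement stmt = conn.createStatement()) {
            stmt.execute("CREATE TABLE IF NOT EXISTS demo_widen (id INT, amount FLOAT)");
            // Safe, widening conversion: every INT fits in a BIGINT, so Hive 2
            // accepts it under the default setting.
            stmt.execute("ALTER TABLE demo_widen CHANGE COLUMN id id BIGINT");
            // Unsafe, narrowing conversion: FLOAT -> INT can lose values, so
            // Hive 2 rejects it by default. Uncommenting this line raises an error.
            // stmt.execute("ALTER TABLE demo_widen CHANGE COLUMN amount amount INT");
        }
    }
}
```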
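For BUG-60199, this sketch issues the search query from the table row against the Ranger public API with pageSize=1. The host and the admin:admin credentials are placeholders, and it assumes the API accepts basic authentication; under the new behavior, the response should contain the matching policy even when it is not among the first pageSize policies overall.

```java
import java.io.BufferedReader;
import java.io.InputStreamReader;
import java.net.HttpURLConnection;
import java.net.URL;
import java.nio.charset.StandardCharsets;
import java.util.Base64;

public class RangerPolicySearch {
    public static void main(String[] args) throws Exception {
        // Search URL taken from the table row above.
        URL url = new URL("http://localhost:6080/service/public/api/policy"
                + "?repositoryName=Sandbox_hadoop&resourceName=test&pageSize=1");
        HttpURLConnection conn = (HttpURLConnection) url.openConnection();
        // Placeholder credentials, base64-encoded for basic authentication.
        String auth = Base64.getEncoder()
                .encodeToString("admin:admin".getBytes(StandardCharsets.UTF_8));
        conn.setRequestProperty("Authorization", "Basic " + auth);
        conn.setRequestProperty("Accept", "application/json");
        try (BufferedReader in = new BufferedReader(
                new InputStreamReader(conn.getInputStream(), StandardCharsets.UTF_8))) {
            String line;
            while ((line = in.readLine()) != null) {
                System.out.println(line); // JSON body with the filtered, paginated policies
            }
        }
    }
}
```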
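For BUG-60495, this sketch contrasts a query that Hive 2 now rejects with one possible rewrite that moves the right-table predicate into the join condition, which a left semi join does allow. The tables a and b and the connection URL are hypothetical.

```java
import java.sql.Connection;
import java.sql.DriverManager;
import java.sql.ResultSet;
import java.sql.Statement;

public class LeftSemiJoinRewrite {
    public static void main(String[] args) throws Exception {
        String url = "jdbc:hive2://localhost:10000/default"; // placeholder endpoint

        // Rejected in Hive 2: the WHERE clause references b.val, a column of the
        // right-hand table of the LEFT SEMI JOIN, so a SemanticException is thrown.
        // String rejected =
        //     "SELECT a.id FROM a LEFT SEMI JOIN b ON (a.id = b.id) WHERE b.val > 0";

        // One possible rewrite: fold the right-table predicate into the ON clause.
        String rewritten =
                "SELECT a.id FROM a LEFT SEMI JOIN b ON (a.id = b.id AND b.val > 0)";

        try (Connection conn = DriverManager.getConnection(url, "hive", "");
             Statement stmt = conn.createStatement();
             ResultSet rs = stmt.executeQuery(rewritten)) {
            while (rs.next()) {
                System.out.println(rs.getObject(1));
            }
        }
    }
}
```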
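For BUG-62267, this minimal bolt compiles against the relocated Storm 1.0 packages; only the import statements differ from a backtype.storm equivalent, and the bolt's logic is illustrative.

```java
// All imports now come from org.apache.storm; the class names are unchanged.
import org.apache.storm.topology.BasicOutputCollector;
import org.apache.storm.topology.OutputFieldsDeclarer;
import org.apache.storm.topology.base.BaseBasicBolt;
import org.apache.storm.tuple.Fields;
import org.apache.storm.tuple.Tuple;
import org.apache.storm.tuple.Values;

public class UppercaseBolt extends BaseBasicBolt {
    @Override
    public void execute(Tuple input, BasicOutputCollector collector) {
        // Illustrative logic: upper-case the first field and re-emit it.
        collector.emit(new Values(input.getString(0).toUpperCase()));
    }

    @Override
    public void declareOutputFields(OutputFieldsDeclarer declarer) {
        declarer.declare(new Fields("word"));
    }
}
```

A topology built before 1.0 can keep its backtype.storm imports if the client.jartransformer.class setting shown in the table is added to storm.yaml.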
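For BUG-63146, here is a minimal Scheme implementation against the Storm 1.x signature, modeled loosely on the StringScheme linked in the table.

```java
import java.nio.ByteBuffer;
import java.nio.charset.StandardCharsets;
import java.util.List;

import org.apache.storm.spout.Scheme;
import org.apache.storm.tuple.Fields;
import org.apache.storm.tuple.Values;

public class SimpleStringScheme implements Scheme {
    @Override
    public List<Object> deserialize(ByteBuffer buffer) {
        // deserialize() now receives a ByteBuffer instead of a byte[]. Copy the
        // remaining bytes out explicitly: the buffer may be read-only or backed
        // by a larger shared array, so buffer.array() is not safe to use directly.
        byte[] bytes = new byte[buffer.remaining()];
        buffer.get(bytes);
        return new Values(new String(bytes, StandardCharsets.UTF_8));
    }

    @Override
    public Fields getOutputFields() {
        return new Fields("str");
    }
}
```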