hive-dev mailing list archives

From "Sergio Pena" <sergio.p...@cloudera.com>
Subject Review Request 32499: HIVE-10086: Hive throws error when accessing Parquet file schema using field name match
Date Wed, 25 Mar 2015 22:42:21 GMT

-----------------------------------------------------------
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/32499/
-----------------------------------------------------------

Review request for hive.


Bugs: HIVE-10086
    https://issues.apache.org/jira/browse/HIVE-10086


Repository: hive-git


Description
-------

Attached is the patch that handles schemas that do not match between Parquet and Hive.

In this case, Parquet data is accessed by name matching. The table columns may be in a different
order than in the Parquet schema, but if a column name matches a Parquet column name, its value is retrieved.

Also, if the Hive schema has columns or struct elements that do not match the Parquet
schema, then NULL values are returned for them instead.
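
To illustrate the behavior described above, here is a minimal sketch in plain Java. It is not the code in DataWritableReadSupport.java and uses no Hive or Parquet APIs: the class NameMatchSketch, the method resolveByName, and the use of a Map to stand in for a Parquet record are all hypothetical, and exact string equality of names is an assumption. It only shows the idea of looking up each Hive column by name, regardless of order, and yielding NULL when no Parquet column matches.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

public class NameMatchSketch {

    /**
     * Hypothetical helper: resolves each Hive column by name against one
     * Parquet record, modeled here as a simple name-to-value map.
     * Columns with no matching Parquet name resolve to null.
     */
    static List<Object> resolveByName(List<String> hiveColumns,
                                      Map<String, Object> parquetRecord) {
        List<Object> row = new ArrayList<>();
        for (String column : hiveColumns) {
            // Map.get returns null for a missing key, which models the NULL result.
            row.add(parquetRecord.get(column));
        }
        return row;
    }

    public static void main(String[] args) {
        // Parquet file written with columns in the order (id, name).
        Map<String, Object> parquetRecord = new LinkedHashMap<>();
        parquetRecord.put("id", 1);
        parquetRecord.put("name", "alice");

        // Hive table declares a different order plus a column the file lacks.
        List<String> hiveColumns = Arrays.asList("name", "age", "id");

        System.out.println(resolveByName(hiveColumns, parquetRecord));
        // prints [alice, null, 1]
    }
}
```

The same lookup-or-NULL rule would apply one level down for struct elements, with each struct treated as its own set of named fields.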


Diffs
-----

  ql/src/java/org/apache/hadoop/hive/ql/io/parquet/read/DataWritableReadSupport.java 57ae7a9740d55b407cadfc8bc030593b29f90700

  ql/src/test/queries/clientpositive/parquet_schema_evolution.q PRE-CREATION 
  ql/src/test/queries/clientpositive/parquet_table_with_subschema.q PRE-CREATION 
  ql/src/test/results/clientpositive/parquet_schema_evolution.q.out PRE-CREATION 
  ql/src/test/results/clientpositive/parquet_table_with_subschema.q.out PRE-CREATION 

Diff: https://reviews.apache.org/r/32499/diff/


Testing
-------


Thanks,

Sergio Pena

