hive-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Mohit Sabharwal (JIRA)" <j...@apache.org>
Subject [jira] [Created] (HIVE-8205) Using strings in group type fails in ParquetSerDe
Date Sat, 20 Sep 2014 02:22:33 GMT
Mohit Sabharwal created HIVE-8205:
-------------------------------------

             Summary: Using strings in group type fails in ParquetSerDe
                 Key: HIVE-8205
                 URL: https://issues.apache.org/jira/browse/HIVE-8205
             Project: Hive
          Issue Type: Bug
          Components: Serializers/Deserializers
            Reporter: Mohit Sabharwal
            Assignee: Mohit Sabharwal


In HIVE-7735, schema info was plumbed to ETypeConverter to disambiguate between hive Char,
Varchar and String types, which are all represented as PrimitiveType "binary" and OriginalType
"utf8" in parquet.

However, this does not work for parquet nested types (that map to hive Array, Map, etc.) containing
these values, because schema lookup for nested values was not implemented.  It's also non-trivial
to do that in the current parquet serde implementation. Instead of plumbing in the schema,
we should convert 
these types to the same Text writeable and let the object inspectors handle the final conversion.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message