hive-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Chris Drome (JIRA)" <>
Subject [jira] [Created] (HIVE-7189) Hive does not store column names in ORC
Date Fri, 06 Jun 2014 20:55:03 GMT
Chris Drome created HIVE-7189:

             Summary: Hive does not store column names in ORC
                 Key: HIVE-7189
             Project: Hive
          Issue Type: Bug
          Components: File Formats
    Affects Versions: 0.13.0, 0.12.0
            Reporter: Chris Drome

We uncovered the following discrepancy between writing ORC files through Pig and Hive:

ORCFile header contains the name of the columns. Storing through Pig (ORCStorage or HCatStorer),
the column names are stored fine. But when stored through hive they are stored as _col0, _col1,....,_col99
and hive uses the partition schema to map the column names. Reading the same file through
Pig then has problems as user will have to manually map columns.

This message was sent by Atlassian JIRA

View raw message