hive-dev mailing list archives

From "Siddharth Seth (JIRA)" <j...@apache.org>
Subject [jira] [Created] (HIVE-14800) Handle off by 3 in ORC split generation based on split strategy used
Date Tue, 20 Sep 2016 18:24:20 GMT
Siddharth Seth created HIVE-14800:
-------------------------------------

             Summary: Handle off by 3 in ORC split generation based on split strategy used
                 Key: HIVE-14800
                 URL: https://issues.apache.org/jira/browse/HIVE-14800
             Project: Hive
          Issue Type: Bug
            Reporter: Siddharth Seth


The BI split strategy apparently generates splits starting at offset 0.
The ETL split strategy skips the 3-byte ORC header and generates a split starting at offset 3.

There's a workaround in HiveSplitGenerator to handle this so that splits stay consistent. Ideally,
ORC split generation should take care of this itself.
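
To make the mismatch concrete, here is a minimal sketch of one way a consumer could treat the two starting offsets as equivalent. It is not the actual workaround in HiveSplitGenerator; the class SplitOffsetSketch, the Split type, and the normalize method are hypothetical names used only for illustration, and the only assumption carried over from above is that the ORC header is 3 bytes long.

```java
class SplitOffsetSketch {
    // Length of the ORC file header (the 3-byte magic "ORC").
    static final long ORC_HEADER_LENGTH = 3;

    /** Hypothetical stand-in for a file split: start offset and length in bytes. */
    static final class Split {
        final long start;
        final long length;

        Split(long start, long length) {
            this.start = start;
            this.length = length;
        }

        long end() {
            return start + length;
        }
    }

    /**
     * Maps a split that starts right after the header (offset 3) to one that
     * starts at offset 0, keeping the end offset unchanged. Any other split
     * is returned as-is.
     */
    static Split normalize(Split s) {
        if (s.start == ORC_HEADER_LENGTH) {
            return new Split(0, s.length + ORC_HEADER_LENGTH);
        }
        return s;
    }

    public static void main(String[] args) {
        Split etlStyle = normalize(new Split(3, 100)); // starts after the header
        Split biStyle = normalize(new Split(0, 103));  // starts at the file start
        // After normalization both describe the same byte range.
        System.out.println(etlStyle.start == biStyle.start
                && etlStyle.end() == biStyle.end());
    }
}
```

With this kind of normalization, a first split produced by either strategy covers the same byte range, which is the consistency the workaround is after. Doing the equivalent inside ORC split generation would remove the need for callers to know about the header length.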


cc [~prasanth_j], [~gopalv]



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
