hive-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Prasanth Jayachandran (JIRA)" <j...@apache.org>
Subject [jira] [Created] (HIVE-10114) Split strategies for ORC
Date Fri, 27 Mar 2015 01:42:53 GMT
Prasanth Jayachandran created HIVE-10114:
--------------------------------------------

             Summary: Split strategies for ORC
                 Key: HIVE-10114
                 URL: https://issues.apache.org/jira/browse/HIVE-10114
             Project: Hive
          Issue Type: Improvement
    Affects Versions: 1.2.0
            Reporter: Prasanth Jayachandran
            Assignee: Prasanth Jayachandran


ORC split generation does not have clearly defined strategies for different scenarios (many
small orc files, few small orc files, many large files etc.). Few strategies like storing
the file footer in orc split, making entire file as a orc split already exists. This JIRA
to make the split generation simpler, support different strategies for various use cases (BI,
ETL, ACID etc.) and to lay the foundation for HIVE-7428.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message