hive-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Owen O'Malley (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (HIVE-4246) Implement predicate pushdown for ORC
Date Thu, 20 Jun 2013 06:26:20 GMT

     [ https://issues.apache.org/jira/browse/HIVE-4246?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Owen O'Malley updated HIVE-4246:
--------------------------------

    Status: Patch Available  (was: Open)

This patch:
  * Adds the column names for the required columns
  * Uses the SearchArgument interface added in HIVE-4579
  * Updates the ORC reader to skip over sets of rows that aren't useful.
  * Extends InStream to read from multiple sets of byte buffers
  * Updates the ORC reader to skip over ignored rows after each next
                
> Implement predicate pushdown for ORC
> ------------------------------------
>
>                 Key: HIVE-4246
>                 URL: https://issues.apache.org/jira/browse/HIVE-4246
>             Project: Hive
>          Issue Type: New Feature
>          Components: File Formats
>            Reporter: Owen O'Malley
>            Assignee: Owen O'Malley
>         Attachments: HIVE-4246.D11415.1.patch
>
>
> By using the push down predicates from the table scan operator, ORC can skip over 10,000
rows at a time that won't satisfy the predicate. This will help a lot, especially if the file
is sorted by the column that is used in the predicate.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Mime
View raw message