hive-issues mailing list archives

From "Prasanth Jayachandran (JIRA)" <>
Subject [jira] [Updated] (HIVE-11705) refactor SARG stripe filtering for ORC into a separate method
Date Wed, 16 Sep 2015 06:57:47 GMT


Prasanth Jayachandran updated HIVE-11705:
    Affects Version/s: 2.0.0

> refactor SARG stripe filtering for ORC into a separate method
> -------------------------------------------------------------
>                 Key: HIVE-11705
>                 URL:
>             Project: Hive
>          Issue Type: Bug
>    Affects Versions: 2.0.0
>            Reporter: Sergey Shelukhin
>            Assignee: Sergey Shelukhin
>             Fix For: 2.0.0
>         Attachments: HIVE-11705.01.patch, HIVE-11705.02.patch, HIVE-11705.03.patch, HIVE-11705.patch
> For footer cache PPD to metastore, we'd need a method to do the PPD. Tiny item to create it on OrcInputFormat.
> For metastore path, these methods will be called from expression proxy similar to current objectstore expr filtering; it will change to have serialized sarg and column list to come from request instead of conf; includedCols/etc. will also come from request instead of assorted java objects.
> -The types and stripe stats will need to be extracted from HBase. This is a little bit of a problem, since ideally we want to be inside HBase filter/coprocessor/.... I'd need to take a look to see if this is possible... since that filter would need to either deserialize orc, or we would need to store types and stats information in some other, non-ORC manner on write. The latter is probably a better idea, although it's dangerous because there's no sync between this code and ORC itself.-
> Meanwhile minimize dependencies for stripe picking to essentials (and conf which is easy to remove).
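
For readers unfamiliar with the idea, here is a rough sketch of what a stripe-picking method with minimal dependencies could look like: it takes only a predicate and per-stripe statistics, and returns one include/skip flag per stripe. This is not the code from the attached patches. Every name in it (StripeStats, RangePredicate, StripePickerSketch, pickStripes) is hypothetical, and the single-column range predicate is a simplified stand-in for a real search argument (SARG).

```java
import java.util.Arrays;
import java.util.List;

/** Hypothetical per-stripe min/max statistics for one long-typed column. */
final class StripeStats {
    final long min;
    final long max;

    StripeStats(long min, long max) {
        this.min = min;
        this.max = max;
    }
}

/** Hypothetical stand-in for a search argument: lo <= column <= hi. */
final class RangePredicate {
    final long lo;
    final long hi;

    RangePredicate(long lo, long hi) {
        this.lo = lo;
        this.hi = hi;
    }

    /** True if some value in [stats.min, stats.max] could satisfy the predicate. */
    boolean mayMatch(StripeStats stats) {
        return stats.max >= lo && stats.min <= hi;
    }
}

/** Assumed shape of a self-contained stripe picker; not the Hive API. */
final class StripePickerSketch {

    /** Returns one flag per stripe: true = must be read, false = can be skipped. */
    static boolean[] pickStripes(RangePredicate sarg, List<StripeStats> stripeStats) {
        boolean[] include = new boolean[stripeStats.size()];
        for (int i = 0; i < include.length; i++) {
            include[i] = sarg.mayMatch(stripeStats.get(i));
        }
        return include;
    }

    public static void main(String[] args) {
        List<StripeStats> stats = Arrays.asList(
            new StripeStats(0, 99),
            new StripeStats(100, 199),
            new StripeStats(200, 299));
        boolean[] include = pickStripes(new RangePredicate(150, 250), stats);
        System.out.println(Arrays.toString(include)); // prints [false, true, true]
    }
}
```

Because the method depends only on its two arguments, the same logic could in principle be driven by a request payload rather than by a job configuration, which is the direction the description above points at.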

This message was sent by Atlassian JIRA
