hive-issues mailing list archives

From "Saurabh Seth (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (HIVE-20730) Do delete event filtering even if hive.acid.index is not there
Date Fri, 09 Nov 2018 17:10:00 GMT

     [ https://issues.apache.org/jira/browse/HIVE-20730?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Saurabh Seth updated HIVE-20730:
--------------------------------
    Attachment: HIVE-20730.3.patch
        Status: Patch Available  (was: In Progress)

I had accidentally uploaded an incorrect patch.
I have also fixed the TestOrcRawRecordMerger.testNewBase test failure and uploaded a new patch.

> Do delete event filtering even if hive.acid.index is not there
> --------------------------------------------------------------
>
>                 Key: HIVE-20730
>                 URL: https://issues.apache.org/jira/browse/HIVE-20730
>             Project: Hive
>          Issue Type: Improvement
>          Components: Transactions
>    Affects Versions: 4.0.0
>            Reporter: Eugene Koifman
>            Assignee: Saurabh Seth
>            Priority: Major
>         Attachments: HIVE-20730.2.patch, HIVE-20730.3.patch, HIVE-20730.patch
>
>
> Since HIVE-16812, {{VectorizedOrcAcidRowBatchReader}} filters delete events based on the
> min/max ROW__ID in the split, which relies on {{hive.acid.index}} being in the ORC footer.
> There is no way to generate {{hive.acid.index}} from a plain query, as in HIVE-20699, so
> we need to make sure that we generate a SARG into delete_delta/bucket_x based on stripe
> stats even if the index is missing.
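
To illustrate the idea in the description above, here is a minimal conceptual sketch in Python. It is not Hive code and not the attached patch. It assumes a ROW__ID can be modelled as a (writeId, bucket, rowId) tuple compared lexicographically, and that each stripe exposes a lower and upper bound on the ROW__IDs it contains. StripeStats, key_interval and filter_delete_events are hypothetical names invented for this example. The derived interval plays the role that a SARG would play when pushed into the delete_delta reader.

```python
from dataclasses import dataclass
from typing import Iterable, List, Optional, Tuple

# Simplified, assumed model of ROW__ID: (writeId, bucket, rowId).
RowId = Tuple[int, int, int]


@dataclass(frozen=True)
class StripeStats:
    """Hypothetical per-stripe bounds on ROW__ID (not a Hive or ORC class)."""
    min_key: RowId
    max_key: RowId


def key_interval(stripes: Iterable[StripeStats]) -> Optional[Tuple[RowId, RowId]]:
    """Combine the stripe bounds of a split into one [min, max] key interval.

    Returns None when no statistics are available.
    """
    stripes = list(stripes)
    if not stripes:
        return None
    return (min(s.min_key for s in stripes), max(s.max_key for s in stripes))


def filter_delete_events(delete_events: Iterable[RowId],
                         interval: Optional[Tuple[RowId, RowId]]) -> List[RowId]:
    """Keep only the delete events that can affect rows in the split."""
    if interval is None:
        # Without statistics we cannot prune safely, so keep every event.
        return list(delete_events)
    lo, hi = interval
    return [e for e in delete_events if lo <= e <= hi]


if __name__ == "__main__":
    stripes = [StripeStats((5, 0, 0), (5, 0, 99)),
               StripeStats((6, 0, 0), (7, 0, 40))]
    events = [(4, 0, 10), (5, 0, 50), (7, 0, 40), (7, 0, 41), (9, 0, 0)]
    print(filter_delete_events(events, key_interval(stripes)))
```

The filter is conservative by design: bounds that are too wide only cause extra delete events to be loaded, while bounds that are too narrow would drop deletes that apply to rows in the split.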



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
