hive-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Eugene Koifman (JIRA)" <j...@apache.org>
Subject [jira] [Created] (HIVE-16177) non Acid to acid conversion doesn't handle _copy_N files
Date Fri, 10 Mar 2017 20:52:04 GMT
Eugene Koifman created HIVE-16177:
-------------------------------------

             Summary: non Acid to acid conversion doesn't handle _copy_N files
                 Key: HIVE-16177
                 URL: https://issues.apache.org/jira/browse/HIVE-16177
             Project: Hive
          Issue Type: Bug
          Components: Transactions
            Reporter: Eugene Koifman
            Priority: Critical


insert into T(a,b) values(1,2)
insert into T(a,b) values(1,3)

    //we should now have bucket files 000001_0 and 000001_0_copy_1

but OrcRawRecordMerger.OriginalReaderPair.next() doesn't know that there can be copy_N files
and numbers rows in each bucket from 0 thus generating duplicate IDs


[~owen.omalley], do you have any thoughts on a good way to handle this?

attached patch has a few changes to make Acid even recognize copy_N but this is just a pre-requisite.
 The new UT demonstrates the issue.




--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

Mime
View raw message