hive-dev mailing list archives

From "Alan Gates (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HIVE-8966) Delta files created by hive hcatalog streaming cannot be compacted
Date Wed, 26 Nov 2014 20:49:12 GMT

    [ https://issues.apache.org/jira/browse/HIVE-8966?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14226794#comment-14226794 ]

Alan Gates commented on HIVE-8966:
----------------------------------

This flush length file should be removed when the batch is closed.  Are you closing the transaction
batch on a regular basis?

> Delta files created by hive hcatalog streaming cannot be compacted
> ------------------------------------------------------------------
>
>                 Key: HIVE-8966
>                 URL: https://issues.apache.org/jira/browse/HIVE-8966
>             Project: Hive
>          Issue Type: Bug
>          Components: HCatalog
>    Affects Versions: 0.14.0
>         Environment: hive
>            Reporter: Jihong Liu
>            Assignee: Alan Gates
>            Priority: Critical
>
> Hive HCatalog streaming also creates a file named bucket_n_flush_length in each delta
> directory, where "n" is the bucket number. compactor.CompactorMR treats this file as one
> that needs to be compacted. The file cannot be compacted, so compactor.CompactorMR
> does not proceed with the compaction.
> In a test, after the bucket_n_flush_length file was removed, "alter table partition
> compact" finished successfully. If the file is not deleted, nothing is compacted.
> This is probably a very severe bug. Both 0.13 and 0.14 have this issue.
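
To check whether a partition still holds the side files described in the report, a small
diagnostic along these lines may help. It is only a sketch under stated assumptions: it walks
a local (or locally mounted) copy of the partition directory rather than HDFS, it assumes
delta directories are named delta_*, and the helper name find_flush_length_files is
hypothetical, not part of Hive.

```python
import re
from pathlib import Path

# Side-file name as described in the report: bucket_<n>_flush_length
FLUSH_LENGTH_RE = re.compile(r"^bucket_(\d+)_flush_length$")


def find_flush_length_files(partition_dir):
    """Return sorted paths of bucket_n_flush_length files found in delta_* directories.

    Assumption: partition_dir is a path on a local filesystem; on HDFS an
    HDFS client would be needed instead of pathlib.
    """
    root = Path(partition_dir)
    found = []
    for delta in root.glob("delta_*"):
        if not delta.is_dir():
            continue
        for entry in delta.iterdir():
            # Only the flush-length side files are reported, not the bucket files.
            if entry.is_file() and FLUSH_LENGTH_RE.match(entry.name):
                found.append(entry)
    return sorted(found)
```

An empty result means no flush length files remain in the delta directories. A non-empty
result lists the files that, per the report, stop the compaction from proceeding.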



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
