accumulo-notifications mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Eric Newton (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (ACCUMULO-1044) bulk imported files showing up in metadata after bulk import fails
Date Tue, 02 Apr 2013 14:43:15 GMT

    [ https://issues.apache.org/jira/browse/ACCUMULO-1044?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13619863#comment-13619863
] 

Eric Newton commented on ACCUMULO-1044:
---------------------------------------

This may be impossible to fix without a complete restructuring of bulk import.

There's a race condition between the update of the metadata with a bulk-file-loaded flag,
and the closing of the transaction.  The current code keeps this window very small, but it
is still possible.

Another "fix" is to never move files to the failed directory: always copy them.  However,
the race condition is just moved from the Master to the Garbage Collector.

The work-around now is to increase the number of retries to a very high number.
                
> bulk imported files showing up in metadata after bulk import fails
> ------------------------------------------------------------------
>
>                 Key: ACCUMULO-1044
>                 URL: https://issues.apache.org/jira/browse/ACCUMULO-1044
>             Project: Accumulo
>          Issue Type: Bug
>          Components: master, tserver
>    Affects Versions: 1.4.2
>            Reporter: Eric Newton
>            Assignee: Eric Newton
>            Priority: Critical
>             Fix For: 1.5.0, 1.4.4
>
>
> Bulk import fails.  The file is moved to the failures directory.
> But references in the !METADATA table remain.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Mime
View raw message