accumulo-notifications mailing list archives

From "Josh Elser (JIRA)" <j...@apache.org>
Subject [jira] [Resolved] (ACCUMULO-3289) BulkFileIT failed to import files
Date Wed, 03 Dec 2014 05:18:12 GMT

     [ https://issues.apache.org/jira/browse/ACCUMULO-3289?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Josh Elser resolved ACCUMULO-3289.
----------------------------------
    Resolution: Fixed

Haven't had this fail in the same way recently. Added some extra logic in here that will hopefully
help debug future failures (if they happen).

Since there were commits for it, I'm marking it as fixed as opposed to cannot reproduce.

> BulkFileIT failed to import files
> ---------------------------------
>
>                 Key: ACCUMULO-3289
>                 URL: https://issues.apache.org/jira/browse/ACCUMULO-3289
>             Project: Accumulo
>          Issue Type: Bug
>          Components: test
>            Reporter: Josh Elser
>            Assignee: Josh Elser
>             Fix For: 1.6.2, 1.7.0
>
>         Attachments: accumulo-3289.tar.gz
>
>          Time Spent: 1h
>  Remaining Estimate: 0h
>
> Had a failure with BulkFileIT. Looking at the master logs, it appears that the following might have happened:
> * 2 Tservers
> * One of the tservers doesn't respond to communication
> * The master repeatedly contacts it to try to tell it to perform the bulk load
> * The tserver that isn't communicating w/ the master has no errors
> * That tserver logged an assignment that never finished
> * That tserver also got a single bulk import request, and the last thing it logged WRT that bulk import was "Assigning 1 map files to 3 tablets at ...". The 2nd tserver doesn't appear to have anything from that tserver about the import request, which should have been incoming.
> * Eventually the master tried to stop that other tserver, but the test timed out ~30s later (not sure if the tserver would've actually stopped).
> The fact that I see an incomplete assignment and inexplicable bulk load hangs gives me pause in light of ACCUMULO-3276. Will attach some logs.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
