accumulo-notifications mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Christopher Tubbs (JIRA)" <>
Subject [jira] [Commented] (ACCUMULO-4389) ReplicationOperations().drain(..) may return too quickly
Date Wed, 31 Aug 2016 03:11:20 GMT


Christopher Tubbs commented on ACCUMULO-4389:

Yeah. There isn't really much in the server side logs to help troubleshoot this. There's just
what appears to be client disconnects (indicated by Connection reset messages, IOException
in internalRead).

> ReplicationOperations().drain(..) may return too quickly
> --------------------------------------------------------
>                 Key: ACCUMULO-4389
>                 URL:
>             Project: Accumulo
>          Issue Type: Bug
>            Reporter: Josh Elser
>            Assignee: Josh Elser
>            Priority: Critical
>             Fix For: 1.7.3, 1.8.1
> Was taking a look at some logs from automated tests that [~romil.choksi] sent my way
and noticed that MultiInstanceReplicationIT was failing infrequently.
> Looking at the output, I can see that the call was returning very quickly (essentially
in the amount of time the RPC would take on the slow test hardware)
> {noformat}
> Drain completed in 25ms
> {noformat}
> Looking at the implementation of {{MasterClientServiceHandler.drainReplicationTable(...)}},
it's not handling the references we read from the metadata table correctly. I believe this
is causing the test to return too quickly.

This message was sent by Atlassian JIRA

View raw message