accumulo-notifications mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Eric Newton (JIRA)" <>
Subject [jira] [Commented] (ACCUMULO-2206) close consistency check failure
Date Thu, 16 Jan 2014 18:21:23 GMT


Eric Newton commented on ACCUMULO-2206:

The first occurrence was was on node3 (found by scanning the monitor log):

2014-01-16 13:10:33,451 [tserver.Tablet] ERROR: tserver:node3 Closed tablet 3;2de232;2dc237
has walog entries in accumulo.metadata [3;2de232; hdfs://node2:9000/home/user/accumulo/wal/node5+46372/c0260a2a-487c-47a6-991d-50007e357a63

This tablet was loaded at 13:10:18:
2014-01-16 13:10:18,795 [master.EventCoordinator] INFO : tablet 3;2de232;2dc237 was loaded
on node3:34029

When the tablet was loaded, the WAL was used for recovery, with no mutations applied.

The !METADATA table WAL had been deleted, so I could not verify the mutations.  The metadata
tablet !0;~< and the root table +r were online and did not have any recovery run on them.

Re-running the test with the trash on.

> close consistency check failure
> -------------------------------
>                 Key: ACCUMULO-2206
>                 URL:
>             Project: Accumulo
>          Issue Type: Bug
>          Components: tserver
>         Environment: 10-node test cluster
>            Reporter: Eric Newton
>            Assignee: Eric Newton
>              Labels: 16_qa_bug
> while running the continuous ingest, with agitation, there are multiple close-consistency
check failures in the logs:
> {noformat}
> Failed to do close consistency check for tablet 3;2de232;2dc237
> 	java.lang.RuntimeException: Closed tablet 3;2de232;2dc237 has walog entries in accumulo.metadata
[3;2de232; hdfs://namenode:9000/home/user/accumulo/wal/host5+46372/c0260a2a-487c-47a6-991d-50007e357a63
> 		at org.apache.accumulo.tserver.Tablet.closeConsistencyCheck(
> 		at org.apache.accumulo.tserver.Tablet.completeClose(
> 		at org.apache.accumulo.tserver.Tablet.close(
> 		at org.apache.accumulo.tserver.TabletServer$
> 		at
> 		at
> 		at java.util.concurrent.ThreadPoolExecutor$Worker.runTask(
> 		at java.util.concurrent.ThreadPoolExecutor$
> 		at
> 		at
> {noformat}
> Running verification to see if there was any data loss.

This message was sent by Atlassian JIRA

View raw message