accumulo-notifications mailing list archives

From "Josh Elser (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (ACCUMULO-3435) Sporadic MetadataMaxFilesIT failure
Date Fri, 19 Dec 2014 06:03:13 GMT

    [ https://issues.apache.org/jira/browse/ACCUMULO-3435?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14253013#comment-14253013 ]

Josh Elser commented on ACCUMULO-3435:
--------------------------------------

Caught a failure locally. The root table came online, but the metadata table was stuck offline.
The TabletServer failed to recover that tablet because the filesystem threw a checksum error while the write-ahead log was being sorted:

{noformat}
org.apache.hadoop.fs.ChecksumException: Checksum error: file:/.../test/target/mini-tests/org.apache.accumulo.test.functional.MetadataMaxFilesIT_test/accumulo/wal/HW10447.local+64012/11533b18-f046-4c9b-96e5-40ac2e33b6a0 at 3144704
        at org.apache.hadoop.fs.ChecksumFileSystem$ChecksumFSInputChecker.readChunk(ChecksumFileSystem.java:251)
        at org.apache.hadoop.fs.FSInputChecker.readChecksumChunk(FSInputChecker.java:275)
        at org.apache.hadoop.fs.FSInputChecker.fill(FSInputChecker.java:213)
        at org.apache.hadoop.fs.FSInputChecker.read1(FSInputChecker.java:231)
        at org.apache.hadoop.fs.FSInputChecker.read(FSInputChecker.java:195)
        at java.io.DataInputStream.readFully(DataInputStream.java:195)
        at java.io.DataInputStream.readFully(DataInputStream.java:169)
        at org.apache.accumulo.core.data.Mutation.readFields(Mutation.java:470)
        at org.apache.accumulo.server.data.ServerMutation.readFields(ServerMutation.java:52)
        at org.apache.accumulo.tserver.logger.LogFileValue.readFields(LogFileValue.java:45)
        at org.apache.accumulo.tserver.log.LogSorter$LogProcessor.sort(LogSorter.java:141)
        at org.apache.accumulo.tserver.log.LogSorter$LogProcessor.process(LogSorter.java:95)
        at org.apache.accumulo.server.zookeeper.DistributedWorkQueue$1.run(DistributedWorkQueue.java:101)
        at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
        at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
        at org.apache.accumulo.trace.instrument.TraceRunnable.run(TraceRunnable.java:47)
        at org.apache.accumulo.core.util.LoggingRunnable.run(LoggingRunnable.java:34)
        at java.lang.Thread.run(Thread.java:745)
{noformat}

We should be using the RawLocalFileSystem for this test instead of the ChecksumFileSystem.
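
To illustrate the difference between the two read paths, here is a minimal stdlib-only sketch. It is a model, not Hadoop code: the class ChecksumReadSketch, its methods, and the 512-byte chunk size are all hypothetical names and values chosen for illustration, and the flipped byte is only a stand-in for whatever caused the mismatch in the real run, which this comment does not establish. It shows a verifying read rejecting data that a raw read returns without complaint.

```java
import java.io.IOException;
import java.util.Arrays;
import java.util.zip.CRC32;

/** Stdlib-only model of checksummed vs. raw reads; not Hadoop code. */
public class ChecksumReadSketch {
    // Hypothetical chunk size, chosen for illustration only.
    static final int CHUNK = 512;

    /** Computes one CRC32 per CHUNK-sized slice, like a checksum sidecar. */
    static long[] checksums(byte[] data) {
        int n = (data.length + CHUNK - 1) / CHUNK;
        long[] sums = new long[n];
        for (int i = 0; i < n; i++) {
            int off = i * CHUNK;
            CRC32 crc = new CRC32();
            crc.update(data, off, Math.min(CHUNK, data.length - off));
            sums[i] = crc.getValue();
        }
        return sums;
    }

    /** Verifying read: throws when a chunk disagrees with its stored CRC. */
    static byte[] readChecked(byte[] data, long[] stored) throws IOException {
        long[] actual = checksums(data);
        for (int i = 0; i < actual.length; i++) {
            if (i >= stored.length || actual[i] != stored[i]) {
                throw new IOException("Checksum error at " + (long) i * CHUNK);
            }
        }
        return data.clone();
    }

    /** Raw read: returns the bytes without consulting any checksum. */
    static byte[] readRaw(byte[] data) {
        return data.clone();
    }

    public static void main(String[] args) {
        byte[] log = new byte[2000];
        Arrays.fill(log, (byte) 7);
        long[] sidecar = checksums(log);
        log[1500] ^= 1; // simulate data and stored checksums drifting apart

        System.out.println("raw read bytes: " + readRaw(log).length);
        try {
            readChecked(log, sidecar);
            System.out.println("checked read succeeded");
        } catch (IOException e) {
            System.out.println("checked read failed: " + e.getMessage());
        }
    }
}
```

In Hadoop itself the choice of local filesystem implementation is normally made through configuration (for example the fs.file.impl property) rather than in code like the above; how the test harness should wire that in is not covered by this sketch.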

> Sporadic MetadataMaxFilesIT failure
> -----------------------------------
>
>                 Key: ACCUMULO-3435
>                 URL: https://issues.apache.org/jira/browse/ACCUMULO-3435
>             Project: Accumulo
>          Issue Type: Bug
>          Components: test
>            Reporter: Josh Elser
>            Assignee: Josh Elser
>            Priority: Blocker
>             Fix For: 1.6.2
>
>
> This test is failing occasionally for me on my Jenkins instance, more often than it passes, actually.
> The failure condition seems to be that the test reports that it finds no tablets at all, which seems very unlikely, whereas successful runs of the test have non-zero tablet reports.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
