hadoop-hdfs-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Rakesh R (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HDFS-8370) Erasure Coding: TestRecoverStripedFile#testRecoverOneParityBlock is failing
Date Mon, 11 May 2015 13:43:00 GMT

    [ https://issues.apache.org/jira/browse/HDFS-8370?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14537947#comment-14537947
] 

Rakesh R commented on HDFS-8370:
--------------------------------

As per the initial analysis, recovery operation is getting failed during encoding/decoding
function:

{code}
2015-05-11 17:45:33,871 WARN  datanode.DataNode (ErasureCodingWorker.java:run(402)) - Failed
to recover striped block: BP-890762290-192.168.1.2-1431346474544:blk_-9223372036854775776_1002
java.lang.ArrayIndexOutOfBoundsException: 79667
	at org.apache.hadoop.io.erasurecode.rawcoder.util.GaloisField.remainder(GaloisField.java:427)
	at org.apache.hadoop.io.erasurecode.rawcoder.RSRawEncoder.doEncode(RSRawEncoder.java:76)
	at org.apache.hadoop.io.erasurecode.rawcoder.AbstractRawErasureEncoder.encode(AbstractRawErasureEncoder.java:40)
	at org.apache.hadoop.hdfs.server.datanode.erasurecode.ErasureCodingWorker$ReconstructAndTransferBlock.recoverTargets(ErasureCodingWorker.java:560)
	at org.apache.hadoop.hdfs.server.datanode.erasurecode.ErasureCodingWorker$ReconstructAndTransferBlock.run(ErasureCodingWorker.java:384)
	at java.lang.Thread.run(Unknown Source)
{code}

> Erasure Coding: TestRecoverStripedFile#testRecoverOneParityBlock is failing
> ---------------------------------------------------------------------------
>
>                 Key: HDFS-8370
>                 URL: https://issues.apache.org/jira/browse/HDFS-8370
>             Project: Hadoop HDFS
>          Issue Type: Sub-task
>            Reporter: Rakesh R
>            Assignee: Rakesh R
>
> This jira is to analyse more on the failure of this unit test. 
> {code}
> java.io.IOException: Time out waiting for EC block recovery.
> 	at org.apache.hadoop.hdfs.TestRecoverStripedFile.waitForRecoveryFinished(TestRecoverStripedFile.java:333)
> 	at org.apache.hadoop.hdfs.TestRecoverStripedFile.assertFileBlocksRecovery(TestRecoverStripedFile.java:234)
> 	at org.apache.hadoop.hdfs.TestRecoverStripedFile.testRecoverOneParityBlock(TestRecoverStripedFile.java:98)
> {code}
> Exception occurred during recovery packet transferring:
> {code}
> 2015-05-09 15:08:08,910 INFO  datanode.DataNode (BlockReceiver.java:receiveBlock(826))
- Exception for BP-1332677436-67.195.81.147-1431184082022:blk_-9223372036854775792_1001
> java.io.IOException: Premature EOF from inputStream
> 	at org.apache.hadoop.io.IOUtils.readFully(IOUtils.java:203)
> 	at org.apache.hadoop.hdfs.protocol.datatransfer.PacketReceiver.doReadFully(PacketReceiver.java:213)
> 	at org.apache.hadoop.hdfs.protocol.datatransfer.PacketReceiver.doRead(PacketReceiver.java:134)
> 	at org.apache.hadoop.hdfs.protocol.datatransfer.PacketReceiver.receiveNextPacket(PacketReceiver.java:109)
> 	at org.apache.hadoop.hdfs.server.datanode.BlockReceiver.receivePacket(BlockReceiver.java:472)
> 	at org.apache.hadoop.hdfs.server.datanode.BlockReceiver.receiveBlock(BlockReceiver.java:787)
> 	at org.apache.hadoop.hdfs.server.datanode.DataXceiver.writeBlock(DataXceiver.java:803)
> 	at org.apache.hadoop.hdfs.protocol.datatransfer.Receiver.opWriteBlock(Receiver.java:137)
> 	at org.apache.hadoop.hdfs.protocol.datatransfer.Receiver.processOp(Receiver.java:74)
> 	at org.apache.hadoop.hdfs.server.datanode.DataXceiver.run(DataXceiver.java:250)
> 	at java.lang.Thread.run(Thread.java:745)
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message