hadoop-hdfs-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Hudson (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HDFS-5522) Datanode disk error check may be incorrectly skipped
Date Tue, 13 May 2014 02:55:16 GMT

    [ https://issues.apache.org/jira/browse/HDFS-5522?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13995980#comment-13995980
] 

Hudson commented on HDFS-5522:
------------------------------

FAILURE: Integrated in Hadoop-trunk-Commit #5604 (See [https://builds.apache.org/job/Hadoop-trunk-Commit/5604/])
HDFS-5522. Datanode disk error check may be incorrectly skipped. Contributed by Rushabh Shah.
(kihwal: http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1594055)
* /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/CHANGES.txt
* /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/datanode/BlockReceiver.java
* /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/datanode/DataNode.java
* /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/server/datanode/TestDiskError.java


> Datanode disk error check may be incorrectly skipped
> ----------------------------------------------------
>
>                 Key: HDFS-5522
>                 URL: https://issues.apache.org/jira/browse/HDFS-5522
>             Project: Hadoop HDFS
>          Issue Type: Bug
>    Affects Versions: 0.23.9, 2.2.0
>            Reporter: Kihwal Lee
>            Assignee: Rushabh S Shah
>             Fix For: 3.0.0, 2.5.0
>
>         Attachments: HDFS-5522-v2.patch, HDFS-5522-v3.patch, HDFS-5522.patch
>
>
> After HDFS-4581 and HDFS-4699, {{checkDiskError()}} is not called when network errors
occur during processing data node requests.  This appears to create problems when a disk is
having problems, but not failing I/O soon. 
> If I/O hangs for a long time, network read/write may timeout first and the peer may close
the connection. Although the error was caused by a faulty local disk, disk check is not being
carried out in this case. 



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Mime
View raw message