hadoop-hdfs-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Aaron T. Myers (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HDFS-5922) DN heartbeat thread can get stuck in tight loop
Date Tue, 25 Feb 2014 01:41:20 GMT

    [ https://issues.apache.org/jira/browse/HDFS-5922?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13911099#comment-13911099
] 

Aaron T. Myers commented on HDFS-5922:
--------------------------------------

+1, the latest patch looks great to me, and I agree that the test failure is unrelated.

Thanks a lot for taking care of this, Arpit. I think this is a much simpler solution.

> DN heartbeat thread can get stuck in tight loop
> -----------------------------------------------
>
>                 Key: HDFS-5922
>                 URL: https://issues.apache.org/jira/browse/HDFS-5922
>             Project: Hadoop HDFS
>          Issue Type: Bug
>          Components: datanode
>    Affects Versions: 2.3.0
>            Reporter: Aaron T. Myers
>            Assignee: Arpit Agarwal
>         Attachments: HDFS-5922.01.patch, HDFS-5922.02.patch
>
>
> We saw an issue recently on a test cluster where one of the DN threads was consuming
100% of a single CPU. Running jstack indicated that it was the DN heartbeat thread. I believe
I've tracked down the cause to a bug in the accounting around the value of {{pendingReceivedRequests}}.
> More details in the first comment.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

Mime
View raw message