hadoop-hdfs-issues mailing list archives

From "Tsz Wo Nicholas Sze (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HDFS-7048) Incorrect Dispatcher#Source wait/notify leads to early termination
Date Sat, 27 Feb 2016 01:01:31 GMT

    [ https://issues.apache.org/jira/browse/HDFS-7048?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15170214#comment-15170214 ]

Tsz Wo Nicholas Sze commented on HDFS-7048:
-------------------------------------------

Thanks, Chengbing, for working on this. Some suggestions:

- Since the wait time is at most 1 second, how about we simply change the wait(..) to sleep(..)
and completely remove the notify(..) calls?
- The new log message may not be useful for common users.  How about removing it or changing
it to debug?
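
For illustration only, the first suggestion could look roughly like the sketch below. It is not the Dispatcher code or the attached patch: the class SleepPollSketch, the method pollWithSleep, and its parameters are hypothetical names chosen for this example. The idea is that a thread which only ever waits a bounded, short time can poll with sleep(..), so no monitor or notify(..) call is needed.

```java
import java.util.function.BooleanSupplier;

// Hypothetical sketch: bounded polling with Thread.sleep instead of wait/notify.
// None of these names come from the Hadoop code base.
public class SleepPollSketch {

    /**
     * Sleeps in fixed intervals until workAvailable reports true or
     * maxRounds sleeps have elapsed. Returns the number of sleeps performed.
     */
    static int pollWithSleep(BooleanSupplier workAvailable, long sleepMs, int maxRounds) {
        int rounds = 0;
        while (!workAvailable.getAsBoolean() && rounds < maxRounds) {
            try {
                // No lock is held and nobody has to call notify().
                Thread.sleep(sleepMs);
            } catch (InterruptedException e) {
                // Preserve the interrupt status and stop polling.
                Thread.currentThread().interrupt();
                break;
            }
            rounds++;
        }
        return rounds;
    }

    public static void main(String[] args) {
        long deadline = System.currentTimeMillis() + 50;
        int rounds = pollWithSleep(() -> System.currentTimeMillis() >= deadline, 10, 100);
        System.out.println("slept " + rounds + " rounds");
    }
}
```

The trade-off is that the thread may react up to one sleep interval late, which seems acceptable when the wait is at most 1 second anyway.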

> Incorrect Dispatcher#Source wait/notify leads to early termination
> ------------------------------------------------------------------
>
>                 Key: HDFS-7048
>                 URL: https://issues.apache.org/jira/browse/HDFS-7048
>             Project: Hadoop HDFS
>          Issue Type: Sub-task
>          Components: balancer & mover
>    Affects Versions: 2.6.0, 2.7.0
>            Reporter: Andrew Wang
>            Assignee: Chengbing Liu
>         Attachments: HDFS-7048.01.patch
>
>
> Split off from HDFS-6621. The Balancer attempts to wake up scheduler threads early as
> sources finish, but the synchronization with wait and notify is incorrect. This ticks the
> failure count, which can lead to early termination.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
