zookeeper-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Edward J. Yoon" <edwardy...@apache.org>
Subject Re: Distributed locks for global barrier synchronization
Date Tue, 20 Sep 2011 00:04:58 GMT
Thanks.

And, we found this - https://issues.apache.org/jira/browse/ZOOKEEPER-1011

On Thu, Sep 8, 2011 at 3:06 PM, Ted Dunning <ted.dunning@gmail.com> wrote:
> It might help to have a different znode for synchronization at each
> iteration.  That way, if slow nodes are still just getting around to
> deleting the old node, the fast nodes creating their new nodes will not
> interfere.
>
> On Wed, Sep 7, 2011 at 7:01 PM, Edward J. Yoon <edwardyoon@apache.org>wrote:
>
>> Hi,
>>
>> I'm using Zookeeper for global barrier synchronization of Hama BSP
>> computing engine. Current implementation is based on 'ZooKeeper
>> Recipes and Solutions'[1] but there's a problem.
>>
>> The problem is that, before the last process leaving the barrier
>> completely, other processors are starting to create their node[2]. So,
>> that last process hangs forever at "2. if no children, exit" step.
>> This problem intermittently occurs on high-performance environments.
>>
>> Can anyone advise me?
>>
>> 1. http://zookeeper.apache.org/doc/trunk/recipes.html
>> 2.
>> https://issues.apache.org/jira/browse/HAMA-387?focusedCommentId=13037785&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-13037785
>>
>> --
>> Best Regards, Edward J. Yoon
>> @eddieyoon
>>
>



-- 
Best Regards, Edward J. Yoon
@eddieyoon

Mime
View raw message