zookeeper-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Edward J. Yoon" <edwardy...@apache.org>
Subject Distributed locks for global barrier synchronization
Date Thu, 08 Sep 2011 02:01:21 GMT

I'm using Zookeeper for global barrier synchronization of Hama BSP
computing engine. Current implementation is based on 'ZooKeeper
Recipes and Solutions'[1] but there's a problem.

The problem is that, before the last process leaving the barrier
completely, other processors are starting to create their node[2]. So,
that last process hangs forever at "2. if no children, exit" step.
This problem intermittently occurs on high-performance environments.

Can anyone advise me?

1. http://zookeeper.apache.org/doc/trunk/recipes.html
2. https://issues.apache.org/jira/browse/HAMA-387?focusedCommentId=13037785&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-13037785

Best Regards, Edward J. Yoon

View raw message