Date: Tue, 21 Jun 2011 21:34:57 +0900
Subject: PageRank hang
From: "Edward J. Yoon"
To: hama-dev@incubator.apache.org

Hi,

I talked with Thomas about HAMA-395 on Twitter:

> Thomas Jungblut: @eddieyoon the main problem is that I can't figure it out why. The BSP method finishes and written out the complete results. But no cleanup

I'm replying on hama-dev@ so that we can share this discussion.

As mentioned in the user guide (http://incubator.apache.org/hama/docs/r0.2.0/ApacheHama-0.2_UserGuide.pdf), a BSP job finishes only when all processes have no more local or outgoing queue entries and all processes are done, or when it is killed by the user. Hence, you have to clear all queues manually after exiting the main while loop:

+ // Clears all queues entries.
+ peer.clear();
+ // finally save the chunk of pageranks
+ PageRankBase.savePageRankMap(peer, conf, lastTentativePagerank);
+ LOG.info("Finished with iteration " + iteration + "!");

HAMA-395-v9.patch fixes that problem and works well on my test cluster. :)

--
Best Regards, Edward J. Yoon
@eddieyoon
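
The termination rule quoted from the user guide can be illustrated with a small toy model. This is only a sketch under stated assumptions: `QueueDrainSketch`, `ToyPeer`, and `canFinish` are hypothetical names invented for illustration and are not part of the Hama API. The only thing taken from the patch is the idea that a `clear()` call empties a peer's queues so the job can complete.

```java
import java.util.ArrayDeque;
import java.util.Arrays;
import java.util.List;
import java.util.Queue;

/** Toy model of the termination rule; NOT Hama's real BSPPeer. */
final class QueueDrainSketch {

    /** Hypothetical stand-in for a BSP peer holding two message queues. */
    static final class ToyPeer {
        final Queue<String> localQueue = new ArrayDeque<>();
        final Queue<String> outgoingQueue = new ArrayDeque<>();
        boolean done = false;

        /** Drops every pending message (assumed to mirror the intent of peer.clear()). */
        void clear() {
            localQueue.clear();
            outgoingQueue.clear();
        }

        boolean hasPendingMessages() {
            return !localQueue.isEmpty() || !outgoingQueue.isEmpty();
        }
    }

    /** The job may finish only if every peer is done and has no queued messages. */
    static boolean canFinish(List<ToyPeer> peers) {
        for (ToyPeer p : peers) {
            if (!p.done || p.hasPendingMessages()) {
                return false;
            }
        }
        return true;
    }

    public static void main(String[] args) {
        ToyPeer a = new ToyPeer();
        ToyPeer b = new ToyPeer();

        // Peer b leaves the main loop with a message still queued.
        b.localQueue.add("leftover rank contribution");
        a.done = true;
        b.done = true;

        List<ToyPeer> peers = Arrays.asList(a, b);
        System.out.println("before clear: " + canFinish(peers)); // prints "before clear: false"

        b.clear();
        System.out.println("after clear: " + canFinish(peers));  // prints "after clear: true"
    }
}
```

In this model a single leftover entry on one peer is enough to keep the whole job from finishing, which matches the hang described in this thread: the results are written out, but cleanup never runs.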