From: Sebastian Schelter
Date: Tue, 29 May 2012 13:48:46 +0200
To: user@giraph.apache.org
Subject: Re: SimplePageRankVertex implementation, dangling nodes and sending messages to all nodes...

On 29.05.2012 13:13, Paolo Castagna wrote:
> Hi Sebastian
>
> Sebastian Schelter wrote:
>> Why do you only recompute the pageRank in each second superstep? Can we
>> not use the aggregated value of the dangling nodes from the last superstep?
>
> I no longer compute the PageRank values only every second superstep.
> However, I needed to use a couple of aggregators for the dangling nodes
> contribution instead of just one: "dangling-current" and "dangling-previous".
>
> Each superstep, I need to reset the dangling-current aggregator; at the
> same time, I need to know the value of the aggregator at the previous
> superstep.

You can save the value from the previous superstep in a static variable in
the WorkerContext before resetting the aggregator.

>
> I hope it makes sense, let me know if you have a better idea.
>
>> Overall I think we're on a good way to a robust, real-world PageRank
>> implementation, I managed to implement the convergence check with an
>> aggregator, will post an updated patch soon.
>
> I think I've just done it, have a look [1] and let me know if you would have
> done it differently.
>
> Paolo
>
> [1]
> https://github.com/castagna/jena-grande/blob/11f07dd897562f7a4bf8d6e4845128d7f2cdd2ff/src/main/java/org/apache/jena/grande/giraph/pagerank/PageRankVertex.java#L90
>
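
To make the "save, then reset" suggestion concrete, here is a minimal sketch in plain Java. It is only a single-JVM simulation, not Giraph code: SumAggregator, DanglingContext and PageRankSketch are hypothetical names invented for illustration, and preSuperstep() here is just a static method standing in for whatever per-superstep hook the WorkerContext offers. The damping factor is passed in by the caller. The point is that one aggregator plus one saved value is enough: the rank mass of the dangling nodes summed in superstep s-1 is read in superstep s while the aggregator is already collecting the new sum.

```java
// Plain-Java simulation of the idea; none of these classes are Giraph APIs.

/** Hypothetical stand-in for a sum aggregator. */
final class SumAggregator {
    private double value;

    void aggregate(double v) { value += v; }
    double get() { return value; }
    void reset() { value = 0.0; }
}

/** Hypothetical stand-in for a WorkerContext holding the previous superstep's sum. */
final class DanglingContext {
    // Static field, as suggested in the reply: survives the aggregator reset.
    static double previousDangling = 0.0;
    static final SumAggregator danglingCurrent = new SumAggregator();

    /** Called once before every superstep: save the old sum, then reset. */
    static void preSuperstep() {
        previousDangling = danglingCurrent.get();
        danglingCurrent.reset();
    }
}

final class PageRankSketch {
    /**
     * Runs a fixed number of supersteps. outEdges[v] lists the targets of v;
     * a vertex with no out-edges is dangling and contributes its rank to the
     * aggregator instead of sending messages.
     */
    static double[] run(int[][] outEdges, int supersteps, double damping) {
        int n = outEdges.length;
        double[] rank = new double[n];
        double[] inbox = new double[n];          // messages from the previous superstep
        DanglingContext.danglingCurrent.reset();
        DanglingContext.previousDangling = 0.0;

        for (int step = 0; step < supersteps; step++) {
            DanglingContext.preSuperstep();
            double[] outbox = new double[n];     // messages for the next superstep
            for (int v = 0; v < n; v++) {
                if (step == 0) {
                    rank[v] = 1.0 / n;           // uniform start
                } else {
                    // Dangling mass from the previous superstep is spread evenly.
                    rank[v] = (1.0 - damping) / n
                            + damping * (inbox[v] + DanglingContext.previousDangling / n);
                }
                if (outEdges[v].length == 0) {
                    DanglingContext.danglingCurrent.aggregate(rank[v]);
                } else {
                    for (int target : outEdges[v]) {
                        outbox[target] += rank[v] / outEdges[v].length;
                    }
                }
            }
            inbox = outbox;
        }
        return rank;
    }
}
```

For example, PageRankSketch.run(new int[][] {{1}, {2}, {}}, 30, 0.85) runs a three-vertex chain whose last vertex is dangling; the ranks keep summing to 1 because the dangling mass is redistributed rather than lost. In a real cluster each worker has its own JVM, so a static field would hold a per-worker copy of the globally aggregated value; this sketch does not model that.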