From: Paolo Castagna
Date: Tue, 29 May 2012 12:13:28 +0100
To: user@giraph.apache.org
Subject: Re: SimplePageRankVertex implementation, dangling nodes and sending messages to all nodes...

Hi Sebastian,

Sebastian Schelter wrote:
> Why do you only recompute the pageRank in each second superstep? Can we
> not use the aggregated value of the dangling nodes from the last superstep?

I have removed the restriction that computed the PageRank values only every
second superstep. However, I needed two aggregators for the dangling-node
contribution instead of just one: "dangling-current" and "dangling-previous".
In each superstep I need to reset the dangling-current aggregator, and at the
same time I need to know the value the aggregator had in the previous superstep.

I hope that makes sense; let me know if you have a better idea.
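
The two-aggregator idea can be sketched in plain Java, without any Giraph
classes. This is only an illustration under assumptions, not the code linked
in this thread: the class name, the pageRank method, the adjacency-array graph
representation and the damping factor of 0.85 are all hypothetical. The two
local variables stand in for the "dangling-current" and "dangling-previous"
aggregators. In each superstep the current sum is reset and refilled, while
the rank update reads the sum collected in the previous superstep.

```java
import java.util.Arrays;

/** Plain-Java simulation of supersteps; no Giraph API is used here. */
final class DanglingAggregatorSketch {
    static final double DAMPING = 0.85; // assumed value, not taken from the thread

    /** outEdges[v] lists the targets of vertex v; an empty array marks a dangling node. */
    static double[] pageRank(int[][] outEdges, int supersteps) {
        int n = outEdges.length;
        double[] rank = new double[n];
        Arrays.fill(rank, 1.0 / n);
        double[] inbox = new double[n];   // messages sent during the previous superstep
        double danglingPrevious = 0.0;    // stand-in for "dangling-previous"

        for (int step = 0; step < supersteps; step++) {
            double danglingCurrent = 0.0; // "dangling-current" is reset every superstep
            double[] outbox = new double[n];
            for (int v = 0; v < n; v++) {
                if (step > 0) {
                    // Dangling mass from the previous superstep is shared by all vertices.
                    rank[v] = (1 - DAMPING) / n
                            + DAMPING * (inbox[v] + danglingPrevious / n);
                }
                if (outEdges[v].length == 0) {
                    danglingCurrent += rank[v]; // aggregate instead of messaging everyone
                } else {
                    double share = rank[v] / outEdges[v].length;
                    for (int target : outEdges[v]) {
                        outbox[target] += share;
                    }
                }
            }
            inbox = outbox;
            danglingPrevious = danglingCurrent; // becomes readable in the next superstep
        }
        return rank;
    }

    public static void main(String[] args) {
        int[][] graph = {{1}, {2}, {}, {0, 2}}; // vertex 2 is dangling
        System.out.println(Arrays.toString(pageRank(graph, 30)));
    }
}
```

Because every superstep redistributes exactly the rank mass that was sent or
aggregated in the superstep before it, the ranks keep summing to 1 and no
superstep has to be skipped.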
> Overall I think we're on a good way to a robust, real-world PageRank
> implementation, I managed to implement the convergence check with an
> aggregator, will post an updated patch soon.

I think I've just done it; have a look at [1] and let me know if you would
have done it differently.

Paolo

[1] https://github.com/castagna/jena-grande/blob/11f07dd897562f7a4bf8d6e4845128d7f2cdd2ff/src/main/java/org/apache/jena/grande/giraph/pagerank/PageRankVertex.java#L90