Return-Path: X-Original-To: apmail-incubator-giraph-user-archive@minotaur.apache.org Delivered-To: apmail-incubator-giraph-user-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 2FE419864 for ; Tue, 20 Dec 2011 15:53:44 +0000 (UTC) Received: (qmail 11375 invoked by uid 500); 20 Dec 2011 15:53:44 -0000 Delivered-To: apmail-incubator-giraph-user-archive@incubator.apache.org Received: (qmail 11353 invoked by uid 500); 20 Dec 2011 15:53:44 -0000 Mailing-List: contact giraph-user-help@incubator.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: giraph-user@incubator.apache.org Delivered-To: mailing list giraph-user@incubator.apache.org Received: (qmail 11345 invoked by uid 99); 20 Dec 2011 15:53:44 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 20 Dec 2011 15:53:44 +0000 X-ASF-Spam-Status: No, hits=-0.0 required=5.0 tests=SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of claudio.martella@gmail.com designates 209.85.210.175 as permitted sender) Received: from [209.85.210.175] (HELO mail-iy0-f175.google.com) (209.85.210.175) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 20 Dec 2011 15:53:37 +0000 Received: by iakh37 with SMTP id h37so6842872iak.6 for ; Tue, 20 Dec 2011 07:53:17 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:from:date:message-id:subject:to :content-type:content-transfer-encoding; bh=Kd/v9BRNKLCcf1BMZ0ajNP6zZYsl4yUnHURx9bAb3v0=; b=ZPjnYCwOJchBO6Ua/3uB8BIan72n5O6YdxgU3tR90KAK29ktnHRtHTzgxU4gmPPb4i tSjbUTKcZzA1WwsM7flune/4yI1mzaBy+xM/SxZxKQfCKdzOJAk9OYumdms7KeMg+7mR oqJW8SWXHxHgQWuM/qRW/YfjKB9bTv9kentfQ= Received: by 10.50.135.71 with SMTP id pq7mr2937695igb.26.1324396397169; Tue, 20 Dec 2011 07:53:17 -0800 (PST) MIME-Version: 1.0 Received: by 10.50.3.66 with HTTP; Tue, 20 Dec 2011 07:52:56 -0800 (PST) In-Reply-To: <20111220152906.236690@gmx.net> References: <20111220152906.236690@gmx.net> From: Claudio Martella Date: Tue, 20 Dec 2011 16:52:56 +0100 Message-ID: Subject: Re: Is there a global state I can use? To: giraph-user@incubator.apache.org Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable Hi, a general way of collecting data from all the vertices is using an Aggregator. An aggregator collects messages from all the vertices (who decide to write to it) and it can be read by all the vertices. You could easily implement your statistics from there. Aggregators are computed both on the workers and on the master, so it could be quite scalable. Hope it helps, Claudio On Tue, Dec 20, 2011 at 4:29 PM, wrote: > > Hi all, > > a plan to use Giraph for a use case where nodes send messages depending o= n some global distribution of a value. For instance, nodes have a numeric v= alue. Thus there is a global distribution of that value. Now I want all nod= es to take an action, i.e., send messages, that have a value in say the top= 1% of all values. > How could I do this? > Thinking in Hadoop MapReduce I'd use the distributed cache in order to ma= intain a fingerprint of the global distribution. > Would this work in giraph too? > > Thanks and BR! > christoph --=20 =A0 =A0Claudio Martella =A0 =A0claudio.martella@gmail.com