Return-Path: X-Original-To: apmail-incubator-giraph-dev-archive@minotaur.apache.org Delivered-To: apmail-incubator-giraph-dev-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 8A5DD7B3E for ; Mon, 21 Nov 2011 20:44:06 +0000 (UTC) Received: (qmail 53091 invoked by uid 500); 21 Nov 2011 20:44:06 -0000 Delivered-To: apmail-incubator-giraph-dev-archive@incubator.apache.org Received: (qmail 53066 invoked by uid 500); 21 Nov 2011 20:44:06 -0000 Mailing-List: contact giraph-dev-help@incubator.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: giraph-dev@incubator.apache.org Delivered-To: mailing list giraph-dev@incubator.apache.org Received: (qmail 53058 invoked by uid 99); 21 Nov 2011 20:44:05 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 21 Nov 2011 20:44:05 +0000 X-ASF-Spam-Status: No, hits=-0.7 required=5.0 tests=FREEMAIL_FROM,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of claudio.martella@gmail.com designates 209.85.210.175 as permitted sender) Received: from [209.85.210.175] (HELO mail-iy0-f175.google.com) (209.85.210.175) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 21 Nov 2011 20:43:59 +0000 Received: by iahk25 with SMTP id k25so7756376iah.6 for ; Mon, 21 Nov 2011 12:43:39 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=mime-version:from:date:message-id:subject:to:content-type :content-transfer-encoding; bh=NjEtBQ2+ESwJHR9RUXosuywRpp3itHCNgRqrR0HFqSk=; b=wfBAqE9oZBp0yfaEtWhluzO676u1OmmOtEEbgnCMTEIKVQGHOuCz7Kofla7kRsvIFG vbrHLu12UeLEh5lX6gSagFvWOykmzs4GKvPjxTqftR71Dcbr/sy7yiLrSyuT6UzpD5wL HFDUMBx2bsugyduuVQzRbaf/4desWBKiNRzFs= Received: by 10.42.161.70 with SMTP id s6mr16091342icx.40.1321908219088; Mon, 21 Nov 2011 12:43:39 -0800 (PST) MIME-Version: 1.0 Received: by 10.50.3.66 with HTTP; Mon, 21 Nov 2011 12:43:18 -0800 (PST) From: Claudio Martella Date: Mon, 21 Nov 2011 21:43:18 +0100 Message-ID: Subject: Apache Giraph talk @ FOSDEM To: giraph-dev@incubator.apache.org Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable Hi devs, FOSDEM has announced a devroom completely dedicated to Graph Processing: https://lists.fosdem.org/pipermail/fosdem/2011-November/001344.html I'm going to submit for a talk there. Here's the draft, feedback is welcome= :) Title: "Apache Giraph: distributed graph processing in the cloud." Abstract: Web and online social graphs have been rapidly growing in size and scale during the past decade. In 2008, Google estimated that the number of web pages reached over a trillion. Online social networking and email sites, including Yahoo!, Google, Microsoft, Facebook, LinkedIn, and Twitter, have hundreds of millions of users and are expected to grow much more in the future. Processing these graphs plays a big role in relevant and personalized information for users, such as results from a search engine or news in an online social networking site. The Apache Giraph (http://incubator.apache.org/giraph) project is a faul-tolerant in-memory distributed graph processing system which runs on top of a standard Hadoop cluster and is capable of running any standard Bulk Synchronous Parallel (BSP) operation over any large generic data set which can be represented as a graph. Apache Giraph is a loose implementation of Google Pregel. Giraph entered the ASF Incubator in July 2011, where it has enlisted the aid of committers from Yahoo!, Facebook, LinkedIn, and Twitter. The talk will present why running MapReduce jobs for graph processing can be a problem, introducing the reason why Google designed Pregel at first place. Later, the BSP model will be presented focusing on how it can be used to implement a distributed graph processing engine. The last part of the talk will be dedicated to Apache Giraph, with a description of the programming model (i.e. the API, some typical examples such as PageRank and Single Source Shortest Path) along with a technical overview of how the architecture of Giraph works and how it leverages the Hadoop infrastructure. Best, Claudio --=20 =A0 =A0Claudio Martella =A0 =A0claudio.martella@gmail.com