Return-Path: X-Original-To: apmail-giraph-dev-archive@www.apache.org Delivered-To: apmail-giraph-dev-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 50C24DD2D for ; Sat, 30 Jun 2012 17:47:54 +0000 (UTC) Received: (qmail 27024 invoked by uid 500); 30 Jun 2012 17:47:54 -0000 Delivered-To: apmail-giraph-dev-archive@giraph.apache.org Received: (qmail 26990 invoked by uid 500); 30 Jun 2012 17:47:54 -0000 Mailing-List: contact dev-help@giraph.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@giraph.apache.org Delivered-To: mailing list dev@giraph.apache.org Received: (qmail 26980 invoked by uid 99); 30 Jun 2012 17:47:54 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 30 Jun 2012 17:47:54 +0000 X-ASF-Spam-Status: No, hits=1.5 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of praveensripati@gmail.com designates 209.85.212.52 as permitted sender) Received: from [209.85.212.52] (HELO mail-vb0-f52.google.com) (209.85.212.52) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 30 Jun 2012 17:47:47 +0000 Received: by vbzb23 with SMTP id b23so3226460vbz.11 for ; Sat, 30 Jun 2012 10:47:26 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; bh=2BbQH5mFK92tux+qbclAmFLVzeRcTK3HzT3pzYt0m5s=; b=Y8ls20a2nURd/kujBTZjaNyyWqDgMRlwG5Pu5fw2FWzEDfR2jgSdvZMmq/yAm6vJfW e8zt2Ou0bJG5SP4sAFZHpHaiJmAwHiKGMCltqULz/DpuKxy8pVzi67oBivnOmo8JHRRT sp2tJT9x2LzP6YKT39Wv3JxysgkIobbO8Aq7IVdUm/xPxvyKhHTF+Z4FzBult5aQ8NGj +g6X4XBplYP69762Kf77/LBBnirRnUmu7zmGh/8uA+3h6DI+bKNAe2F6JBqr+kW3oMaH ffHtJJPkI2Za7rhhZoKC/ykb/sAZdJDsXvnK41AUatkwgrybDVUZuJlvpZ0/1bJ6uyj3 SMhg== MIME-Version: 1.0 Received: by 10.52.174.52 with SMTP id bp20mr2735661vdc.29.1341078446613; Sat, 30 Jun 2012 10:47:26 -0700 (PDT) Received: by 10.52.98.35 with HTTP; Sat, 30 Jun 2012 10:47:26 -0700 (PDT) In-Reply-To: References: Date: Sat, 30 Jun 2012 23:17:26 +0530 Message-ID: Subject: Re: How does scaling work in Giraph? From: Praveen Sripati To: dev@giraph.apache.org Content-Type: multipart/alternative; boundary=bcaec51b18e36d229b04c3b4286e X-Virus-Checked: Checked by ClamAV on apache.org --bcaec51b18e36d229b04c3b4286e Content-Type: text/plain; charset=ISO-8859-1 Could someone respond to the below mail please? Thanks, Praveen On Thu, Jun 28, 2012 at 7:04 PM, Praveen Sripati wrote: > > During the 24th minute of the recent Hadoop Summit Video [1] Avery Ching > talks about how Giraph is made scalable. I am interested in Hama which is > also based on the BSP model and would like to know more details on how > Giraph is made scalable. > > Basically, at the end of each super step, the BSP tasks sends some metrics > to the master and the master partitions the data in the most loaded BSP > tasks and uses the free map available slot to process them. > > 1) Where is the code for the above logic? I am new to Giraph. > > 2) What is the logic behind the partitioning of the data in the master > after the super step? Let's say that the data has been partitioned using > Hash partitioning. > > 3) Similarly will Giraph also scale down? Will the partitions be merged? > > Thanks, > Praveen > > [1] - http://www.youtube.com/watch?v=b5Qmz4zPj-M > --bcaec51b18e36d229b04c3b4286e--