giraph-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Praveen Sripati <praveensrip...@gmail.com>
Subject Re: How does scaling work in Giraph?
Date Sat, 30 Jun 2012 17:47:26 GMT
Could someone respond to the below mail please?

Thanks,
Praveen

On Thu, Jun 28, 2012 at 7:04 PM, Praveen Sripati
<praveensripati@gmail.com>wrote:

>
> During the 24th minute of the recent Hadoop Summit Video [1] Avery Ching
> talks about how Giraph is made scalable. I am interested in Hama which is
> also based on the BSP model and would like to know more details on how
> Giraph is made scalable.
>
> Basically, at the end of each super step, the BSP tasks sends some metrics
> to the master and the master partitions the data in the most loaded BSP
> tasks and uses the free map available slot to process them.
>
> 1) Where is the code for the above logic? I am new to Giraph.
>
> 2) What is the logic behind the partitioning of the data in the master
> after the super step? Let's say that the data has been partitioned using
> Hash partitioning.
>
> 3) Similarly will Giraph also scale down? Will the partitions be merged?
>
> Thanks,
> Praveen
>
> [1] - http://www.youtube.com/watch?v=b5Qmz4zPj-M
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message