hama-dev mailing list archives

From "Edward J. Yoon" <edwardy...@apache.org>
Subject Re: Benchmarks: Hama vs. Spark
Date Sun, 11 Aug 2013 23:49:03 GMT
Moreover, there is still room for improvement in the BSP-based KMeans
algorithm. I'll describe it in a new JIRA ticket.

On Mon, Aug 12, 2013 at 7:57 AM, Yexi Jiang <yexijiang@gmail.com> wrote:
> That's cool!
>
>
> 2013/8/11 Edward J. Yoon <edwardyoon@apache.org>
>
>> Here are some interesting benchmarks (by Leonidas Fegaras) showing off
>> the performance of Hama compared to Spark. Pagerank and KMeans
>> were run via MRQL queries, which are not as fast as native BSP code.
>> Moreover, Hama 0.5 is very slow. I've started to think that the latest
>> Hama may be faster than Spark. :-)
>>
>> ----
>> On laptop with 8 cores:
>>                      Hama 0.5   Spark
>> Pagerank 500K/2M:         211     341
>> KMeans 1M:                 31      22
>> KMeans 2M:                 41      40
>> KMeans 4M:                165      77
>>
>> On cluster with 64 cores:
>>                      Hama 0.5   Spark
>> Pagerank 1M/10M:         3590     428
>> KMeans 10M:                87      82
>> KMeans 20M:               129     134
>>
>> On cluster with 32 cores:
>>                      Hama 0.5   Spark
>> Pagerank 1M/10M:         4419     434
>> KMeans 10M:                98      74
>> KMeans 20M:               273      74
>>
>>
>> --
>> Best Regards, Edward J. Yoon
>> @eddieyoon
>>
>
>
>
> --
> ------
> Yexi Jiang,
> ECS 251,  yjian004@cs.fiu.edu
> School of Computer and Information Science,
> Florida International University
> Homepage: http://users.cis.fiu.edu/~yjian004/



-- 
Best Regards, Edward J. Yoon
@eddieyoon
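
For anyone who wants to compare the quoted numbers side by side, here is a
minimal Python sketch that computes the Hama 0.5 / Spark ratio for each run.
The figures are copied from the tables in the quoted message; the thread does
not state the unit, so only ratios are computed. The names BENCHMARKS,
hama_to_spark_ratio and summarize are illustrative and do not come from Hama,
Spark or MRQL.

```python
# Timings copied from the quoted tables as (Hama 0.5, Spark).
# The unit is not stated in the thread, so only ratios are meaningful here.
BENCHMARKS = {
    "laptop, 8 cores": {
        "Pagerank 500K/2M": (211, 341),
        "KMeans 1M": (31, 22),
        "KMeans 2M": (41, 40),
        "KMeans 4M": (165, 77),
    },
    "cluster, 64 cores": {
        "Pagerank 1M/10M": (3590, 428),
        "KMeans 10M": (87, 82),
        "KMeans 20M": (129, 134),
    },
    "cluster, 32 cores": {
        "Pagerank 1M/10M": (4419, 434),
        "KMeans 10M": (98, 74),
        "KMeans 20M": (273, 74),
    },
}


def hama_to_spark_ratio(hama, spark):
    """Return Hama time divided by Spark time (below 1.0 means Hama was faster)."""
    return hama / spark


def summarize(benchmarks):
    """Flatten the nested tables into (setup, benchmark, ratio) rows."""
    rows = []
    for setup, runs in benchmarks.items():
        for name, (hama, spark) in runs.items():
            rows.append((setup, name, round(hama_to_spark_ratio(hama, spark), 2)))
    return rows


if __name__ == "__main__":
    for setup, name, ratio in summarize(BENCHMARKS):
        print(f"{setup:18} {name:18} Hama/Spark = {ratio:.2f}")
```

A ratio below 1.0 means the Hama 0.5 run had the smaller number in the table.
These are MRQL-on-Hama-0.5 results, so they say nothing directly about native
BSP code or newer Hama releases.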
