hadoop-hdfs-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Thomas Bentsen ...@bentzn.com>
Subject Re: Performance
Date Mon, 24 Feb 2014 18:22:54 GMT
Thanks Dieter!
I'll look into it.

Still... It would be nice to hear something from the real world. Would
any of you working with Hadoop in a prod env be willing to share
something?

/th




On Mon, 2014-02-24 at 16:56 +0100, Dieter De Witte wrote:
> Hi,
> 
> The terasort benchmark is probably the most common. It has mappers and
> reducers doing 'nothing', this way you only use the framework's
> mergesort functionalities.
> 
> 
> Regards, Dieter
> 
> 
> 
> 2014-02-24 16:42 GMT+01:00 Thomas Bentsen <th@bentzn.com>:
>         Hi everyone
>         
>         I am still beginning Hadoop.
>         Is there any benchmarks or 'performance heuristics' for
>         Hadoop?
>         Is it possible to say something like 'You can process X lines
>         of GZipped
>         log file on a medium AWS server in Y minutes"? I would like to
>         get an
>         idea of what kind of workflow is possible.
>         
>         Thanks in advance
>         
>         Thomas Bentsen
>         
> 
> 



Mime
View raw message