hadoop-common-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Shrinivas Joshi <jshrini...@gmail.com>
Subject Re: Benchmarking pipelined MapReduce jobs
Date Wed, 23 Feb 2011 03:01:46 GMT
I am not sure about this but you might want to take a look at the GridMix
config file. FWIU, it lets you define the # of jobs for different workloads
and categories.


On Tue, Feb 22, 2011 at 10:46 AM, David Saile <david@uni-koblenz.de> wrote:

> Hello everybody,
> I am trying to benchmark a Hadoop-cluster with regards to throughput of
> pipelined MapReduce jobs.
> Looking for benchmarks, I found the "Gridmix" benchmark that is supplied
> with Hadoop. In its README-file it says that part of this benchmark is a
> "Three stage map/reduce job".
> As this seems to match my needs, I was wondering if it possible to
> configure "Gridmix", in order to only run this job (without the rest of the
> "Gridmix" benchmark)?
> Or do I have to build my own benchmark? If this is the case, which classes
> are used by this "Three stage map/reduce job"?
> Thanks for any help!
> David

  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message