hadoop-common-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Amandeep Khurana <ama...@gmail.com>
Subject Re: Memory config for Hadoop cluster
Date Fri, 05 Nov 2010 08:53:47 GMT
Right. I meant I'm not using fair or capacity scheduler. I'm getting out of
memory in some jobs and was trying to optimize the memory settings, number
of tasks etc. I'm running 0.20.2.

Why can't the mapred.job.map.memory.mb and mapred.job.reduce.memory.mb
be not put in the mapred-site.xml and just default to the equivalent cluster
baked if they are not set in the job either?


On Nov 5, 2010, at 1:43 AM, Hemanth Yamijala <yhemanth@gmail.com> wrote:


I'm not using any scheduler.. Dont have multiple jobs running at the same

time on the cluster.

That probably means you are using the default scheduler. Please note
that the default scheduler does not have the ability to schedule tasks
intelligently using the memory configuration parameters you specify.
Could you tell us what you'd like to achieve ?

The documentation here: http://bit.ly/cCbAab (and the link it has to
similar documentation in the Cluster Setup guide) will probably shed
more light on how the parameters should be used. Note that this is in
Hadoop 0.21, and the names of the parameters are different, though you
can see the correspondence with similar variables in Hadoop 0.20.



On Fri, Nov 5, 2010 at 12:21 AM, Hemanth Yamijala <yhemanth@gmail.com>wrote:


Which scheduler are you using ?



On Tue, Nov 2, 2010 at 2:44 AM, Amandeep Khurana <amansk@gmail.com> wrote:

How are the following configs supposed to be used?







These were included in 0.20 in HADOOP-5881.

Now, here's what I'm setting only the following out of the above in my




When I run job, I get the following error:

TaskTree [pid=1958,tipID=attempt_201011012101_0001_m_000000_0] is

running beyond memory-limits. Current usage : 1358553088bytes. Limit :

-1048576bytes. Killing task.

I'm not sure how it got the Limit as -1048576bytes... Also, what are the

cluster.max params supposed to be set as? Are they the max on the entire

cluster or on a particular node?


  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message