Mailing-List: contact common-user-help@hadoop.apache.org; run by ezmlm
Precedence: bulk
Reply-To: common-user@hadoop.apache.org
Received-SPF: pass (nike.apache.org: domain of lleung@ddn.com designates
 74.62.46.229 as permitted sender)
From: Leo Leung <lleung@ddn.com>
To: "common-user@hadoop.apache.org" <common-user@hadoop.apache.org>
Date: Fri, 11 May 2012 10:48:32 -0700
Subject: RE: Question on MapReduce
Thread-Topic: Question on MapReduce
Thread-Index: Ac0vlmaHWHcuVFyWSOaFrumDhXBFQQABuYyA
Message-ID: 
 <DF5E339AC1B1EB40B8C0E673B245B1DEA7B0FA64C6@MAILBOXCLUSTER.datadirect.datadirectnet.com>
References: 
 <CA+Omw9gwdJhA3HHyR4WTK=2Zpm3r4Chk867ReLQ_yXRU4UDZAg@mail.gmail.com>
In-Reply-To: 
 <CA+Omw9gwdJhA3HHyR4WTK=2Zpm3r4Chk867ReLQ_yXRU4UDZAg@mail.gmail.com>
Accept-Language: en-US
Content-Language: en-US
acceptlanguage: en-US
Content-Type: text/plain; charset="us-ascii"
Content-Transfer-Encoding: quoted-printable
MIME-Version: 1.0

Nope, you must tune the config on that specific super node to have more M/R=
 slots (this is for 1.0.x)
This does not mean the JobTracker will be eager to stuff that super node wi=
th all the M/R jobs at hand.

It still goes through the scheduler,  Capacity Scheduler is most likely wha=
t you have.  (check your config)

IMO, If the data locality is not going to be there, your cluster is going t=
o suffer from Network I/O.


-----Original Message-----
From: Satheesh Kumar [mailto:nkseam@gmail.com]=20
Sent: Friday, May 11, 2012 9:51 AM
To: common-user@hadoop.apache.org
Subject: Question on MapReduce

Hi,

I am a newbie on Hadoop and have a quick question on optimal compute vs.
storage resources for MapReduce.

If I have a multiprocessor node with 4 processors, will Hadoop schedule hig=
her number of Map or Reduce tasks on the system than on a uni-processor sys=
tem? In other words, does Hadoop detect denser systems and schedule denser =
tasks on multiprocessor systems?

If yes, will that imply that it makes sense to attach higher capacity stora=
ge to store more number of blocks on systems with dense compute?

Any insights will be very useful.

Thanks,
Satheesh