From: Sai Sai
To: user@hadoop.apache.org
Date: Sat, 13 Apr 2013 09:45:28 +0800 (SGT)
Subject: Re: 100K Maps scenario

Just a follow-up to see if anyone can shed some light on this:
My understanding is that after each block is replicated 3 times, a map task is run on each of the replicas in parallel.
What I am trying to double-check is this: if a file is split into 10K, 100K, or more blocks, that would result in at least 300K map tasks being run, which looks like overkill from a performance, or simply a logical, perspective.
I would appreciate any thoughts on this.
Thanks,
Sai

________________________________
From: Sai Sai <saigraph@yahoo.in>
To: "user@hadoop.apache.org" <user@hadoop.apache.org>; Sai Sai <saigraph@yahoo.in>
Sent: Friday, 12 April 2013 1:37 PM
Subject: Re: Does a Map task run 3 times on 3 TTs or just once

Just wondering if it is right to assume that a map task is run 3 times on 3 different TTs (TaskTrackers) in parallel, and that the output of whichever one finishes first is picked up and written to the intermediate location.
Or is it true that a map task, even though its data is replicated 3 times, will run only once, with the other 2 on standby: if the first fails, the second one will run, followed by the third if the second mapper also fails.
Please shed some light.
Thanks,
Sai
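
To make the two scenarios raised in this thread concrete, here is a minimal Python sketch of the arithmetic behind them. It only compares the task counts implied by each assumption and makes no claim about which one Hadoop actually follows. The function name `map_task_count` and its parameters are hypothetical, introduced purely for illustration.

```python
def map_task_count(num_blocks: int, replication: int, per_replica: bool) -> int:
    """Return the number of map tasks implied by one of the two scenarios.

    per_replica=True  models the first scenario: one map task per replica,
                      so the count is num_blocks * replication.
    per_replica=False models the second scenario: one map task per block,
                      with the other replicas used only if a task fails.
    """
    if num_blocks < 0:
        raise ValueError("num_blocks must be non-negative")
    if replication < 1:
        raise ValueError("replication must be at least 1")
    return num_blocks * replication if per_replica else num_blocks


if __name__ == "__main__":
    # Block counts taken from the message: 10K and 100K, replication factor 3.
    for blocks in (10_000, 100_000):
        per_replica = map_task_count(blocks, 3, per_replica=True)
        per_block = map_task_count(blocks, 3, per_replica=False)
        print(f"{blocks} blocks: per-replica={per_replica}, per-block={per_block}")
```

Under the per-replica assumption, the 100K-block case yields the 300K figure mentioned above; under the run-once assumption the count stays equal to the number of blocks.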

