Return-Path: X-Original-To: apmail-hadoop-hdfs-user-archive@minotaur.apache.org Delivered-To: apmail-hadoop-hdfs-user-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id A36FC18A83 for ; Fri, 21 Aug 2015 17:37:43 +0000 (UTC) Received: (qmail 93863 invoked by uid 500); 21 Aug 2015 17:37:39 -0000 Delivered-To: apmail-hadoop-hdfs-user-archive@hadoop.apache.org Received: (qmail 93728 invoked by uid 500); 21 Aug 2015 17:37:39 -0000 Mailing-List: contact user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hadoop.apache.org Delivered-To: mailing list user@hadoop.apache.org Received: (qmail 93718 invoked by uid 99); 21 Aug 2015 17:37:38 -0000 Received: from Unknown (HELO spamd4-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 21 Aug 2015 17:37:38 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd4-us-west.apache.org (ASF Mail Server at spamd4-us-west.apache.org) with ESMTP id 7A027C0332 for ; Fri, 21 Aug 2015 17:37:38 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd4-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 2.879 X-Spam-Level: ** X-Spam-Status: No, score=2.879 tagged_above=-999 required=6.31 tests=[DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, HTML_MESSAGE=3, RCVD_IN_MSPIKE_H3=-0.01, RCVD_IN_MSPIKE_WL=-0.01, SPF_PASS=-0.001] autolearn=disabled Authentication-Results: spamd4-us-west.apache.org (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com Received: from mx1-eu-west.apache.org ([10.40.0.8]) by localhost (spamd4-us-west.apache.org [10.40.0.11]) (amavisd-new, port 10024) with ESMTP id RUgtO9AzimlJ for ; Fri, 21 Aug 2015 17:37:37 +0000 (UTC) Received: from mail-yk0-f179.google.com (mail-yk0-f179.google.com [209.85.160.179]) by mx1-eu-west.apache.org (ASF Mail Server at mx1-eu-west.apache.org) with ESMTPS id A183420381 for ; Fri, 21 Aug 2015 17:37:36 +0000 (UTC) Received: by ykfw73 with SMTP id w73so77493800ykf.3 for ; Fri, 21 Aug 2015 10:37:35 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; bh=AvUSJ3B87ZhGhMyqZhFrSdeX7Xmdp7Y5Jyb0r+hz/XI=; b=q2x5kFemCeiFAGfno5YRAWoUVn6k8qgsG2HJmtKa+0DfzUBhv8+6AqrDjdmObl9oYQ z1lKqBeVivF/IylHbd6rP+wQbWcBjkS1p1quMgZhVSbbo4TrflQ4jq2xsAy4o85NLRuk s7chX/ghBojTvGikfa6NEKOAIffZ3ALZCvTOycN0GqGJ+OWIOA5GqLtSr4rNTZKIZb2V dcoR2IBhEck5UHn9lY/QAY/nNMX4q4kksqHwbhiEG6Y0UCtYsxtmuZTdPUgPApZ7C9V+ rGxMdwFVgxA6mUy/jCQdTdIjAyCIC9bXKVJYfnoTjjbyDo4eZC1fD2WCdfrpycSkF0AY 3vWg== MIME-Version: 1.0 X-Received: by 10.170.124.70 with SMTP id q67mr13146325ykb.65.1440178655580; Fri, 21 Aug 2015 10:37:35 -0700 (PDT) Received: by 10.37.22.70 with HTTP; Fri, 21 Aug 2015 10:37:35 -0700 (PDT) In-Reply-To: References: Date: Fri, 21 Aug 2015 13:37:35 -0400 Message-ID: Subject: Re: Chaining MapReduce From: Shahab Yunus To: "user@hadoop.apache.org" Content-Type: multipart/alternative; boundary=001a113925362dda44051dd5b970 --001a113925362dda44051dd5b970 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable What is the different between the mappers? Is the input data suppose to go to all mappers or it is dependent on the source data? Regards, Shahab On Fri, Aug 21, 2015 at 1:35 PM, =E2=98=BC R Nair (=E0=A4=B0=E0=A4=B5=E0=A4= =BF=E0=A4=B6=E0=A4=82=E0=A4=95=E0=A4=B0 =E0=A4=A8=E0=A4=BE=E0=A4=AF=E0=A4= =B0) < ravishankar.nair@gmail.com> wrote: > All, > > I have three mappers, followed by a reducer. I executed the map reduce > successfully. The reported output shows that number of mappers executed i= s > 1 and number of reducers is also 1. Though number of reducers are correct= , > won't we be getting number of mappers as 3 , since I have three mapper > classes connected by ChainMapper? > > O/P given below (snippet) :- > > Job Counters > Launched map tasks=3D1 > Launched reduce tasks=3D1 > Data-local map tasks=3D1 > Total time spent by all maps in occupied slots (ms)=3D8853 > Total time spent by all reduces in occupied slots (ms)=3D9900 > Total time spent by all map tasks (ms)=3D8853 > Total time spent by all reduce tasks (ms)=3D9900 > Total vcore-seconds taken by all map tasks=3D8853 > Total vcore-seconds taken by all reduce tasks=3D9900 > Total megabyte-seconds taken by all map tasks=3D9065472 > Total megabyte-seconds taken by all reduce tasks=3D10137600 > > > What I guess is, since the output is passing through Context, the interna= l > connected mappers are not caught by job counter, am I correct ? > > Best, Ravion > --001a113925362dda44051dd5b970 Content-Type: text/html; charset=UTF-8 Content-Transfer-Encoding: quoted-printable
What is the different between the mappers? Is the input da= ta suppose to go to all mappers or it is dependent on the source data?
=
Regards,
Shahab

On Fri, Aug 21, 2015 at 1:35 PM, =E2=98= =BC R Nair (=E0=A4=B0=E0=A4=B5=E0=A4=BF=E0=A4=B6=E0=A4=82=E0=A4=95=E0=A4=B0= =E0=A4=A8=E0=A4=BE=E0=A4=AF=E0=A4=B0) <ravishankar.nair@gmail.co= m> wrote:
=
All,

I have three mappers, followed by a reducer. I = executed the map reduce successfully. The reported output shows that number= of mappers executed is 1 and number of reducers is also 1. Though number o= f reducers are correct, won't we be getting number of mappers as 3 , si= nce I have three mapper classes connected by ChainMapper?

O/P given below (snippet) :-

Job Counters
=C2=A0=C2=A0=C2=A0 = =C2=A0=C2=A0=C2=A0 Launched map tasks=3D1
=C2=A0=C2=A0=C2=A0 =C2=A0=C2= =A0=C2=A0 Launched reduce tasks=3D1
=C2=A0=C2=A0=C2=A0 =C2=A0=C2=A0=C2= =A0 Data-local map tasks=3D1
=C2=A0=C2=A0=C2=A0 =C2=A0=C2=A0=C2=A0 Total= time spent by all maps in occupied slots (ms)=3D8853
=C2=A0=C2=A0=C2=A0= =C2=A0=C2=A0=C2=A0 Total time spent by all reduces in occupied slots (ms)= =3D9900
=C2=A0=C2=A0=C2=A0 =C2=A0=C2=A0=C2=A0 Total time spent by all ma= p tasks (ms)=3D8853
=C2=A0=C2=A0=C2=A0 =C2=A0=C2=A0=C2=A0 Total time spe= nt by all reduce tasks (ms)=3D9900
=C2=A0=C2=A0=C2=A0 =C2=A0=C2=A0=C2=A0= Total vcore-seconds taken by all map tasks=3D8853
=C2=A0=C2=A0=C2=A0 = =C2=A0=C2=A0=C2=A0 Total vcore-seconds taken by all reduce tasks=3D9900
= =C2=A0=C2=A0=C2=A0 =C2=A0=C2=A0=C2=A0 Total megabyte-seconds taken by all m= ap tasks=3D9065472
=C2=A0=C2=A0=C2=A0 =C2=A0=C2=A0=C2=A0 Total megabyte-= seconds taken by all reduce tasks=3D10137600


What I g= uess is, since the output is passing through Context, the internal connecte= d mappers are not caught by job counter, am I correct ?

<= /div>Best, Ravion

--001a113925362dda44051dd5b970--