From: yypvsxf19870706
Subject: Re: Increase the number of mappers in PM mode
Date: Sat, 16 Mar 2013 12:46:12 +0800
To: "user@hadoop.apache.org"
Cc: "user@hadoop.apache.org"

Hi,

I think I have got it. Thank you.

Sent from my iPhone

On 2013-3-15, at 18:32, Zheyi RONG wrote:

> Indeed, you cannot explicitly set the number of mappers, but you can still gain some control over it by setting mapred.max.split.size or mapred.min.split.size.
>
> For example, if you have a 10 GB file (10737418240 B) and you would like 10 mappers, then each mapper has to deal with 1 GB of data.
> According to "splitsize = max(minimumSize, min(maximumSize, blockSize))", you can set mapred.min.split.size=1073741824 (1 GB), i.e.
> $hadoop jar -Dmapred.min.split.size=1073741824 yourjar yourargs
>
> It is well explained in this thread: http://stackoverflow.com/questions/9678180/change-file-split-size-in-hadoop.
>
> Regards,
> Zheyi.
>
> On Fri, Mar 15, 2013 at 8:49 AM, YouPeng Yang wrote:
>> s
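The split-size rule quoted in this message can be checked with a few lines of arithmetic. The following is only a rough sketch of that formula, not Hadoop's actual code: the function names are hypothetical, the 64 MB block size and the "effectively unlimited" default for the maximum split size are assumptions that do not come from the thread, and the real input format may produce a slightly different number of splits for the last chunk of a file.

```python
MIB = 1024 ** 2
GIB = 1024 ** 3

# Assumed values, not taken from the thread.
ASSUMED_BLOCK_SIZE = 64 * MIB   # assumed HDFS block size
ASSUMED_NO_MAX = 2 ** 63 - 1    # assumed "no limit" maximum split size


def compute_split_size(min_size: int, max_size: int, block_size: int) -> int:
    """splitsize = max(minimumSize, min(maximumSize, blockSize))"""
    return max(min_size, min(max_size, block_size))


def estimate_num_splits(file_size: int, split_size: int) -> int:
    """Approximate mapper count: ceiling of file_size / split_size."""
    return (file_size + split_size - 1) // split_size


if __name__ == "__main__":
    file_size = 10 * GIB  # the 10737418240 B file from the example

    # Raising mapred.min.split.size to 1 GB gives larger splits, fewer mappers.
    big = compute_split_size(1 * GIB, ASSUMED_NO_MAX, ASSUMED_BLOCK_SIZE)
    print("min=1GB   ->", estimate_num_splits(file_size, big), "splits")

    # Lowering mapred.max.split.size below the block size gives more mappers.
    small = compute_split_size(1, 32 * MIB, ASSUMED_BLOCK_SIZE)
    print("max=32MB  ->", estimate_num_splits(file_size, small), "splits")
```

Under these assumptions, the minimum split size only takes effect when it exceeds the block size, and the maximum split size only takes effect when it is smaller than the block size. So to increase the number of mappers, the setting to lower is mapred.max.split.size.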