Return-Path: X-Original-To: apmail-hadoop-hdfs-user-archive@minotaur.apache.org Delivered-To: apmail-hadoop-hdfs-user-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 8F24E10FC4 for ; Mon, 6 May 2013 15:28:18 +0000 (UTC) Received: (qmail 68650 invoked by uid 500); 6 May 2013 15:28:13 -0000 Delivered-To: apmail-hadoop-hdfs-user-archive@hadoop.apache.org Received: (qmail 68529 invoked by uid 500); 6 May 2013 15:28:13 -0000 Mailing-List: contact user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hadoop.apache.org Delivered-To: mailing list user@hadoop.apache.org Received: (qmail 68522 invoked by uid 99); 6 May 2013 15:28:13 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 06 May 2013 15:28:13 +0000 X-ASF-Spam-Status: No, hits=1.8 required=5.0 tests=FREEMAIL_ENVFROM_END_DIGIT,HTML_MESSAGE,MIME_QP_LONG_LINE,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of yypvsxf19870706@gmail.com designates 209.85.210.51 as permitted sender) Received: from [209.85.210.51] (HELO mail-da0-f51.google.com) (209.85.210.51) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 06 May 2013 15:28:05 +0000 Received: by mail-da0-f51.google.com with SMTP id h15so1815149dan.24 for ; Mon, 06 May 2013 08:27:44 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=x-received:subject:references:from:content-type:x-mailer :in-reply-to:message-id:date:to:content-transfer-encoding :mime-version; bh=yw9DFa7esSW8meyo2IGF5taWrA5Y8uLdepPOQY0KwYc=; b=bZsfJjTfP4LTVak2i5Cmqyu2hRvlwQY9ixUWaCXqruiDSDSw5c19CVHJl+cR+gUJZ7 O00UUMKmK/LBpbXocf2v09KvVkzLCYD0RAke8TSkmjeWXWH33m3Ovje+fJzRKJRhG26/ UjCcYyRzxbcZKXLfBb6OrI5skEFjJ3SP6whn4SaHXXkTg7RP66Jgf50Gly+DDJbGP+e9 /0SPRvnr/qEmvo140KoPRNjcseniyvs4XHq8Or21TdkZwi1gLwwXSrXPTpGwOd+W7fdC NY3zVJakuKh8dmDI0FkTk51Cg2Kb9HU6DlW5+Te5asjS3oCmD+bMx8Txch6xI4awyvAZ +xEw== X-Received: by 10.68.112.1 with SMTP id im1mr21018210pbb.116.1367854064492; Mon, 06 May 2013 08:27:44 -0700 (PDT) Received: from [192.168.1.100] ([221.225.88.98]) by mx.google.com with ESMTPSA id cc15sm1610131pac.1.2013.05.06.08.27.42 for (version=TLSv1 cipher=ECDHE-RSA-RC4-SHA bits=128/128); Mon, 06 May 2013 08:27:43 -0700 (PDT) Subject: Re: Uber Job! References: From: yypvsxf19870706 Content-Type: multipart/alternative; boundary=Apple-Mail-1FB6E353-A049-46FE-B60D-6F84CE7A3FFD X-Mailer: iPhone Mail (10B146) In-Reply-To: Message-Id: <0EFB395D-6BB2-4362-91EF-E1B9CCD23E3D@gmail.com> Date: Mon, 6 May 2013 23:25:52 +0800 To: "user@hadoop.apache.org" Content-Transfer-Encoding: 7bit Mime-Version: 1.0 (1.0) X-Virus-Checked: Checked by ClamAV on apache.org --Apple-Mail-1FB6E353-A049-46FE-B60D-6F84CE7A3FFD Content-Type: text/plain; charset=GB2312 Content-Transfer-Encoding: quoted-printable Hi Suppose that your input file are 10 with total size 64mb , I think you w= ill get the 10 maps. By the ways,the uber mode is only for yarn . Suppose you have actually 1= map ,yarn will at least create two containers , one for app master and the o= ther for the map , if uber mode is enabled with the yarn , yarn will only cr= eate 1 container for both app master and the map.=20 =20 =B7=A2=D7=D4=CE=D2=B5=C4 iPhone =D4=DA 2013-5-6=A3=AC22:45=A3=ACRahul Bhattacharjee =D0=B4=B5=C0=A3=BA > Hi, >=20 > I was going through the definition of Uber Job of Hadoop. >=20 > A job is considered uber when it has 10 or less maps , one reducer and the= complete data is less than one dfs block size. >=20 > I have some doubts here- >=20 > Splits are created as per the dfs block size.Creating 10 mappers are possi= ble from one block of data by some settings change (changing the max split s= ize). But trying to understand , why would some job need to run around 10 ma= ps for 64 MB of data. > One thing may be that the job is immensely CUP intensive. Will it be a cor= rect assumption? or is there is any other reason for this. >=20 > Thanks, > Rahul >=20 >=20 --Apple-Mail-1FB6E353-A049-46FE-B60D-6F84CE7A3FFD Content-Type: text/html; charset=utf-8 Content-Transfer-Encoding: quoted-printable
Hi

  &nb= sp; Suppose that your input file are 10 with total size 64mb , I think you w= ill get the 10 maps.

    By the ways,the u= ber mode is only for yarn . Suppose you have actually 1 map ,yarn will at le= ast create two containers , one for app master and the other for the map , i= f uber mode is enabled with the yarn , yarn will only create 1 container for= both app master and the map. 
    

=E5= =8F=91=E8=87=AA=E6=88=91=E7=9A=84 iPhone

=E5=9C=A8 2013-5-6=EF= =BC=8C22:45=EF=BC=8CRahul Bhattacharjee <rahul.rec.dgp@gmail.com> =E5=86=99=E9=81=93=EF=BC=9A
Hi,
I was going through the definition of Uber Job of H= adoop.

A job is considered uber when it has 10 or less maps= , one reducer and the complete data is less than one dfs block size.

I have some doubts here-

= Splits are created as per the dfs block size.Creating 10 mappers are possibl= e from one block of data by some settings change (changing the max split siz= e). But trying to understand , why would some job need to run around 10 maps= for 64 MB of data.
One thing may be that the job is immensely CUP intensive= . Will it be a correct assumption? or is there is any other reason for this.=

Thanks,
Rahul


= --Apple-Mail-1FB6E353-A049-46FE-B60D-6F84CE7A3FFD--