Return-Path: X-Original-To: apmail-hbase-user-archive@www.apache.org Delivered-To: apmail-hbase-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 6C2709D04 for ; Fri, 25 May 2012 06:35:38 +0000 (UTC) Received: (qmail 24072 invoked by uid 500); 25 May 2012 06:35:36 -0000 Delivered-To: apmail-hbase-user-archive@hbase.apache.org Received: (qmail 24013 invoked by uid 500); 25 May 2012 06:35:36 -0000 Mailing-List: contact user-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hbase.apache.org Delivered-To: mailing list user@hbase.apache.org Received: (qmail 23989 invoked by uid 99); 25 May 2012 06:35:35 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 25 May 2012 06:35:35 +0000 X-ASF-Spam-Status: No, hits=4.7 required=5.0 tests=FREEMAIL_FORGED_REPLYTO,HTML_MESSAGE,RCVD_IN_DNSWL_NONE,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: local policy) Received: from [212.82.109.198] (HELO nm24-vm7.bullet.mail.ird.yahoo.com) (212.82.109.198) by apache.org (qpsmtpd/0.29) with SMTP; Fri, 25 May 2012 06:35:25 +0000 Received: from [77.238.189.56] by nm24.bullet.mail.ird.yahoo.com with NNFMP; 25 May 2012 06:35:05 -0000 Received: from [212.82.108.133] by tm9.bullet.mail.ird.yahoo.com with NNFMP; 25 May 2012 06:35:05 -0000 Received: from [127.0.0.1] by omp1038.mail.ird.yahoo.com with NNFMP; 25 May 2012 06:35:05 -0000 X-Yahoo-Newman-Property: ymail-3 X-Yahoo-Newman-Id: 206679.92624.bm@omp1038.mail.ird.yahoo.com Received: (qmail 5961 invoked by uid 60001); 25 May 2012 06:35:05 -0000 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=yahoo.com; s=s1024; t=1337927705; bh=GfPPuEUCxeQOco41Mh6fTlacxvYRhxxUPsGyoo/tzLQ=; h=X-YMail-OSG:Received:X-Mailer:Message-ID:Date:From:Reply-To:Subject:To:MIME-Version:Content-Type; b=abFN+18gBYChPhBhRuakUToDldDxMjHmAYG7KPqW9Wd+5kU+gTXyXBgwjosZdbzB3BV3U+GybS15fFeEewfDte+jXViVfFTVHM8VmuZrC0MPq82TtMDxgAFTJU4N8/M4i8hmgIdCt0erdWBRXj4JLifvA6WoTE8yKWYmvh2pMy4= DomainKey-Signature: a=rsa-sha1; q=dns; c=nofws; s=s1024; d=yahoo.com; h=X-YMail-OSG:Received:X-Mailer:Message-ID:Date:From:Reply-To:Subject:To:MIME-Version:Content-Type; b=cJNaN1I3Mb3iQAGxmiH4wq92BhuiIftXkvF4Ox735ebLTI972a1/ZrKuqllKNtG3QEJMCOqV4T4CYtJgkbaeiM4dzFoyV0UAyXTcd6n9qKr56G6XWzn3Iyo9XJHyWvV/yHZuY281TQOekObzxSk3fNL4LgvGLXJUNN4mpQyguek=; X-YMail-OSG: O2LWF3EVM1l6shxCvdyMtcnidC3Fxshn5bIuRNglxBybTDB QELubffJzKsIGR7ZU3oTnxNaY.xXuy3KP1VEbOPV0FQI4U1eIOljyRZg7MKb 3LH1gwmhDr4JjSLSqpyLyWUTGqMwcJ49Lon4iTN6jSmloYFPs8IOTAe6Eeyg J4dxATkCP4QpFXsvyd_.BP12.Ojud5lr_1ZiDWnk52LtokEZgMx35DJ6RO5K MBVzt.kZByp28ZUbDopT3kKrGqlvzVxSXy0pGgaIDSwr6fRXRrCYyGQoPhjB YHgfel.COy1TfoRN26Ncv1MhDqvyKtSzcnoi_GbkWQp7O5tJ_gPK4FKG6nAK jN0y.pE0vfcsjAsqEnk17WzlzPrqZtvY7bFkJr51fr16epAMY6_ZcnAkfn05 rkHPNNbrBxsx2.MXTTZgnL0G6BlEeBKFk5d9B0jXLMtuABjtdYgc39iYDwt. Ji4PdxXrqkIxs3I66Viv1hu20Wg5oqquBOOvQ2eejLQmDdYxoSmNBWGZU8Bb H.9DPFGZAwqHi3tE_VOV5AylHN2HL_UZ.zlc5OW_AkP__nRhwV6dTeSsVYxF BeWo_NVXSU71RsU0oWnLeGmTsLteazJAoPRfibQzfqvQ6DF3NftGo94sWG0A - Received: from [194.138.12.171] by web132104.mail.ird.yahoo.com via HTTP; Fri, 25 May 2012 07:35:05 BST X-Mailer: YahooMailWebService/0.8.118.349524 Message-ID: <1337927705.87668.YahooMailNeo@web132104.mail.ird.yahoo.com> Date: Fri, 25 May 2012 07:35:05 +0100 (BST) From: Florin P Reply-To: Florin P Subject: A question about HBase MapReduce To: "user@hbase.apache.org" MIME-Version: 1.0 Content-Type: multipart/alternative; boundary="-1910628564-2145564354-1337927705=:87668" ---1910628564-2145564354-1337927705=:87668 Content-Type: text/plain; charset=iso-8859-1 Content-Transfer-Encoding: quoted-printable Hello!=0A=0AI've read Lars George's blog http://www.larsgeorge.com/2009/05/= hbase-mapreduce-101-part-i.html where at the end of the article, he mention= ed "In the next post I will show you how to import data from a raw data=0Af= ile into a HBase table and how you eventually process the data in the=0AHBa= se table. We will address questions like how many mappers and/or=0Areducers= are needed and how can I improve import and processing=0Aperformance.". I = looked in the blog up for these questions, but it seems that there is no ar= ticle related. Do you knoe if he you touched these subjects into a differen= t post or book? Particular I am interested=A0 =0A=0A1. how you can set up t= he number of mappers?=0A2. number of mappers can be set up per region serve= r? If yes how?=0A3. How the big number of set up mappers can affect the dat= a locality?=0A4. is this algorithm for computing the number of mappers (htt= ps://issues.apache.org/jira/browse/HBASE-1172) still available=0A"Currently= ,=0Athe number of mappers specified when using TableInputFormat is strictly= =0Afollowed if less than total regions on the input table. If greater, the= =0Anumber of regions is used.=0AThis will modify the splitting algorithm to= do the following:=0A=09* Specify 0 mappers when you want # mappers =3D # r= egions=0A=09* If you specify fewer mappers than regions, will use exactly t= he number you specify based on the current algorithm=0A=09* If=0Ayou specif= y more mappers than regions, will divide regions up by=0Adetermining [start= ,X) [X,end). The number of mappers will always be a=0Amultiple of number of= regions. This is so we do not have scanners=0Aspanning multiple regions.= =0AThere is an additional issue in that the default number of mappers=0Ain = JobConf is set to 1. That means if a user does not explicitly set=0Anumber = of map tasks, a single mapper will be used. "=0A=0AI'll look forward for yo= u answers. Thank you.=0A=0AKind regards,=A0Florin ---1910628564-2145564354-1337927705=:87668--