Return-Path: X-Original-To: apmail-hbase-user-archive@www.apache.org Delivered-To: apmail-hbase-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 85DA4933A for ; Wed, 19 Oct 2011 15:42:26 +0000 (UTC) Received: (qmail 97743 invoked by uid 500); 19 Oct 2011 15:42:24 -0000 Delivered-To: apmail-hbase-user-archive@hbase.apache.org Received: (qmail 97683 invoked by uid 500); 19 Oct 2011 15:42:24 -0000 Mailing-List: contact user-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hbase.apache.org Delivered-To: mailing list user@hbase.apache.org Received: (qmail 97667 invoked by uid 99); 19 Oct 2011 15:42:24 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 19 Oct 2011 15:42:24 +0000 X-ASF-Spam-Status: No, hits=-0.7 required=5.0 tests=RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of kranganathan@fb.com designates 67.231.153.30 as permitted sender) Received: from [67.231.153.30] (HELO mx0a-00082601.pphosted.com) (67.231.153.30) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 19 Oct 2011 15:42:17 +0000 Received: from pps.filterd (m0004060 [127.0.0.1]) by mx0b-00082601.pphosted.com (8.14.4/8.14.4) with SMTP id p9JFdo76019669; Wed, 19 Oct 2011 08:41:56 -0700 Received: from mail.thefacebook.com (corpout1.snc1.tfbnw.net [66.220.144.38]) by mx0b-00082601.pphosted.com with ESMTP id 10j0h7r36w-1 (version=TLSv1/SSLv3 cipher=AES128-SHA bits=128 verify=NOT); Wed, 19 Oct 2011 08:41:56 -0700 Received: from SC-MBX01-5.TheFacebook.com ([fe80::f05e:c7f1:c7dd:e640]) by sc-hub04.TheFacebook.com ([192.168.18.212]) with mapi id 14.01.0289.001; Wed, 19 Oct 2011 08:41:51 -0700 From: Karthik Ranganathan To: "user@hbase.apache.org" , Hbase Dev Subject: Re: Increase number of reducers for bulk data load to empty HBase table Thread-Topic: Increase number of reducers for bulk data load to empty HBase table Thread-Index: AQHMjXSJ5Lhcw+IgRkiF77vfOdgy5pWD0FAA Date: Wed, 19 Oct 2011 15:41:50 +0000 Message-ID: In-Reply-To: Accept-Language: en-US Content-Language: en-US X-MS-Has-Attach: X-MS-TNEF-Correlator: user-agent: Microsoft-MacOutlook/14.12.0.110505 x-originating-ip: [192.168.18.252] Content-Type: text/plain; charset="us-ascii" Content-ID: <7619C34D1D2C2F40BC63E938C4EBB4EF@fb.com> Content-Transfer-Encoding: quoted-printable MIME-Version: 1.0 X-Proofpoint-Virus-Version: vendor=fsecure engine=2.50.10432:5.4.6813,1.0.211,0.0.0000 definitions=2011-10-19_04:2011-10-19,2011-10-19,1970-01-01 signatures=0 X-Proofpoint-Spam-Reason: safe Hey Matthew, The only way to increase the number of reducers is to have more regions - each reducer produces an output per region, so the number of reducers =3D= =3D number of regions. Thanks Karthik On 10/18/11 2:00 AM, "Matthew Tovbin" wrote: >Hello, Guys, > >I'm willing to bulk load data from hdfs folders into HBase, for this >purpose >I used configureIncrementalLoad method from HFileOutputFormat that >configures the job, as follows: > >org.apache.hadoop.hbase.mapreduce.HFileOutputFormat.configureIncrementalLo >ad(job, >myTable) > >The problem is that destination table in HBase is empty, meaning it's only >hosted by one region server, so the resulted number of reducers is 1, >which >makes the job to run almost forever. > >How can I increase the number of reducers? Can the number of reducers be >set >to more than a number of region servers? > >Thanks in advance, > Matthew Tovbin.