Return-Path: X-Original-To: apmail-hbase-user-archive@www.apache.org Delivered-To: apmail-hbase-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 8EFD080DB for ; Thu, 15 Sep 2011 05:14:52 +0000 (UTC) Received: (qmail 33623 invoked by uid 500); 15 Sep 2011 05:14:51 -0000 Delivered-To: apmail-hbase-user-archive@hbase.apache.org Received: (qmail 33519 invoked by uid 500); 15 Sep 2011 05:14:51 -0000 Mailing-List: contact user-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hbase.apache.org Delivered-To: mailing list user@hbase.apache.org Received: (qmail 33510 invoked by uid 99); 15 Sep 2011 05:14:50 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 15 Sep 2011 05:14:50 +0000 X-ASF-Spam-Status: No, hits=-0.7 required=5.0 tests=FREEMAIL_FROM,RCVD_IN_DNSWL_LOW,SPF_PASS,T_TO_NO_BRKTS_FREEMAIL X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of saint.ack@gmail.com designates 209.85.216.176 as permitted sender) Received: from [209.85.216.176] (HELO mail-qy0-f176.google.com) (209.85.216.176) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 15 Sep 2011 05:14:43 +0000 Received: by qyk36 with SMTP id 36so2826419qyk.14 for ; Wed, 14 Sep 2011 22:14:22 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=mime-version:sender:in-reply-to:references:date :x-google-sender-auth:message-id:subject:from:to:content-type; bh=8lEBlhRurMphsGui6qwj3R3wx77fjwrOIe62TTOfIN0=; b=jpcsTJD6JHlClczwV12F/PgfaPLd+GkxjVWfjRgr9wgfl9SIO64HJz8hF3v/+nUyPo HzB8b9Vpl4+vjq/Ms1l3+Pbf4mPaI0CCQmYI2vuFjSdx9You4MfT/srDNflVQ3GU+rCP 9RRLD8f8JYNVLt/d72paQ13CJb1mcJD77CY9k= MIME-Version: 1.0 Received: by 10.224.176.4 with SMTP id bc4mr599124qab.6.1316063662799; Wed, 14 Sep 2011 22:14:22 -0700 (PDT) Sender: saint.ack@gmail.com Received: by 10.224.74.1 with HTTP; Wed, 14 Sep 2011 22:14:22 -0700 (PDT) In-Reply-To: References: Date: Wed, 14 Sep 2011 22:14:22 -0700 X-Google-Sender-Auth: aPRW6DzfTIpx82N6tCT76qWDnWg Message-ID: Subject: Re: HBase ImportTSV From: Stack To: user@hbase.apache.org Content-Type: text/plain; charset=ISO-8859-1 X-Virus-Checked: Checked by ClamAV on apache.org Do you know your keyspace roughly? Try creating a pre-split table with as many regions as you want reducers. St.Ack On Wed, Sep 14, 2011 at 8:25 PM, rajesh balamohan wrote: > ImportTSV internally uses HFileOutputFormat.configureIncrementalLoad(job, > table); > > However, for newly created tables there would not be any keys available. > Hence, it launches 1 reducer by default. > > Is there a way to increase the number of reducers for high volume imports > like 500+ GB. > > ~Rajesh.B > > On Thu, Sep 15, 2011 at 8:51 AM, rajesh balamohan wrote: > >> Hi All, >> >> ImportTSV is a great tool for bulk loading the data into HBASE. >> >> I have close to 500+GB of raw data which I would like to import into a >> newly created HTABLE. If I go ahead with ImportTSV, it creates only one >> reducer which is a bottleneck in terms of sorting and shuffling. >> >> Are there any other way, I can increase the number of reducers while doing >> bulk loads for new table?. >> >> ~Rajesh.B >> >