Message-ID: <50214A23.8040101@gmail.com>
Date: Tue, 07 Aug 2012 20:02:27 +0300
From: Ioakim Perros
To: user@hbase.apache.org
Subject: Re: Bulk load - #Reducers different from #Regions
References: <5021447B.5060000@gmail.com>

Excuse me for not defining the problem well.

I am bulk updating my HBase table through code, using the configureIncrementalLoad function of HFileOutputFormat. In its documentation I read that this function "Sets the number of reduce tasks to match the current number of regions", but I was wondering whether I could explicitly avoid that, perhaps through another way of bulk importing data.

PS: I insist on bulk importing because I have understood (I hope this is correct) that it is much more efficient than going through the traditional HBase API. Since my job needs to be iterative, bulk loading would hopefully give a good speed-up compared with the HBase API.

Thank you for responding.

On 08/07/2012 07:53 PM, Subir S wrote:
> Bulk load using ImportTsv with pre-split regions for the target table?
>
> Do you mean to set the number of reducers that ImportTsv must use?
>
> On 8/7/12, Ioakim Perros wrote:
>> Hi,
>>
>> I am bulk importing (updating) data iteratively and I would like to be
>> able to set the number of reducers of an M/R task to be different from
>> the number of regions of the table to which I am updating data.
>>
>> I tried it through job.setNumReduceTasks(#reducers), but the job ignored
>> it.
>>
>> Is there a way to avoid an intermediary job and to set the number of
>> reducers explicitly?
>> I would be grateful if anyone could shed light on this.
>>
>> Thanks and regards,
>> Ioakim
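
A note on why the reducer count follows the region count: as far as the documentation quoted above goes, configureIncrementalLoad ties each reduce task to one region of the target table, so that every reducer writes HFiles covering exactly one region's key range. The sketch below is a plain-Java toy model of that idea, not HBase code. The class name, the method names and the split keys "g", "n", "t" are all hypothetical and chosen only for illustration. It assumes row keys are compared as unsigned bytes in lexicographic order.

```java
import java.nio.charset.StandardCharsets;
import java.util.Arrays;
import java.util.List;

/**
 * Toy model (NOT HBase code) of partitioning row keys by region start keys.
 * All names here are hypothetical and exist only for this illustration.
 */
final class RegionPartitionSketch {

    // Start keys of every region except the first, in sorted order.
    private final byte[][] splitPoints;

    RegionPartitionSketch(List<String> sortedSplitKeys) {
        this.splitPoints = sortedSplitKeys.stream()
                .map(s -> s.getBytes(StandardCharsets.UTF_8))
                .toArray(byte[][]::new);
    }

    /** One partition (reducer) per region: number of split points plus one. */
    int numPartitions() {
        return splitPoints.length + 1;
    }

    /** Returns the index of the region whose key range contains rowKey. */
    int partitionFor(String rowKey) {
        byte[] key = rowKey.getBytes(StandardCharsets.UTF_8);
        int lo = 0;
        int hi = splitPoints.length;
        // Binary search for the number of split points that are <= key.
        while (lo < hi) {
            int mid = (lo + hi) >>> 1;
            if (Arrays.compareUnsigned(key, splitPoints[mid]) >= 0) {
                lo = mid + 1;
            } else {
                hi = mid;
            }
        }
        return lo;
    }

    public static void main(String[] args) {
        // Assumed layout: four regions split at "g", "n" and "t".
        RegionPartitionSketch p = new RegionPartitionSketch(List.of("g", "n", "t"));
        System.out.println("reducers needed: " + p.numPartitions());
        for (String row : List.of("apple", "g", "melon", "zebra")) {
            System.out.println(row + " -> region " + p.partitionFor(row));
        }
    }
}
```

In this model the number of partitions is fixed by the split points, so a reducer count chosen independently of the regions has nothing to map onto. That is consistent with an earlier job.setNumReduceTasks(...) call appearing to be ignored once configureIncrementalLoad has run, though the exact behaviour should be checked against the HBase version in use.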