From: Stack
To: Hbase-User
Date: Thu, 5 Jun 2014 08:11:25 -0700
Subject: Re: HBase export limit bandwith

> On Jun 4, 2014, at 7:39 AM, Damien Hardy wrote:
>
> > Hello,
> >
> > We are trying to export an HBase table to S3 for backup purposes.
> > By default the export tool runs one map task per region, and we want to
> > limit outbound bandwidth to the internet (to Amazon S3).
> >
> > We were thinking of adding some reducers to limit the number of writers,
> > but the reducer count is explicitly hardcoded to 0 in the Export class:
> > ```
> > // No reducers. Just write straight to output files.
> > job.setNumReduceTasks(0);
> > ```
> >

Echoing Michael Segel, why not subclass and set reducers to whatever you want in your subclass?

But you probably don't want reducers anyway. The output from your mappers would have to be sorted and fed to the reducers, which puts a load on your cluster; that capacity would be better spent moving the data to S3.

Or limit the number of mappers you have running at any one time via configuration, or, in a subclass, limit the rate at which they write?

St.Ack
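
To make the "limit the rate at which they write" option concrete, here is a minimal sketch of a per-task throttle. It is plain Java with no Hadoop or HBase dependency; the class name `WriteThrottle` and its methods are hypothetical and not part of Export or any HBase API. It assumes the subclass can tell how many bytes each write is about to emit.

```java
/**
 * Hypothetical helper (not part of HBase or Hadoop): keeps the average
 * write rate of one task at or below a fixed number of bytes per second.
 */
final class WriteThrottle {
    private final long bytesPerSecond;
    private final long startNanos;
    private long totalBytes;

    WriteThrottle(long bytesPerSecond) {
        if (bytesPerSecond <= 0) {
            throw new IllegalArgumentException("bytesPerSecond must be positive");
        }
        this.bytesPerSecond = bytesPerSecond;
        this.startNanos = System.nanoTime();
    }

    /**
     * Milliseconds to pause so that totalBytes spread over the elapsed
     * time does not exceed bytesPerSecond. Returns 0 when under budget.
     */
    static long pauseMillis(long totalBytes, long elapsedMillis, long bytesPerSecond) {
        // Time the bytes written so far should have taken at the allowed rate.
        long budgetMillis = totalBytes * 1000L / bytesPerSecond;
        return Math.max(0L, budgetMillis - elapsedMillis);
    }

    /** Record a write of the given size and sleep if the task is ahead of its budget. */
    synchronized void acquire(long bytes) throws InterruptedException {
        totalBytes += bytes;
        long elapsedMillis = (System.nanoTime() - startNanos) / 1_000_000L;
        long pause = pauseMillis(totalBytes, elapsedMillis, bytesPerSecond);
        if (pause > 0) {
            Thread.sleep(pause);
        }
    }
}
```

A mapper subclass would create one `WriteThrottle` per task and call `acquire(n)` around each write of `n` bytes. The aggregate rate toward S3 is then roughly the per-task limit multiplied by the number of map tasks running at once, so this pairs naturally with capping concurrent mappers.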