Return-Path: X-Original-To: apmail-hbase-user-archive@www.apache.org Delivered-To: apmail-hbase-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 739111069F for ; Sun, 23 Mar 2014 19:56:51 +0000 (UTC) Received: (qmail 43694 invoked by uid 500); 23 Mar 2014 19:56:48 -0000 Delivered-To: apmail-hbase-user-archive@hbase.apache.org Received: (qmail 42892 invoked by uid 500); 23 Mar 2014 19:56:45 -0000 Mailing-List: contact user-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hbase.apache.org Delivered-To: mailing list user@hbase.apache.org Received: (qmail 42781 invoked by uid 99); 23 Mar 2014 19:56:42 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 23 Mar 2014 19:56:42 +0000 X-ASF-Spam-Status: No, hits=-0.0 required=5.0 tests=SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of vrodionov@carrieriq.com designates 204.235.122.16 as permitted sender) Received: from [204.235.122.16] (HELO obmail.carrieriq.com) (204.235.122.16) by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 23 Mar 2014 19:56:38 +0000 From: Vladimir Rodionov To: "user@hbase.apache.org" Date: Sun, 23 Mar 2014 12:51:52 -0700 Subject: RE: Effect of region size on compaction performance Thread-Topic: Effect of region size on compaction performance Thread-Index: Ac9GO4cqitD2iGeUT3SxwGpb6CJCRAAlc7Af Message-ID: References: In-Reply-To: Accept-Language: en-US Content-Language: en-US X-MS-Has-Attach: X-MS-TNEF-Correlator: acceptlanguage: en-US x-kse-antivirus-interceptor-info: scan successful x-kse-antivirus-info: Clean Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: quoted-printable MIME-Version: 1.0 X-Virus-Checked: Checked by ClamAV on apache.org How small is small and how large is large? Recommended region size is usually between 5-10GB. Too small regions result= s in more frequent flushes/compactions and have additional overhead in RS RAM. >>I am thinking about extending TableInputFormat to override the >>1-map-per-region default policy as an alternative. This looks better approach. Best regards, Vladimir Rodionov Principal Platform Engineer Carrier IQ, www.carrieriq.com e-mail: vrodionov@carrieriq.com ________________________________________ From: David Koch [ogdude@googlemail.com] Sent: Saturday, March 22, 2014 6:58 PM To: user@hbase.apache.org Subject: Effect of region size on compaction performance Hello, We run M/Rs over several HBase tables at the same time and chose to reduce region sizes in order to make map tasks faster and improve map-slot turnaround between the concurrent jobs. However, I am worried many regions will cause longer overall compactions of the HBase data. Is this the case? I am thinking about extending TableInputFormat to override the 1-map-per-region default policy as an alternative. Regards, /David Confidentiality Notice: The information contained in this message, includi= ng any attachments hereto, may be confidential and is intended to be read o= nly by the individual or entity to whom this message is addressed. If the r= eader of this message is not the intended recipient or an agent or designee= of the intended recipient, please note that any review, use, disclosure or= distribution of this message or its attachments, in any form, is strictly = prohibited. If you have received this message in error, please immediately= notify the sender and/or Notifications@carrieriq.com and delete or destroy= any copy of this message and its attachments.