From: Linlin Du
To: "user@hbase.apache.org"
Subject: about region merging
Date: Wed, 12 Mar 2014 10:40:50 -0300

Hi all,

Our use of HBase is not ideal: in many cases a timestamp is used as the prefix of the row keys, and it is too late to fix that now. As more data comes in, new regions keep being split off from the existing regions.
After the retention period expires, many of those regions are almost empty. It would be desirable to have these empty regions deleted.

Here are my questions:

1. If a timestamp is used in the key (maybe not as the prefix, e.g., with salting) and there is a retention period, then after a region's data expires, the region becomes empty and is never used again. Is that right?
2. How can I find out when a region was last modified or read?
3. Is there an easy way to merge small regions into a big one?

Thanks.
Linlin
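
The scenario in question 1 can be sketched in plain Python. This is not HBase code: the zero-padded key layout, the split points, the retention window, and the function names (`row_key`, `region_index`, `dead_regions`) are all made up for illustration. It only models how sorted key ranges route writes, assuming timestamps only ever increase.

```python
from bisect import bisect_right


def row_key(timestamp, suffix):
    # Hypothetical key layout: zero-padded timestamp prefix, so that
    # lexicographic key order matches time order.
    return f"{timestamp:010d}-{suffix}"


def region_index(split_points, key):
    # Region i covers [split_points[i-1], split_points[i]); the first region
    # has no start key and the last region has no end key.
    return bisect_right(split_points, key)


def dead_regions(split_points, now, retention):
    # A region is "dead" once its end key is at or below the retention cutoff:
    # every key it can hold is expired, and new keys sort after it.
    cutoff = row_key(now - retention, "")
    return [i for i, end in enumerate(split_points) if end <= cutoff]


# Illustrative split points at timestamps 100, 200 and 300 (four regions).
splits = [row_key(t, "") for t in (100, 200, 300)]

# A write at time 350 lands in the last region.
current = region_index(splits, row_key(350, "sensor-a"))

# With a retention of 100 time units, regions ending at or before t=250
# contain only expired keys.
expired = dead_regions(splits, now=350, retention=100)
```

In this model the regions listed in `expired` never receive another write, because every new key sorts after their end key. With a salt prefix the same reasoning would apply separately inside each salt bucket. The sketch says nothing about what HBase itself does with such regions.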