Return-Path: X-Original-To: apmail-hbase-dev-archive@www.apache.org Delivered-To: apmail-hbase-dev-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 2A1D2468A for ; Mon, 11 Jul 2011 18:07:44 +0000 (UTC) Received: (qmail 95466 invoked by uid 500); 11 Jul 2011 18:07:43 -0000 Delivered-To: apmail-hbase-dev-archive@hbase.apache.org Received: (qmail 95389 invoked by uid 500); 11 Jul 2011 18:07:43 -0000 Mailing-List: contact dev-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@hbase.apache.org Delivered-To: mailing list dev@hbase.apache.org Received: (qmail 95381 invoked by uid 99); 11 Jul 2011 18:07:42 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 11 Jul 2011 18:07:42 +0000 X-ASF-Spam-Status: No, hits=-0.7 required=5.0 tests=FREEMAIL_FROM,RCVD_IN_DNSWL_LOW,SPF_PASS,T_TO_NO_BRKTS_FREEMAIL X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of jheitmann@gmail.com designates 74.125.83.169 as permitted sender) Received: from [74.125.83.169] (HELO mail-pv0-f169.google.com) (74.125.83.169) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 11 Jul 2011 18:07:36 +0000 Received: by pvc12 with SMTP id 12so3739586pvc.14 for ; Mon, 11 Jul 2011 11:07:15 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=content-type:mime-version:subject:from:in-reply-to:date :content-transfer-encoding:message-id:references:to:x-mailer; bh=jVNvcEdvLLYKJ1HXzYiiy7k8E7rAqXp4QInv/MA6AyM=; b=J7Wfb4xZHKfytXWeaFVKqfjL+8PiBeBdt0MkdcA3cz24IYCv0PKpbE8zqslu401FIV td8fKJ+zkwOIh9h72zBuIqvtGIHUjnctFjXxNgTnfRqmnH1L7EkCY642vATXfn+wXrpk cLpdH7AYd48Ew6PEtZcQ7nmbF5qXqSOYnXmL8= Received: by 10.68.14.131 with SMTP id p3mr2386141pbc.131.1310407635766; Mon, 11 Jul 2011 11:07:15 -0700 (PDT) Received: from c82a140bd58e.amazon.com (207-171-191-60.amazon.com [207.171.191.60]) by mx.google.com with ESMTPS id n8sm8400663pbh.73.2011.07.11.11.07.13 (version=SSLv3 cipher=OTHER); Mon, 11 Jul 2011 11:07:14 -0700 (PDT) Content-Type: text/plain; charset=iso-8859-1 Mime-Version: 1.0 (Apple Message framework v1244.3) Subject: Re: Region "auto-split" From: John Heitmann In-Reply-To: Date: Mon, 11 Jul 2011 11:07:12 -0700 Content-Transfer-Encoding: quoted-printable Message-Id: <024452DA-2F3A-4E74-9E64-FDA9D5FAC91D@gmail.com> References: <30D4476CB62BAE4589FD646476C29CD20167E7EA@SZXEML505-MBX.china.huawei.com> To: dev@hbase.apache.org X-Mailer: Apple Mail (2.1244.3) Apologies for the "me too" message, but: me too. I've been seeing this = for a month or two on my trunk-based setup. I looked for log messages, = but things looked pretty silent, and when I looked at the code my = impression was identical to Michael's in that I didn't see a plausible = path for it to work outside of extreme circumstances.=20 Now that I look at it again, it seems like auto-splitting is initiated = only if 1) There are too many store files per region and 2) The max = blocking time hasn't been exceeded. It looks like the requestSplit call = should be hoisted a bit in MemStoreFlusher.flushRegion(). John On Jul 11, 2011, at 9:17 AM, Stack wrote: > If you have an individual storefile > 256M and its not splitting > something is wrong. Check the logs on the regionserver carrying your > single region. See if any clue therein. >=20 > St.Ack >=20 >=20 > On Mon, Jul 11, 2011 at 5:54 AM, Michael Morello > wrote: >> Hi, >>=20 >> I use the default values, so hbase.hregion.max.filesize should be 256 >> Mo as defined in HConstants. >>=20 >> Regards, >>=20 >> 2011/7/11 Zhoushuaifeng : >>> What's your settings of this : hbase.hregion.max.filesize ? >>> If your setting is more than 1GB, and total size in the region less = than this , your region will not split. >>>=20 >>=20 >>> -----Original Message----- >>> From: Michael Morello [mailto:michael.morello@gmail.com] >>> Sent: Monday, July 11, 2011 8:28 PM >>> To: dev@hbase.apache.org >>> Subject: Region "auto-split" >>>=20 >>> Hello, >>>=20 >>> I'm testing the trunk revision of HBase and there is something that >>> seems strange to me regarding the "auto-split" region feature, here = is >>> my test case : >>> I generate a lot of data with a simple client into a newly created >>> table with no predefined region. Everything is stored in the first >>> "default" region but this region is never splitted (I currently have = a >>> region with a single file store of more than 1Go). >>>=20 >>> It seems that it is the responsability of the MemStoreFlusher to = call >>> the CompactSplitThread (line 359 of MemStoreFlusher) and schedule a >>> split, but this part is never called. >>> More precisely one of the condition that is expected to be true in >>> order to schedule a split is that isTooManyStoreFiles(region) is = true, >>> this could never happen. >>>=20 >>> Is it an expected behaviour or is it a known problem ? >>=20