Return-Path: X-Original-To: apmail-hbase-user-archive@www.apache.org Delivered-To: apmail-hbase-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 75DFB18B1E for ; Thu, 20 Aug 2015 17:17:31 +0000 (UTC) Received: (qmail 50538 invoked by uid 500); 20 Aug 2015 17:17:29 -0000 Delivered-To: apmail-hbase-user-archive@hbase.apache.org Received: (qmail 50472 invoked by uid 500); 20 Aug 2015 17:17:29 -0000 Mailing-List: contact user-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hbase.apache.org Delivered-To: mailing list user@hbase.apache.org Received: (qmail 50460 invoked by uid 99); 20 Aug 2015 17:17:29 -0000 Received: from Unknown (HELO spamd2-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 20 Aug 2015 17:17:29 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd2-us-west.apache.org (ASF Mail Server at spamd2-us-west.apache.org) with ESMTP id A780D1AA7ED for ; Thu, 20 Aug 2015 17:17:28 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd2-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 3.879 X-Spam-Level: *** X-Spam-Status: No, score=3.879 tagged_above=-999 required=6.31 tests=[DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, FREEMAIL_REPLY=1, HTML_MESSAGE=3, RCVD_IN_MSPIKE_H3=-0.01, RCVD_IN_MSPIKE_WL=-0.01, SPF_PASS=-0.001] autolearn=disabled Authentication-Results: spamd2-us-west.apache.org (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com Received: from mx1-us-west.apache.org ([10.40.0.8]) by localhost (spamd2-us-west.apache.org [10.40.0.9]) (amavisd-new, port 10024) with ESMTP id tF0SGUC-ddn7 for ; Thu, 20 Aug 2015 17:17:26 +0000 (UTC) Received: from mail-qk0-f174.google.com (mail-qk0-f174.google.com [209.85.220.174]) by mx1-us-west.apache.org (ASF Mail Server at mx1-us-west.apache.org) with ESMTPS id 99A3C2100B for ; Thu, 20 Aug 2015 17:17:25 +0000 (UTC) Received: by qkep139 with SMTP id p139so18812942qke.3 for ; Thu, 20 Aug 2015 10:17:25 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; bh=rEycnO3X9iWa9V96HhIWj9za4a5UjVrF2j+Hpz2U/7A=; b=oDMWwTp3F/bcyM+yq7aDSzroGvH/qCv2YB1dT/SATX2jhGFPruiUiSwugahw5XZK6o p1RzSEtQ8r+3WI60S5Lps+8WfzyWu2wYuxpar6FyOc/Zp58HGoBkzPhl4xBL2PbNTiFW CRbYKf/EoSNuc7Ij9+2mqfCCL184Y4gO5Vyk7+rx4htQszQBb4OAfZoENUi9Uw4CEBn4 Ca1HHueM+3aB1iGZqG+c9hN0sf5zPPZMRwc43UXOTw6hxDMRpijSsQ6QOCAvhAv0R9Rk A6+zWoKC1Y8dWIQm1Lg3DOf0Fcjf7ZaY8pvhs1A6iDwp4IQz4Yc7SE4g0lzh7vRac16A uW1g== MIME-Version: 1.0 X-Received: by 10.55.217.23 with SMTP id u23mr7451503qki.58.1440091044915; Thu, 20 Aug 2015 10:17:24 -0700 (PDT) Received: by 10.140.83.234 with HTTP; Thu, 20 Aug 2015 10:17:24 -0700 (PDT) In-Reply-To: References: Date: Thu, 20 Aug 2015 10:17:24 -0700 Message-ID: Subject: Re: Major compaction skipping for older regions From: Vladimir Rodionov To: "user@hbase.apache.org" Content-Type: multipart/alternative; boundary=001a1147a08a2d2f2f051dc15363 --001a1147a08a2d2f2f051dc15363 Content-Type: text/plain; charset=UTF-8 Hey, this looks like a BUG. You may try this little hack (hbase-site.xml): hbase.hstore.min.locality.to.skip.major.compact=1.1 Any value above 1.0 should work -Vlad On Thu, Aug 20, 2015 at 5:04 AM, mukund murrali wrote: > Any update from anyone on this? We are invoking major compaction manually > and we see too many skipping of major compaction since we have MIN_VERSION > to 1. > > Thanks > > On Tue, Aug 11, 2015 at 5:09 PM, mukund murrali > wrote: > > > Yes. We have min version to be 1, hence I suppose there wont be any sort > > of time based compaction. After triggering major compaction we had only > the > > following logs > > > > compactions.RatioBasedCompactionPolicy: Selecting compaction from 3 store > > files, 0 compacting, 3 eligible, 50 blocking > > > > Thanks > > > > On Tue, Aug 11, 2015 at 3:25 PM, Ted Yu wrote: > > > >> bq. oldestTime -9223370597787064221ms > >> > >> This is due to minTimestamp missing from store file: > >> > >> Long minTimestamp = sf.getMinimumTimestamp(); > >> > >> long oldest = (minTimestamp == null) > >> > >> ? Long.MIN_VALUE > >> > >> : now - minTimestamp.longValue(); > >> > >> Can you pastebin log from RatioBasedCompactionPolicy in region server > log > >> after the manual compaction ? > >> > >> Thanks > >> > >> On Tue, Aug 11, 2015 at 2:06 AM, mukund murrali < > mukundmurrali9@gmail.com > >> > > >> wrote: > >> > >> > We are using hbase-1.0.0. The following logs appears for all regions > in > >> the > >> > regionserver > >> > > >> > 2015-08-08 14:01:51,586 DEBUG > [regionserver//R1:16020.compactionChecker] > >> > compactions.RatioBasedCompactionPolicy: Skipping major compaction of > >> > > >> > > >> > org.apache.hadoop.hbase.regionserver.compactions.ExploringCompactionPolicy@7bc4e8d8 > >> > because one (major) compacted file only, oldestTime > >> -9223370597787064221ms > >> > is < ttl=9223372036854775807 and blockLocalityIndex is 1.0 (min 0.0) > >> > > >> > > >> > Yes after manual triggering the deletes purged. But we don't want to > >> have > >> > it manual. Any other config to avoid such scenario? > >> > > >> > Thanks > >> > > >> > > >> > > >> > On Mon, Aug 10, 2015 at 8:01 PM, Ted Yu wrote: > >> > > >> > > What release of hbase are you using ? > >> > > > >> > > Can you pastebin region server log with DEBUG logging ? > >> > > > >> > > I guess you have tried issuing manual command. Did it work ? > >> > > > >> > > Thanks > >> > > > >> > > On Mon, Aug 10, 2015 at 7:02 AM, mukund murrali < > >> > mukundmurrali9@gmail.com> > >> > > wrote: > >> > > > >> > > > Any one help us in this :( Are we missing somewhere in the use > >> case? > >> > > None > >> > > > of the deleted cells are undergoing major compaction. > >> > > > > >> > > > Thanks > >> > > > > >> > > > On Wed, Aug 5, 2015 at 12:04 PM, mukund murrali < > >> > > mukundmurrali9@gmail.com> > >> > > > wrote: > >> > > > > >> > > > > Hi, > >> > > > > > >> > > > > We wanted to have deleted data for a week. So we configured > >> > > > > > >> > > > > MIN_VERSIONS => 1 > >> > > > > KEEP_DELETED_CELLS => TTL > >> > > > > TTL => 1 week. > >> > > > > > >> > > > > As per our understanding, after 1 week the deleted data becomes > >> > > available > >> > > > > for major compaction and should be purged (correct if wrong). > >> Since > >> > we > >> > > > have > >> > > > > time series data, we don't have any write operations in those > >> regions > >> > > > after > >> > > > > a week . But major compaction never took place for any regions > and > >> > our > >> > > > > overall size grew drastically though we have deletes happening. > >> After > >> > > > > analyzing, we found that major compaction takes place if any one > >> of > >> > > the 2 > >> > > > > condition is satisfied. > >> > > > > > >> > > > > 1. If the time interval between major compaction is greater > than a > >> > week > >> > > > > (default config). > >> > > > > 2. if the block locality index falls below a threshold. > >> > > > > > >> > > > > In our case, since we have min_versions to be 1, the first case > >> > > condition > >> > > > > fails. Time to verify is set to Long.Max value, if min versions > is > >> > not > >> > > 0. > >> > > > > > >> > > > > Second is block locality. To check the block locality index we > >> > enabled > >> > > > > fine logs. And we found the block locality is always 1, and we > >> got > >> > > logs > >> > > > > stating "Skipping major compaction......". > >> > > > > > >> > > > > So, in this case is manually triggering major compaction the > only > >> > > choice? > >> > > > > > >> > > > > Thanks > >> > > > > > >> > > > > >> > > > >> > > >> > > > > > --001a1147a08a2d2f2f051dc15363--