From: Vladimir Rodionov
Date: Wed, 22 Mar 2017 12:36:05 -0700
Subject: Re: Need guidance on Custom Compaction Policy
To: "user@hbase.apache.org"

Older files will be purged by the default HBase compactor if all the data
inside them has expired (do you have a TTL set for the data?).

As for a custom compaction policy, you can refer to the FIFOCompactionPolicy
class to get an idea of how custom compaction works.

-Vlad

On Wed, Mar 22, 2017 at 12:29 PM, jeff saremi wrote:

> I mentioned some of this in another thread. We have a read-only database
> which gets bulk loaded using HFiles.
> We want to keep only two versions/generations of data. Since the size of
> the data is massive, we need to delete the older generation.
>
> Since we write one single HFile for each region for each CF for each
> generation, could we just yank the older files using a separate standalone
> process? (sounds a little scary)
>
> If not, could we write a custom compactor? What's involved (some pointers
> please)? Thanks
>
> Jeff
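
To make the TTL point in the reply concrete, here is a minimal standalone sketch of the selection rule: a file can be dropped whole only when every cell in it has expired, which is the same as saying its newest cell is older than the TTL. This is a toy model, not HBase code. The `StoreFile` class, the `select_expired` function, and the generation names are hypothetical and exist only for illustration; the real FIFOCompactionPolicy class should be read directly for the actual logic.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class StoreFile:
    """Hypothetical stand-in for an HFile; not an HBase class."""
    name: str
    max_timestamp_ms: int  # timestamp of the newest cell in the file


def select_expired(files: List[StoreFile], ttl_ms: int, now_ms: int) -> List[StoreFile]:
    """Return the files whose newest cell is already older than the TTL.

    If the newest cell has expired, every other cell in the file has
    expired too, so the whole file can be removed without rewriting it.
    """
    cutoff = now_ms - ttl_ms
    return [f for f in files if f.max_timestamp_ms < cutoff]


if __name__ == "__main__":
    day = 24 * 60 * 60 * 1000
    now = 10 * day
    # Three bulk-loaded generations (assumed example data).
    files = [
        StoreFile("gen-1", max_timestamp_ms=1 * day),
        StoreFile("gen-2", max_timestamp_ms=6 * day),
        StoreFile("gen-3", max_timestamp_ms=9 * day),
    ]
    # With a 5-day TTL only the oldest generation is fully expired.
    for f in select_expired(files, ttl_ms=5 * day, now_ms=now):
        print(f.name)
```

In this model the TTL is the only knob: choosing it so that exactly the unwanted generations fall behind the cutoff is what would leave the two newest generations in place.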