Return-Path: X-Original-To: apmail-hbase-user-archive@www.apache.org Delivered-To: apmail-hbase-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 7735310908 for ; Thu, 27 Feb 2014 16:37:41 +0000 (UTC) Received: (qmail 65128 invoked by uid 500); 27 Feb 2014 16:37:36 -0000 Delivered-To: apmail-hbase-user-archive@hbase.apache.org Received: (qmail 65011 invoked by uid 500); 27 Feb 2014 16:37:36 -0000 Mailing-List: contact user-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hbase.apache.org Delivered-To: mailing list user@hbase.apache.org Received: (qmail 64999 invoked by uid 99); 27 Feb 2014 16:37:36 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 27 Feb 2014 16:37:36 +0000 X-ASF-Spam-Status: No, hits=1.5 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_HELO_PASS,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of Troy.Bryant@telus.com designates 208.38.59.78 as permitted sender) Received: from [208.38.59.78] (HELO orkaan.nssi.telus.com) (208.38.59.78) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 27 Feb 2014 16:37:29 +0000 DomainKey-Signature: s=orkaan.nssi; d=telus.com; c=nofws; q=dns; h=X-IronPort-Anti-Spam-Filtered: X-IronPort-Anti-Spam-Result:X-IronPort-AV:Received: Received:From:To:Date:Subject:Thread-Topic:Thread-Index: Message-ID:Accept-Language:Content-Language: X-MS-Has-Attach:X-MS-TNEF-Correlator:acceptlanguage: Content-Type:MIME-Version; b=XPVswGl6KdYuMLeefoe4+CzyiApeEJq6e6xYotznXNsPgaosBTxMYT4K HUAjtO+Tp+voaJYYCBP4Kn0mP3uDu+WysQTNlOOvBdw92pxSjlaByk+es u5wn24haEo5rNSaHnI8v59sJi9ecBgfF4zliRPBpT9fOCoCkFxkKchfxR E=; X-IronPort-Anti-Spam-Filtered: true X-IronPort-Anti-Spam-Result: AjkFAMpoD1OOsmOS/2dsb2JhbABagmUhpgmcFIEbFnSCLC0mOAEMCWsmAQQbh3EBmliwDReNcTODW4EUBIlKlWCLOINNgUg X-IronPort-AV: E=Sophos;i="4.97,555,1389744000"; d="scan'208,217";a="305747333" Received: from unknown (HELO WP40081.corp.ads) ([142.178.99.146]) by orkaan-o.nssi.telus.com with ESMTP/TLS/AES128-SHA; 27 Feb 2014 16:37:07 +0000 Received: from WP41072.corp.ads ([fe80::782e:6557:8fc4:def7]) by WP40081.corp.ads ([::1]) with mapi; Thu, 27 Feb 2014 09:37:06 -0700 From: Troy Bryant To: "user@hbase.apache.org" Date: Thu, 27 Feb 2014 09:37:05 -0700 Subject: Row Deletion by Column Value Thread-Topic: Row Deletion by Column Value Thread-Index: Ac8z2iLL2BY2P+WqTueadEvS6wFNPQ== Message-ID: <5B153F3C89024C4AAD13F8A6D2CAC0B02AB47883B7@WP41072.corp.ads> Accept-Language: en-US Content-Language: en-US X-MS-Has-Attach: X-MS-TNEF-Correlator: acceptlanguage: en-US Content-Type: multipart/alternative; boundary="_000_5B153F3C89024C4AAD13F8A6D2CAC0B02AB47883B7WP41072corpad_" MIME-Version: 1.0 X-Virus-Checked: Checked by ClamAV on apache.org --_000_5B153F3C89024C4AAD13F8A6D2CAC0B02AB47883B7WP41072corpad_ Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: quoted-printable Hi all, Using 0.92.2 We're looking into custom garbage collection methods. Due to some business= logic, we'd like to be able to delete rows based on the value of one of th= e columns, these deletes can be eventual rather than immediate. We have wr= itten a Map Reduce job that works, but we aren't sure if it's fast enough i= n the long run. I have two questions: Would it be possible to implement a coprocessor that would essentially do t= he column value check during a major compaction, and only write rows that p= ass the check? I'm not sure this is feasible because based on what I under= stand, the reads occur at the key-value level and not the row level. Since our deletes can be eventual, would it be possible/faster to just tomb= stone the rows rather than delete them during our map reduce job, and let t= he major compaction handle the actual deletion? If I'm not mistaken addDel= eteMarker would be the method for this. Thanks for your time. Troy Bryant --_000_5B153F3C89024C4AAD13F8A6D2CAC0B02AB47883B7WP41072corpad_--