Return-Path: Delivered-To: apmail-hbase-issues-archive@www.apache.org Received: (qmail 94534 invoked from network); 29 Jul 2010 18:56:41 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 29 Jul 2010 18:56:41 -0000 Received: (qmail 46225 invoked by uid 500); 29 Jul 2010 18:56:41 -0000 Delivered-To: apmail-hbase-issues-archive@hbase.apache.org Received: (qmail 46165 invoked by uid 500); 29 Jul 2010 18:56:40 -0000 Mailing-List: contact issues-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Delivered-To: mailing list issues@hbase.apache.org Received: (qmail 46157 invoked by uid 99); 29 Jul 2010 18:56:40 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 29 Jul 2010 18:56:40 +0000 X-ASF-Spam-Status: No, hits=-2000.0 required=10.0 tests=ALL_TRUSTED X-Spam-Check-By: apache.org Received: from [140.211.11.22] (HELO thor.apache.org) (140.211.11.22) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 29 Jul 2010 18:56:38 +0000 Received: from thor (localhost [127.0.0.1]) by thor.apache.org (8.13.8+Sun/8.13.8) with ESMTP id o6TIuGS6017925 for ; Thu, 29 Jul 2010 18:56:17 GMT Message-ID: <9691605.74051280429776741.JavaMail.jira@thor> Date: Thu, 29 Jul 2010 14:56:16 -0400 (EDT) From: "Alexander Georgiev (JIRA)" To: issues@hbase.apache.org Subject: [jira] Updated: (HBASE-2823) Entire Row Deletes not stored in Row+Col Bloom In-Reply-To: <5444306.267311278620809482.JavaMail.jira@thor> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 X-Virus-Checked: Checked by ClamAV on apache.org [ https://issues.apache.org/jira/browse/HBASE-2823?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexander Georgiev updated HBASE-2823: -------------------------------------- Status: Patch Available (was: Open) Release Note: Just a minor fix Tags: hbase When a Delete Row is issued on a row with row+col bloom filter, some of the columns might not be deleted. Since a Delete Row is just Delete Family applied to all columns, if a file doesn't contain the column we are searching for it might end up unaffected. In order to ensure the file will be included, the row together with row+col are added in the bloom. Then shouldSeek() checks both row and row+col if the bloom is row+col (BloomType.ROWCOL). That adds additional false positives, which are taken into account with dividing the error rate the user requires by two. Added new test that checks this in TestHRegion.java. Dumped the contents of the StoreFile in order to ensure that the bloom filter has row as a value when using ROWCOL blooms. > Entire Row Deletes not stored in Row+Col Bloom > ---------------------------------------------- > > Key: HBASE-2823 > URL: https://issues.apache.org/jira/browse/HBASE-2823 > Project: HBase > Issue Type: Bug > Components: filters, regionserver > Affects Versions: 0.90.0 > Reporter: Nicolas Spiegelberg > Assignee: Alexander Georgiev > Fix For: 0.90.0 > > > If the user issues a Row Delete on an family with Row+Col blooms, that information is not currently detected by shouldSeek(). Possible known solutions are: > 1. adding Row as Bloom Filter Key on Row Delete, shouldSeek() should do both a Row & Row+Col query for Row+Col filters. > 2. keep delete information in a separate storage element. > #1 seems like the best solution, but need to investigate further and fix this problem. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.