Return-Path: X-Original-To: apmail-hbase-issues-archive@www.apache.org Delivered-To: apmail-hbase-issues-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id DDD1F173C6 for ; Tue, 7 Oct 2014 05:01:34 +0000 (UTC) Received: (qmail 74873 invoked by uid 500); 7 Oct 2014 05:01:34 -0000 Delivered-To: apmail-hbase-issues-archive@hbase.apache.org Received: (qmail 74826 invoked by uid 500); 7 Oct 2014 05:01:34 -0000 Mailing-List: contact issues-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Delivered-To: mailing list issues@hbase.apache.org Received: (qmail 74815 invoked by uid 99); 7 Oct 2014 05:01:34 -0000 Received: from arcas.apache.org (HELO arcas.apache.org) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 07 Oct 2014 05:01:34 +0000 Date: Tue, 7 Oct 2014 05:01:34 +0000 (UTC) From: "Matt Corgan (JIRA)" To: issues@hbase.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Commented] (HBASE-12078) Missing Data when scanning using PREFIX_TREE DATA-BLOCK-ENCODING MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/HBASE-12078?page=3Dcom.atlassia= n.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=3D141= 61467#comment-14161467 ]=20 Matt Corgan commented on HBASE-12078: ------------------------------------- Wow! Thank you guys so much for debugging that. Sorry I was not much help= lately. > Missing Data when scanning using PREFIX_TREE DATA-BLOCK-ENCODING > ---------------------------------------------------------------- > > Key: HBASE-12078 > URL: https://issues.apache.org/jira/browse/HBASE-12078 > Project: HBase > Issue Type: Bug > Affects Versions: 0.98.6.1 > Environment: CentOS 6.3 > hadoop 2.5.0(hdfs) > hadoop 2.2.0(hbase) > hbase 0.98.6.1 > sun-jdk 1.7.0_67-b01 > Reporter: zhangduo > Assignee: zhangduo > Priority: Critical > Fix For: 2.0.0, 0.98.7, 0.99.1 > > Attachments: HBASE-12078-0.98.patch, HBASE-12078.patch, HBASE-120= 78_1.patch, prefix_tree_error.patch > > > our row key is combined with two ints, and we found that sometimes when w= e using only the first int part to scan, the result returned may missing so= me rows. But when we dump the whole hfile, the row is still there. > We have written a testcase to reproduce the bug. It works like this: > put 1-12345 > put 12345-0x01000000 > put 12345-0x01010000 > put 12345-0x02000000 > put 12345-0x02020000 > put 12345-0x03000000 > put 12345-0x03030000 > put 12345-0x04000000 > put 12345-0x04040000 > flush memstore > then scan using 12345=EF=BC=8Cthe returned row key will be 12345-0x200000= 00(12345-0x10000000 expected) -- This message was sent by Atlassian JIRA (v6.3.4#6332)