Return-Path: X-Original-To: apmail-hbase-issues-archive@www.apache.org Delivered-To: apmail-hbase-issues-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 6A3CDD893 for ; Sat, 30 Jun 2012 20:14:45 +0000 (UTC) Received: (qmail 61975 invoked by uid 500); 30 Jun 2012 20:14:45 -0000 Delivered-To: apmail-hbase-issues-archive@hbase.apache.org Received: (qmail 61943 invoked by uid 500); 30 Jun 2012 20:14:45 -0000 Mailing-List: contact issues-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Delivered-To: mailing list issues@hbase.apache.org Received: (qmail 61933 invoked by uid 99); 30 Jun 2012 20:14:45 -0000 Received: from issues-vm.apache.org (HELO issues-vm) (140.211.11.160) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 30 Jun 2012 20:14:45 +0000 Received: from isssues-vm.apache.org (localhost [127.0.0.1]) by issues-vm (Postfix) with ESMTP id 82B4F1418B6 for ; Sat, 30 Jun 2012 20:14:44 +0000 (UTC) Date: Sat, 30 Jun 2012 20:14:44 +0000 (UTC) From: "Jonathan Hsieh (JIRA)" To: issues@hbase.apache.org Message-ID: <964221238.75773.1341087284537.JavaMail.jiratomcat@issues-vm> In-Reply-To: <1324594057.23772.1304576103124.JavaMail.tomcat@hel.zones.apache.org> Subject: [jira] [Commented] (HBASE-3855) Performance degradation of memstore because reseek is linear MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/HBASE-3855?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13404612#comment-13404612 ] Jonathan Hsieh commented on HBASE-3855: --------------------------------------- No response for half a week, going to punt so I can cut an RC. > Performance degradation of memstore because reseek is linear > ------------------------------------------------------------ > > Key: HBASE-3855 > URL: https://issues.apache.org/jira/browse/HBASE-3855 > Project: HBase > Issue Type: Improvement > Reporter: dhruba borthakur > Priority: Blocker > Fix For: 0.90.7 > > Attachments: memstoreReseek.txt, memstoreReseek2.txt > > > The scanner use reseek to find the next row (or next column) as part of a scan. The reseek code iterates over a Set to position itself at the right place. If there are many thousands of kvs that need to be skipped over, then the time-cost is very high. In this case, a seek would be far lesser in cost than a reseek. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira