Return-Path: X-Original-To: apmail-hbase-issues-archive@www.apache.org Delivered-To: apmail-hbase-issues-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id ECD1A17C28 for ; Wed, 20 May 2015 14:39:00 +0000 (UTC) Received: (qmail 38739 invoked by uid 500); 20 May 2015 14:39:00 -0000 Delivered-To: apmail-hbase-issues-archive@hbase.apache.org Received: (qmail 38698 invoked by uid 500); 20 May 2015 14:39:00 -0000 Mailing-List: contact issues-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Delivered-To: mailing list issues@hbase.apache.org Received: (qmail 38686 invoked by uid 99); 20 May 2015 14:39:00 -0000 Received: from arcas.apache.org (HELO arcas.apache.org) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 20 May 2015 14:39:00 +0000 Date: Wed, 20 May 2015 14:39:00 +0000 (UTC) From: "Jean-Marc Spaggiari (JIRA)" To: issues@hbase.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Updated] (HBASE-13721) Improve shell scan performances when using LIMIT MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/HBASE-13721?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jean-Marc Spaggiari updated HBASE-13721: ---------------------------------------- Status: Patch Available (was: Open) > Improve shell scan performances when using LIMIT > ------------------------------------------------ > > Key: HBASE-13721 > URL: https://issues.apache.org/jira/browse/HBASE-13721 > Project: HBase > Issue Type: Bug > Components: shell > Affects Versions: 1.1.0 > Reporter: Jean-Marc Spaggiari > Assignee: Jean-Marc Spaggiari > Attachments: HBASE-13721-v0-trunk.txt > > > When doing a scan which is expected to return the exact same number of rows as the LIMIT we give, we still scan the entire table until we return the row(s) and then test the numbers of rows we have. This can take a lot of time. > Example: > scan 'sensors', { COLUMNS => ['v:f92acb5b-079a-42bc-913a-657f270a3dc1'], STARTROW => '000a', LIMIT => 1 } > This is because we will break on the limit condition AFTER we ask for the next row. If there is none, we scan the entire table than exit. > Goal of this patch is to handle this specific case without impacting the others. -- This message was sent by Atlassian JIRA (v6.3.4#6332)