hbase-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Jean-Marc Spaggiari (JIRA)" <j...@apache.org>
Subject [jira] [Created] (HBASE-13721) Improve shell scan performances when using LIMIT
Date Wed, 20 May 2015 13:01:00 GMT
Jean-Marc Spaggiari created HBASE-13721:

             Summary: Improve shell scan performances when using LIMIT
                 Key: HBASE-13721
                 URL: https://issues.apache.org/jira/browse/HBASE-13721
             Project: HBase
          Issue Type: Bug
          Components: shell
    Affects Versions: 1.1.0
            Reporter: Jean-Marc Spaggiari
            Assignee: Jean-Marc Spaggiari

When doing a scan which is expected to return the exact same number of rows as the LIMIT we
give, we still scan the entire table until we return the row(s) and then test the numbers
of rows we have. This can take a lot of time.

scan 'sensors', { COLUMNS => ['v:f92acb5b-079a-42bc-913a-657f270a3dc1'], STARTROW =>
'000a', LIMIT => 1 }

This is because we will break on the limit condition AFTER we ask for the next row. If there
is none, we scan the entire table than exit.

Goal of this patch is to handle this specific case without impacting the others.

This message was sent by Atlassian JIRA

View raw message