hbase-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "stack (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HBASE-3958) use Scan with setCaching() and PageFilter have a problem
Date Fri, 23 Sep 2011 17:16:26 GMT

    [ https://issues.apache.org/jira/browse/HBASE-3958?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13113568#comment-13113568

stack commented on HBASE-3958:

I'm not sure I am completely understanding the problem but the javadoc on PageFilter says
that it will not work across region boundaries:  http://hbase.apache.org/apidocs/org/apache/hadoop/hbase/filter/PageFilter.html

Filters have row scope only.  If your scope is beyond a single row, the results will be indeterminate.
 We say this in Filter javadoc but we don't say it enough and we don't say it on the main
Filter page.  We need to make this more clear.

Is this about indeterminate behavior because filter is working across rows?

> use Scan with setCaching() and PageFilter have a problem
> --------------------------------------------------------
>                 Key: HBASE-3958
>                 URL: https://issues.apache.org/jira/browse/HBASE-3958
>             Project: HBase
>          Issue Type: Bug
>          Components: filters, regionserver
>    Affects Versions: 0.90.3
>         Environment: Linux testbox 2.6.18-238.el5 #1 SMP Sun Dec 19 14:22:44 EST 2010
x86_64 x86_64 x86_64 GNU/Linux
> java version "1.6.0_23"
> Java(TM) SE Runtime Environment (build 1.6.0_23-b05)
> Java HotSpot(TM) 64-Bit Server VM (build 19.0-b09, mixed mode)
>            Reporter: Jerry Du
>            Priority: Minor
> I have a table with 3 ranges,then I scan the table cross all 3 ranges.
> Scan scan = new Scan();
> scan.setCaching(10);
> scan.setFilter(new PageFilter(21));
> [result rows count = 63]
> the Result has 63 rows, each range has scaned,and locally limit to page_szie.That is
expect result.
> Then if the page_size = N * caching_size, then result has only page_size rows,only the
first range has scanned.
> If page_size is Multiple of caching_size,one range rsult just align fill the caching,then
client NOT trrige next range scan.
> Example:
> Scan scan = new Scan();
> scan.setCaching(10);
> scan.setFilter(new PageFilter(20));
> [result rows count = 20]

This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


View raw message