hadoop-common-dev mailing list archives

From "Jim Kellerman (JIRA)" <j...@apache.org>
Subject [jira] Commented: (HADOOP-1439) Add endRow parameter to HClient#obtainScanner
Date Thu, 28 Jun 2007 00:41:26 GMT

    [ https://issues.apache.org/jira/browse/HADOOP-1439?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12508698 ]

Jim Kellerman commented on HADOOP-1439:
---------------------------------------

It seems to me that if I specify a filter that is a row key filter, then once the filter finds
a match, next() should keep returning values for as long as the row filter matches. Once it
stops matching, the filter should close out the scanner, since rows are scanned in key order
and no additional rows will match that filter.

In this particular case, I am talking about row key filters based on comparisons (>, =, <),
not regexp filters, because a regexp can potentially match any row.
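
To make the idea concrete, here is a minimal sketch of the stop-on-first-mismatch behavior. It is not HBase code: plain String row keys stand in for Text, and the class name RowFilterScanSketch, the scan() helper and the sample row keys are all hypothetical. It assumes rows arrive in sorted key order and that the filter matches one contiguous key range.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Iterator;
import java.util.List;
import java.util.function.Predicate;

// Hypothetical illustration only; none of these names exist in HBase.
final class RowFilterScanSketch {

    /**
     * Returns the rows accepted by rowFilter, stopping at the first row that
     * fails the filter after at least one row has matched. This is only valid
     * when the input is sorted and the filter matches a contiguous key range.
     */
    static List<String> scan(Iterator<String> sortedRows, Predicate<String> rowFilter) {
        List<String> matches = new ArrayList<>();
        boolean matched = false;
        while (sortedRows.hasNext()) {
            String row = sortedRows.next();
            if (rowFilter.test(row)) {
                matched = true;
                matches.add(row);
            } else if (matched) {
                // Past the matching range: no later row can match, so stop scanning.
                break;
            }
        }
        return matches;
    }

    public static void main(String[] args) {
        List<String> rows = Arrays.asList(
            "com.apache/", "com.google/a", "com.google/b", "com.yahoo/", "org.apache/");
        // Range filter: "com.google/" <= row < "com.google0" ('0' sorts right after '/').
        Predicate<String> inGoogle =
            r -> r.compareTo("com.google/") >= 0 && r.compareTo("com.google0") < 0;
        System.out.println(scan(rows.iterator(), inGoogle)); // prints [com.google/a, com.google/b]
    }
}
```

A regexp filter could not use the early break, since a later row might match again after a run of non-matching rows.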


> Add endRow parameter to HClient#obtainScanner
> ---------------------------------------------
>
>                 Key: HADOOP-1439
>                 URL: https://issues.apache.org/jira/browse/HADOOP-1439
>             Project: Hadoop
>          Issue Type: Improvement
>          Components: contrib/hbase
>            Reporter: stack
>            Assignee: stack
>            Priority: Minor
>
> Currently the HClient#obtainScanner looks like this:
> {code}
> public synchronized HScannerInterface obtainScanner(Text[] columns, Text startRow) throws IOException;
> {code}
> Add an overload that allows specification of endRow:
> {code}
> public synchronized HScannerInterface obtainScanner(Text[] columns, Text startRow, Text endRow) throws IOException;
> {code}
> Use Case: Table contains the whole web.  Client just wants to scan Google's pages.  Currently,
> the client could cut off the scanner as soon as the row key leaves the Google domain, but it
> would be cleaner if {{HScannerInterface#next()}} returns false
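
The proposed endRow behavior can be sketched roughly as follows. This is an assumption-laden illustration, not the HBase implementation: BoundedScanner, currentRow() and countRows() are hypothetical names, String row keys stand in for Text, endRow is assumed to be exclusive, and a null endRow is assumed to mean "scan to the end of the table".

```java
import java.util.Arrays;
import java.util.Iterator;
import java.util.List;

// Hypothetical illustration only; none of these names exist in HBase.
final class EndRowScannerSketch {

    /** Scans sorted row keys in the range [startRow, endRow). */
    static final class BoundedScanner {
        private final Iterator<String> rows;
        private final String startRow;
        private final String endRow; // assumption: exclusive; null means unbounded
        private String current;
        private boolean done;

        BoundedScanner(List<String> sortedRows, String startRow, String endRow) {
            this.rows = sortedRows.iterator();
            this.startRow = startRow;
            this.endRow = endRow;
        }

        /** Advances to the next row; returns false once endRow is reached or input ends. */
        boolean next() {
            while (!done && rows.hasNext()) {
                String row = rows.next();
                if (row.compareTo(startRow) < 0) {
                    continue; // still before the start of the requested range
                }
                if (endRow != null && row.compareTo(endRow) >= 0) {
                    break; // left the requested range; the scan is over
                }
                current = row;
                return true;
            }
            done = true;
            current = null;
            return false;
        }

        String currentRow() {
            return current;
        }
    }

    /** Counts the rows a client would see; the client never inspects keys itself. */
    static int countRows(List<String> sortedRows, String startRow, String endRow) {
        BoundedScanner scanner = new BoundedScanner(sortedRows, startRow, endRow);
        int count = 0;
        while (scanner.next()) {
            count++;
        }
        return count;
    }

    public static void main(String[] args) {
        List<String> rows = Arrays.asList(
            "com.apache/", "com.google/a", "com.google/b", "com.yahoo/", "org.apache/");
        System.out.println(countRows(rows, "com.google/", "com.google0")); // prints 2
    }
}
```

With the bound inside the scanner, the client loop reduces to `while (scanner.next())` and carries no domain check of its own.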

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

