hbase-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Lars Hofhansl (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HBASE-1935) Scan in parallel
Date Mon, 19 Sep 2011 23:39:09 GMT

    [ https://issues.apache.org/jira/browse/HBASE-1935?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13108237#comment-13108237

Lars Hofhansl commented on HBASE-1935:

Created HBASE-4439.

Thinking about this issue a bit more. The patch-idea I provided is actually superfluous.

The same can be achieved, by just passing a Scan with start/endRow set to a Regions start/endRow
to HTable.getScanner(Scan), this also has the added benefit of dealing with concurrent splits.

That is also how one could implement a parallel scanner with much less effort.

> Scan in parallel
> ----------------
>                 Key: HBASE-1935
>                 URL: https://issues.apache.org/jira/browse/HBASE-1935
>             Project: HBase
>          Issue Type: New Feature
>          Components: coprocessors
>            Reporter: stack
>         Attachments: 1935-idea.txt, pscanner-v2.patch, pscanner-v3.patch, pscanner-v4.patch,
> A scanner that rather than scan in series, instead scanned multiple regions in parallell
would be more involved but could complete much faster partiularly if results are sparse.

This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


View raw message