hbase-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Lars George (JIRA)" <j...@apache.org>
Subject [jira] Updated: (HBASE-1829) Make use of start/stop row in TableInputFormat
Date Fri, 06 Nov 2009 21:31:32 GMT

     [ https://issues.apache.org/jira/browse/HBASE-1829?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel

Lars George updated HBASE-1829:

    Attachment: HBASE-1829-v3.patch

OK, after pretty much two days of getting up to speed with how the regions mechanism works
I had to add a hook to flush the region cache in HConnection. WIth that I was able to recreate
the old MultiRegion functionality in the new HBaseTestingUtility. 

I have added 11 subtests that cover all combinations of empty or not empty start and stop
rows as well as single region to spanning many regions scans. All succeed, but please someone
review the big "if" statement in TableInputFormatBase.getSplits(). I want to make sure I have
that right. The tests say yes, but a second pair of eyes is appreciated.

> Make use of start/stop row in TableInputFormat
> ----------------------------------------------
>                 Key: HBASE-1829
>                 URL: https://issues.apache.org/jira/browse/HBASE-1829
>             Project: Hadoop HBase
>          Issue Type: Improvement
>          Components: mapred
>    Affects Versions: 0.20.0
>            Reporter: Lars George
>            Assignee: Lars George
>             Fix For: 0.20.2, 0.21.0
>         Attachments: HBASE-1829-v2.patch, HBASE-1829-v3.patch, HBASE-1829.patch, HBaseTestingUtility.java,
> Since we can now specify a start and stop row with the Scan that is handed to the TIF
we can reduce the splits to the regions that contain these rows. That allows to test large
MR jobs on a single region for example.

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.

View raw message