hbase-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Jonathan Gray (JIRA)" <j...@apache.org>
Subject [jira] Updated: (HBASE-1183) New MR splitting algorithm and other new features need a way to split a key range in N chunks
Date Tue, 03 Feb 2009 22:53:59 GMT

     [ https://issues.apache.org/jira/browse/HBASE-1183?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Jonathan Gray updated HBASE-1183:
---------------------------------

    Attachment: hbase-1183-v1.patch

Introduces 5 new functions to Bytes util class:

- public static byte [][] split(final byte [] a, final byte [] b, final int num)
- public static byte [] head(final byte [] a, final int length)
- public static byte [] tail(final byte [] a, final int length)
- public static byte [] padHead(final byte [] a, final int length)
- public static byte [] padTail(final byte [] a, final int length)

head/tail are certainly useful and generic.  Not sure we have a need for the padHead/padTail
functions but it's used for splitting (start/stop need to be same length for BigInteger and
am also prepending a 0 to both to ensure they are not intepreted as negative numbers).

> New MR splitting algorithm and other new features need a way to split a key range in
N chunks
> ---------------------------------------------------------------------------------------------
>
>                 Key: HBASE-1183
>                 URL: https://issues.apache.org/jira/browse/HBASE-1183
>             Project: Hadoop HBase
>          Issue Type: Improvement
>          Components: util
>            Reporter: Jonathan Gray
>            Assignee: Jonathan Gray
>            Priority: Minor
>             Fix For: 0.19.1, 0.20.0
>
>         Attachments: hbase-1183-v1.patch
>
>
> For HBASE-1172 and other functionality coming soon, we need to be able to take a [start,stop)
range and divide it into chunks.
> For example, we have 10 regions but want to run 30 maps.  We need to divide each region
into three key ranges for the start/stop of each scanner.
> Implementing using java.math.BigInteger
> Will also include a couple additional helpers in Bytes to make life easy.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message