hbase-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Dave Revell (Created) (JIRA)" <j...@apache.org>
Subject [jira] [Created] (HBASE-4489) Better key splitting in RegionSplitter
Date Mon, 26 Sep 2011 22:01:12 GMT
Better key splitting in RegionSplitter

                 Key: HBASE-4489
                 URL: https://issues.apache.org/jira/browse/HBASE-4489
             Project: HBase
          Issue Type: Improvement
            Reporter: Dave Revell

The RegionSplitter utility allows users to create a pre-split table from the command line
or do a rolling split on an existing table. It supports pluggable split algorithms that implement
the SplitAlgorithm interface. The only/default SplitAlgorithm is one that assumes keys fall
in the range from ASCII string "00000000" to ASCII string "7FFFFFFF". This is not a sane default,
and seems useless to most users. Users are likely to be surprised by the fact that all the
region splits occur in in the byte range of ASCII characters.

A better default split algorithm would be one that evenly divides the space of all bytes,
which is what this patch does. Making a table with five regions would split at \x33\x33...,
\x66\x66...., \x99\x99..., \xCC\xCC..., and \xFF\xFF.

This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira


View raw message