hbase-user mailing list archives

From Andrew Purtell <apurt...@apache.org>
Subject Re: Collation order of items
Date Fri, 08 Jun 2012 18:59:03 GMT
On Fri, Jun 8, 2012 at 10:58 AM, Jason Frantz <jfrantz@maprtech.com> wrote:
> Yet another approach is to transform your keys into byte comparable values
> that preserve your desired sort order, and store that instead. The ICU
> library has the ability to do this for various collations of UTF strings:
> http://userguide.icu-project.org/collation/architecture#TOC-Sort-Keys
> So for this case HBase could store the ICU sortkey rather than the actual
> UTF string. You then get correct scans, but just as in Ian's example, you
> need to implement a layer that converts your client requests to HBase from
> UTF strings to sort keys. This will almost certainly give you better HBase
> performance since memcmp is generally faster than a custom comparator.
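
The sort-key approach quoted above can be sketched roughly as follows. This is a minimal illustration, not the ICU or HBase API: it assumes the JDK's built-in java.text.Collator as a stand-in for ICU (ICU4J exposes a similar Collator/CollationKey pair), and the class and helper names (SortKeyDemo, sortKey, compareBytes) are hypothetical. The unsigned byte comparison stands in for the memcmp-style ordering applied to row keys.

```java
import java.nio.charset.StandardCharsets;
import java.text.Collator;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Locale;

public class SortKeyDemo {
    // Hypothetical helper: turn a string into a byte-comparable sort key
    // for the given collation. The JDK documents that comparing these byte
    // arrays gives the same result as comparing the CollationKeys.
    static byte[] sortKey(Collator collator, String s) {
        return collator.getCollationKey(s).toByteArray();
    }

    // Unsigned lexicographic byte comparison, i.e. memcmp-style ordering.
    static int compareBytes(byte[] a, byte[] b) {
        return Arrays.compareUnsigned(a, b);
    }

    public static void main(String[] args) {
        // Assumption: English collation; pick the locale your data needs.
        Collator collator = Collator.getInstance(Locale.ENGLISH);
        List<String> words = new ArrayList<>(List.of("apple", "Banana", "cherry"));

        // Order produced by comparing the raw UTF-8 bytes of each string.
        words.sort((x, y) -> compareBytes(
                x.getBytes(StandardCharsets.UTF_8),
                y.getBytes(StandardCharsets.UTF_8)));
        System.out.println("raw UTF-8 bytes: " + words);

        // Order produced by comparing sort keys with the same byte comparator.
        words.sort((x, y) -> compareBytes(sortKey(collator, x), sortKey(collator, y)));
        System.out.println("sort keys:       " + words);
    }
}
```

Note that a sort key is generally not reversible, so in a setup like this the original string would likely need to be stored alongside the key (for example in a column value) if it has to be read back.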

I love this mailing list. Thanks, you just helped solve a problem for
me unrelated to HBase.

Best regards,

   - Andy

Problems worthy of attack prove their worth by hitting back. - Piet
Hein (via Tom White)
