hbase-user mailing list archives

From Minh Duc Nguyen <mdngu...@gmail.com>
Subject Re: Schema for sorted results
Date Tue, 24 Jul 2012 14:16:49 GMT

   According to the HBase book: http://hbase.apache.org/book.html#dm.sort

All data model operations in HBase return data in sorted order. First by row,
then by ColumnFamily, followed by column qualifier, and finally timestamp
(sorted in reverse, so newest records are returned first).
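
To make that ordering concrete, here is a minimal sketch in plain Java. It does not use the HBase client API: the Cell class and the compare method are hypothetical stand-ins, and String.compareTo is assumed to be a fair substitute for HBase's byte-wise comparison only because the sample values are plain ASCII.

```java
import java.util.ArrayList;
import java.util.List;

class SortOrderSketch {
    // Hypothetical stand-in for one stored cell; this is not an HBase class.
    static final class Cell {
        final String row;
        final String family;
        final String qualifier;
        final long timestamp;

        Cell(String row, String family, String qualifier, long timestamp) {
            this.row = row;
            this.family = family;
            this.qualifier = qualifier;
            this.timestamp = timestamp;
        }

        @Override
        public String toString() {
            return row + "/" + family + ":" + qualifier + "@" + timestamp;
        }
    }

    // Row, then column family, then qualifier ascending; timestamp descending.
    static int compare(Cell a, Cell b) {
        int c = a.row.compareTo(b.row);
        if (c != 0) return c;
        c = a.family.compareTo(b.family);
        if (c != 0) return c;
        c = a.qualifier.compareTo(b.qualifier);
        if (c != 0) return c;
        // Arguments swapped so that the newest timestamp sorts first.
        return Long.compare(b.timestamp, a.timestamp);
    }

    // Returns a sorted copy and leaves the input list untouched.
    static List<Cell> sorted(List<Cell> cells) {
        List<Cell> copy = new ArrayList<>(cells);
        copy.sort(SortOrderSketch::compare);
        return copy;
    }

    public static void main(String[] args) {
        List<Cell> cells = new ArrayList<>();
        cells.add(new Cell("row1", "cf", "a", 1L));
        cells.add(new Cell("row1", "cf", "a", 5L));
        cells.add(new Cell("row0", "cf", "z", 1L));
        for (Cell cell : sorted(cells)) {
            System.out.println(cell);
        }
    }
}
```

Note that only the timestamp is reversed; rows, families and qualifiers all sort ascending, which is why a descending sort on anything else has to be encoded into the key itself.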

    ~ Minh

On Tue, Jul 24, 2012 at 9:50 AM, Hari Prasanna <hari@slideshare.com> wrote:

> Hello -
> I'm using HBase for web server log processing and I'm trying to save
> the top N URLs per category per day in a sorted manner in HBase. From
> what I've read, the only sortable structure that HBase offers is the
> lexicographic sort in the row keys. So, here is the rowkey format I'm
> currently using:
> <date>|<category>|<padded_visits>|<url>
> where padded_visits = Long.MAX_VALUE - visits
> This seems wasteful because of the long rowkeys. Is there any other
> approach to maintain sorted results in HBase?
> Thanks
> Hari Prasanna
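
For reference, the row key scheme in the quoted message could be sketched roughly as follows. This is only an illustration under assumptions not stated in the original: the class and method names are hypothetical, the date is taken to be a fixed-width string such as yyyyMMdd, the inverted count is zero-padded to 19 digits (the width of Long.MAX_VALUE) so that string order matches numeric order, and keys are built as Strings rather than byte arrays.

```java
import java.util.TreeSet;

class TopUrlRowKey {
    // Builds <date>|<category>|<padded_visits>|<url> as described in the quoted message.
    // Subtracting from Long.MAX_VALUE makes higher visit counts sort first.
    static String rowKey(String date, String category, long visits, String url) {
        long inverted = Long.MAX_VALUE - visits;
        // Assumption: zero-pad to 19 digits so lexicographic order equals numeric order.
        String paddedVisits = String.format("%019d", inverted);
        return date + "|" + category + "|" + paddedVisits + "|" + url;
    }

    public static void main(String[] args) {
        // TreeSet keeps Strings in natural order, standing in for HBase's row ordering here.
        TreeSet<String> keys = new TreeSet<>();
        keys.add(rowKey("20120724", "news", 20L, "/b"));
        keys.add(rowKey("20120724", "news", 500L, "/a"));
        for (String key : keys) {
            System.out.println(key);
        }
    }
}
```

With keys shaped like this, the top N URLs for one day and category would be the first N rows of a scan starting at the <date>|<category>| prefix. A category containing the | delimiter, or categories where one is a prefix of another, would need extra care that this sketch does not handle.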
