hbase-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Ted Yu <yuzhih...@gmail.com>
Subject Re: How to implement sorting in HBase scans for a particular column
Date Tue, 29 Apr 2014 12:16:32 GMT
Have you looked at Apache Phoenix ?

Cheers

On Apr 29, 2014, at 2:13 AM, Vikram Singh Chandel <vikramsinghchandel@gmail.com> wrote:

> Hi
> 
> We have a requirement in which we have to get the scan result sorted on a
> particular column.
> 
> eg. *Get Details of Authors sorted by their Publication Count. Limit :1000 *
> 
> *Row Key is a MD5 hash of Author Id*
> 
> Number of records 8.2 million rows for 3 year data.(sample dataset, actual
> data set is 30 years)
> 
> We are currently looking in to implement a *comparator *and sort the
> values. But but for this first we have to store all 8.2 m records in a
> map/list and then sort. And this approach is neither memory efficient nor
> time efficient.
> 
> Is there any solution via which this kind of request can be fulfilled in
> real time?
> 
> 
> 
> -- 
> *Regards*
> 
> *VIKRAM SINGH CHANDEL*
> 
> Please do not print this email unless it is absolutely necessary,Reduce.
> Reuse. Recycle. Save our planet.

Mime
View raw message