hbase-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From lars hofhansl <lhofha...@yahoo.com>
Subject Re: Regarding rowkey
Date Wed, 12 Sep 2012 15:22:03 GMT
If you (1) load through the HTable API and (2) have multiple region servers and (3) have monotonically
increasing row keys: Yes.


-- Lars


----- Original Message -----
From: Ramasubramanian <ramasubramanian.narayanan@gmail.com>
To: "user@hbase.apache.org" <user@hbase.apache.org>
Cc: 
Sent: Wednesday, September 12, 2012 12:07 AM
Subject: Re: Regarding rowkey

Hi thanks! But for loading data into hbase, adding hash in rowkey will improve performance?

Regards,
Rams

On 12-Sep-2012, at 8:38 AM, lars hofhansl <lhofhansl@yahoo.com> wrote:

> It depends. If you do not need to perform rangescans along (prefixes of) your row keys,
you can prefix the row key by a hash of the row key.
> That will give you a more or less random distribution of the keys and hence not hit the
same region server over and over.
> 
> You'll probably also want to presplit your table then.
> 
> -- Lars
> 
> 
> 
> ----- Original Message -----
> From: Ramasubramanian <ramasubramanian.narayanan@gmail.com>
> To: user@hbase.apache.org
> Cc: 
> Sent: Tuesday, September 11, 2012 10:39 AM
> Subject: Regarding rowkey
> 
> Hi,
> 
> What can be used as rowkey to improve performance while loading into hbase? Currently
I am having sequence. It takes some 11 odd minutes to load 1 million record with 147 columns.
> 
> Regards,
> Rams 


Mime
View raw message