hbase-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Tatsuya Kawano <tatsuya6...@gmail.com>
Subject Re: impact of total region numbers?
Date Tue, 18 Jan 2011 02:45:18 GMT
Hi Tao, 

I think the number of regions won't have much impact to random read throughput and latency.
But the number of generations (HFiles) per region will do. 

If this is the case, try to run major compaction on the table. This will merge HFile generations
so the read throughput and latency will be recovered. You can do this from the hbase shell.

Also, you might want to increase  hbase.region.mstore.flush.size to keep the number of HFile
generations smaller.


Tatsuya Kawano (Mr.)
Tokyo, Japan

On Jan 18, 2011, at 11:20 AM, Tao Xie <xietao.mailbox@gmail.com> wrote:

> For example, I have total some data and I can tune
> hbase.hregion.max.filesize to increase/decrease total region number, rite?
> I want to know if the region number has performance impact to random read
> tests. I observed that in my ycsb test,  with larger hfile size, I got
> better tput and smaller latency.
> Anybody can give me hints. Thanks.
> Tao

View raw message