hbase-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Alok Singh <aloksi...@gmail.com>
Subject Re: HBase region assignment by range?
Date Wed, 08 Apr 2015 02:47:03 GMT
>I looked at presplit: { SPLITS => ['row100','row200','row300'] } , but
>don't think it serves this purpose.

Why doesn't this work for you? Is it because regions are not evenly
distributed across the cluster after the split? You can move regions
manually and spread them out evenly.


On Tue, Apr 7, 2015 at 5:05 PM, Demai Ni <nidmgg@gmail.com> wrote:
> hi, folks,
> I have a question about region assignment and like to clarify some through.
> Let's say I have a table with rowkey as "row00000 ~ row30000" on a 4 node
> hbase cluster, is there a way to keep data partitioned by range on each
> node? for example:
> node1:  <=row10000
> node2:  row10001~row20000
> node3:  row20001~row30000
> node4:  >row30000
> And even when one of the node become hotspot, the boundary won't be crossed
> unless manually doing a load balancing?
> I looked at presplit: { SPLITS => ['row100','row200','row300'] } , but
> don't think it serves this purpose.
> BTW, a bit background. I am thinking to do a local join between two tables
> if both have same rowkey, and partitioned by range (or same hash
> algorithm). If I can keep the join-key on the same node(aka regionServer),
> the join can be handled locally instead of broadcast to all other nodes.
> Thanks for your input. A couple pointers to blog/presentation would be
> appreciated.
> Demai

View raw message