accumulo-dev mailing list archives

From Eric Newton <eric.new...@gmail.com>
Subject Re: tablet split
Date Tue, 20 Oct 2015 15:12:46 GMT
Accumulo will balance the tablets based on the configured balancer.

Without getting down into the details, the splits will be moved to other
nodes.

The Details:

It depends. The default balancer tries to even out the number of tablets
per server, table by table. So, if this table goes from 1 tablet to 3, and
there are at least 3 servers, each of the new tablets will eventually be
moved to a separate server. But if you add one split to a table that
already has hundreds of tablets, the counts stay nearly even, so the
balancer may not bother moving the new tablet.
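
To illustrate the count-based behavior described above, here is a minimal
sketch in Python. It is not Accumulo's balancer code: the function name
`balance_by_count`, the server names, and the tablet names are all
hypothetical, and the rule "move a tablet only while the busiest and idlest
servers differ by more than one tablet" is a simplifying assumption about
what "smoothing out" means.

```python
from collections import Counter


def balance_by_count(assignment, servers):
    """Even out tablet counts for ONE table (simplified illustration).

    assignment: dict mapping tablet name -> server name; modified in place.
    servers: list of server names.
    Returns the list of (tablet, from_server, to_server) migrations.
    """
    # Start every server at zero so idle servers are counted too.
    counts = Counter({s: 0 for s in servers})
    counts.update(assignment.values())

    moves = []
    while True:
        busiest = max(servers, key=lambda s: counts[s])
        idlest = min(servers, key=lambda s: counts[s])
        # Assumed stopping rule: a difference of one is "balanced enough".
        if counts[busiest] - counts[idlest] <= 1:
            break
        # Pick any tablet hosted on the busiest server (first by name).
        tablet = next(t for t in sorted(assignment) if assignment[t] == busiest)
        assignment[tablet] = idlest
        counts[busiest] -= 1
        counts[idlest] += 1
        moves.append((tablet, busiest, idlest))
    return moves


servers = ["ts1", "ts2", "ts3"]

# One tablet just split into three, all still on the original server.
small = {"t1": "ts1", "t2": "ts1", "t3": "ts1"}
small_moves = balance_by_count(small, servers)

# 300 tablets spread evenly, plus one new tablet from a split on ts1.
large = {f"t{i:03d}": servers[i % 3] for i in range(300)}
large["t300"] = "ts1"
large_moves = balance_by_count(large, servers)
```

In this sketch the three-tablet table ends up with one tablet per server,
while the single extra tablet in the large table causes no migration at all.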

-Eric


On Tue, Oct 20, 2015 at 10:39 AM, z11373 <z11373@outlook.com> wrote:

> As I understand it, Accumulo keeps data sorted by row ID, and as the
> number of rows grows, it will split the tablet at some point.
> For example, let's say I have the following row IDs:
>
> 1_abcxxx
> 1_abdxxx
> 1_abexxx
> 1_abfxxx
> 1_abgxxx
> 1_abhxxx
> 1_abixxx
> ...
> 1_zzzxxx
> 2_abcxxx
> 2_abdxxx
> 2_abexxx
> 2_abfxxx
> 2_abgxxx
> 2_abhxxx
> ...
>
> Let's say the data whose row IDs start with "1_" has a million rows, and
> for the sake of example, let's say the tablet size is 400K, so in this
> case the "1_" data will be split into 3 tablets.
>
> My question is: will Accumulo distribute those 3 tablets across different
> tablet servers? Or will two or all of them remain on the original tablet
> server?
>
>
> Thanks,
> Z
>
>
>
>
> --
> View this message in context:
> http://apache-accumulo.1065345.n5.nabble.com/tablet-split-tp15399.html
> Sent from the Developers mailing list archive at Nabble.com.
>
