From: Michael Craig <mcraig@box.com>
Date: Thu, 20 Oct 2016 10:28:23 -0700
Subject: Re: Correct way to redistribute work from disconnected instances?
To: user@helix.apache.org

That works! The cluster is automatically rebalancing when nodes
start/stop. This has raised other questions about rebalancing. Example
output below, and I updated the gist:
https://gist.github.com/mkscrg/bcb2ab1dd1b3e84ac93e7ca16e2824f8

- When NODE_0 restarts, why is the resource moved back? This seems like
  unhelpful churn in the cluster.
- Why does the resource stay in the OFFLINE state on NODE_0?

2-node cluster with a single resource with 1 partition/replica, using
OnlineOffline:

Starting ZooKeeper at localhost:2199
Setting up cluster THE_CLUSTER
Starting CONTROLLER
Starting NODE_0
Starting NODE_1
Adding resource THE_RESOURCE
Rebalancing resource THE_RESOURCE
Transition: NODE_0 OFFLINE to ONLINE for THE_RESOURCE
Cluster state after setup:
NODE_0: ONLINE
NODE_1: null
------------------------------------------------------------
Stopping NODE_0
Transition: NODE_1 OFFLINE to ONLINE for THE_RESOURCE
Cluster state after stopping first node:
NODE_0: null
NODE_1: ONLINE
------------------------------------------------------------
Starting NODE_0
Transition: NODE_1 ONLINE to OFFLINE for THE_RESOURCE
Transition: NODE_1 OFFLINE to DROPPED for THE_RESOURCE
Cluster state after restarting first node:
NODE_0: OFFLINE
NODE_1: null
------------------------------------------------------------
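For reference, here is a minimal, self-contained sketch of the FULL_AUTO
setup along the lines of Lei's suggestion quoted below. It assumes Helix
0.6.x's ZKHelixAdmin and the String-valued rebalance-mode overload of
addResource(); the cluster, resource, and state-model names mirror the
gist, and the class name is purely illustrative.

import org.apache.helix.HelixAdmin;
import org.apache.helix.manager.zk.ZKHelixAdmin;
import org.apache.helix.model.IdealState.RebalanceMode;

public class AddFullAutoResource {
  public static void main(String[] args) {
    // Illustrative values matching the gist above.
    String zkAddress = "localhost:2199";
    String clusterName = "THE_CLUSTER";
    String resourceName = "THE_RESOURCE";
    String stateModel = "OnlineOffline";
    int numPartitions = 1;
    int numReplicas = 1;

    HelixAdmin admin = new ZKHelixAdmin(zkAddress);

    // Pass the rebalance mode explicitly so the resource is created in
    // FULL_AUTO mode instead of the SEMI_AUTO default.
    admin.addResource(clusterName, resourceName, numPartitions, stateModel,
        RebalanceMode.FULL_AUTO.toString());

    // One-time rebalance to write the initial ideal state; after this the
    // controller reassigns the partition on its own as nodes come and go.
    admin.rebalance(clusterName, resourceName, numReplicas);
  }
}

With the resource created this way, no per-node rebalance calls should be
needed when instances join or leave the cluster.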
On Thu, Oct 20, 2016 at 9:18 AM, Lei Xia <lxia@linkedin.com> wrote:

> Hi, Michael
>
> To answer your questions:
>
> - Should you have to `rebalance` a resource when adding a new node to
>   the cluster?
>   --- No if you are using full-auto rebalance mode; yes if you are in
>   semi-auto rebalance mode.
> - Should you have to `rebalance` when a node is dropped?
>   -- Again, same answer: no, you do not need to in full-auto mode. In
>   full-auto mode, Helix is supposed to detect node
>   add/delete/online/offline and rebalance the resource automatically.
>
> The problem you saw was because your resource was created in SEMI-AUTO
> mode instead of FULL-AUTO mode. HelixAdmin.addResource() creates a
> resource in semi-auto mode by default if you do not specify a rebalance
> mode explicitly. Please see my comments below on how to fix it.
>
> static void addResource() throws Exception {
>   echo("Adding resource " + RESOURCE_NAME);
>   ADMIN.addResource(CLUSTER_NAME, RESOURCE_NAME, NUM_PARTITIONS,
>       STATE_MODEL_NAME);
>   ==> ADMIN.addResource(CLUSTER_NAME, RESOURCE_NAME, NUM_PARTITIONS,
>       STATE_MODEL_NAME, RebalanceMode.FULL_AUTO);
>   echo("Rebalancing resource " + RESOURCE_NAME);
>   ADMIN.rebalance(CLUSTER_NAME, RESOURCE_NAME, NUM_REPLICAS);
>   // This just needs to be called once after the resource is created; no
>   // need to call it when there is a node change.
> }
>
> Please give it a try and let me know whether it works. Thanks!
>
> Lei
>
> On Wed, Oct 19, 2016 at 11:52 PM, Michael Craig <mcraig@box.com> wrote:
>
>> Here is some repro code for the "drop a node, resource is not
>> redistributed" case I described:
>> https://gist.github.com/mkscrg/bcb2ab1dd1b3e84ac93e7ca16e2824f8
>>
>> Can we answer these 2 questions? That would help clarify things:
>>
>> - Should you have to `rebalance` a resource when adding a new node to
>>   the cluster?
>>   - If no, this is an easy bug to reproduce. The example code calls
>>     rebalance after adding a node, and it breaks if you comment out
>>     that line.
>>   - If yes, what is the correct way to manage many resources on a
>>     cluster? Iterate through all resources and rebalance them for
>>     every new node?
>> - Should you have to `rebalance` when a node is dropped?
>>   - If no, there is a bug. See the repro code posted above.
>>   - If yes, we are in the same rebalance-every-resource situation as
>>     above.
>>
>> My use case is to manage a set of ad-hoc tasks across a cluster of
>> machines. Each task would be a separate resource with a unique name,
>> with 1 partition and 1 replica. Each resource would reside on exactly
>> 1 node, and there is no limit on the number of resources per node.
>>
>> On Wed, Oct 19, 2016 at 9:23 PM, Lei Xia <xiaxlei@gmail.com> wrote:
>>
>>> Hi, Michael
>>>
>>> Could you be more specific on the issue you see? Specifically:
>>> 1) For 1 resource and 2 replicas, you mean the resource has only 1
>>>    partition, with a replica count of 2, right?
>>> 2) You see REBALANCE_MODE="FULL_AUTO", not IDEALSTATE_MODE="AUTO",
>>>    in your idealState, right?
>>> 3) By dropping N1, you mean disconnecting N1 from helix/zookeeper,
>>>    so N1 is not in liveInstances, right?
>>>
>>> If your answers to all of the above questions are yes, then there may
>>> be some bug here. If possible, please paste your idealState and your
>>> test code (if there is any) here, and I will try to reproduce and
>>> debug it. Thanks
>>>
>>> Lei
>>>
>>> On Wed, Oct 19, 2016 at 9:02 PM, kishore g <g.kishore@gmail.com> wrote:
>>>
>>>> Can you describe your scenario in detail and the expected behavior?
>>>> I agree calling rebalance on every live instance change is ugly and
>>>> definitely not as per the design. It was an oversight (we focused a
>>>> lot on large numbers of partitions and failed to handle this simple
>>>> case).
>>>>
>>>> Please file a jira and we will work on that. Lei, do you think the
>>>> recent bug we fixed with AutoRebalancer will handle this case?
>>>>
>>>> thanks,
>>>> Kishore G
>>>>
>>>> On Wed, Oct 19, 2016 at 8:55 PM, Michael Craig <mcraig@box.com> wrote:
>>>>
>>>>> Thanks for the quick response Kishore. This issue is definitely
>>>>> tied to the condition that partitions * replicas < NODE_COUNT.
>>>>> If all running nodes have a "piece" of the resource, then they
>>>>> behave well when the LEADER node goes away.
>>>>>
>>>>> Is it possible to use Helix to manage a set of resources where that
>>>>> condition is true? I.e. where the *total* number of
>>>>> partitions/replicas in the cluster is greater than the node count,
>>>>> but each individual resource has a small number of
>>>>> partitions/replicas.
>>>>>
>>>>> (Calling rebalance on every liveInstance change does not seem like
>>>>> a good solution, because you would have to iterate through all
>>>>> resources in the cluster and rebalance each individually.)
>>>>>
>>>>> On Wed, Oct 19, 2016 at 12:52 PM, kishore g <g.kishore@gmail.com> wrote:
>>>>>
>>>>>> I think this might be a corner case when partitions * replicas <
>>>>>> TOTAL_NUMBER_OF_NODES. Can you try with many partitions and
>>>>>> replicas and check if the issue still exists.
>>>>>>
>>>>>> On Wed, Oct 19, 2016 at 11:53 AM, Michael Craig <mcraig@box.com> wrote:
>>>>>>
>>>>>>> I've noticed that partitions/replicas assigned to disconnected
>>>>>>> instances are not automatically redistributed to live instances.
>>>>>>> What's the correct way to do this?
>>>>>>>
>>>>>>> For example, given this setup with Helix 0.6.5:
>>>>>>> - 1 resource
>>>>>>> - 2 replicas
>>>>>>> - LeaderStandby state model
>>>>>>> - FULL_AUTO rebalance mode
>>>>>>> - 3 nodes (N1 is Leader, N2 is Standby, N3 is just sitting)
>>>>>>>
>>>>>>> Then drop N1:
>>>>>>> - N2 becomes LEADER
>>>>>>> - Nothing happens to N3
>>>>>>>
>>>>>>> Naively, I would have expected N3 to transition from Offline to
>>>>>>> Standby, but that doesn't happen.
>>>>>>>
>>>>>>> I can force redistribution from
>>>>>>> GenericHelixController#onLiveInstanceChange by
>>>>>>> - dropping non-live instances from the cluster
>>>>>>> - calling rebalance
>>>>>>>
>>>>>>> The instance dropping seems pretty unsafe! Is there a better way?
>>>
>>> --
>>> Lei Xia
>
> --
> Lei Xia
> Senior Software Engineer
> Data Infra/Nuage & Helix
> LinkedIn
>
> lxia@linkedin.com
> www.linkedin.com/in/lxia1
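For context on the OFFLINE/ONLINE/DROPPED lines in the log at the top of
this thread, here is a minimal participant-side sketch of the OnlineOffline
state model and its registration. It assumes Helix 0.6.x's
HelixManagerFactory API and the 0.6.x StateModelFactory callback signature;
the class names and the NODE_0 instance name are illustrative, not taken
from the gist.

import org.apache.helix.HelixManager;
import org.apache.helix.HelixManagerFactory;
import org.apache.helix.InstanceType;
import org.apache.helix.NotificationContext;
import org.apache.helix.model.Message;
import org.apache.helix.participant.StateMachineEngine;
import org.apache.helix.participant.statemachine.StateModel;
import org.apache.helix.participant.statemachine.StateModelFactory;
import org.apache.helix.participant.statemachine.StateModelInfo;
import org.apache.helix.participant.statemachine.Transition;

public class OnlineOfflineParticipant {

  @StateModelInfo(initialState = "OFFLINE",
                  states = {"ONLINE", "OFFLINE", "DROPPED"})
  public static class OnlineOfflineStateModel extends StateModel {
    @Transition(from = "OFFLINE", to = "ONLINE")
    public void onBecomeOnlineFromOffline(Message msg, NotificationContext ctx) {
      // Start serving the partition on this instance.
      System.out.println("OFFLINE to ONLINE for " + msg.getResourceName());
    }

    @Transition(from = "ONLINE", to = "OFFLINE")
    public void onBecomeOfflineFromOnline(Message msg, NotificationContext ctx) {
      // Stop serving; the controller has moved the partition elsewhere.
      System.out.println("ONLINE to OFFLINE for " + msg.getResourceName());
    }

    @Transition(from = "OFFLINE", to = "DROPPED")
    public void onBecomeDroppedFromOffline(Message msg, NotificationContext ctx) {
      // Clean up local state; the partition has left this instance for good.
      System.out.println("OFFLINE to DROPPED for " + msg.getResourceName());
    }
  }

  // 0.6.x-style factory: one state model instance per partition.
  public static class Factory extends StateModelFactory<StateModel> {
    @Override
    public StateModel createNewStateModel(String partitionName) {
      return new OnlineOfflineStateModel();
    }
  }

  public static void main(String[] args) throws Exception {
    // Illustrative cluster name, instance name, and ZooKeeper address.
    HelixManager manager = HelixManagerFactory.getZKHelixManager(
        "THE_CLUSTER", "NODE_0", InstanceType.PARTICIPANT, "localhost:2199");
    StateMachineEngine engine = manager.getStateMachineEngine();
    engine.registerStateModelFactory("OnlineOffline", new Factory());
    manager.connect();
    Thread.currentThread().join();  // keep the participant alive
  }
}

The NODE_1 sequence in the log (ONLINE to OFFLINE, then OFFLINE to DROPPED)
is the controller first taking the partition offline on that instance and
then removing it from the instance's current-state mapping once it has been
placed elsewhere.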