Return-Path: X-Original-To: apmail-storm-user-archive@minotaur.apache.org Delivered-To: apmail-storm-user-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 7B4851098D for ; Wed, 2 Apr 2014 15:13:07 +0000 (UTC) Received: (qmail 26442 invoked by uid 500); 2 Apr 2014 15:13:06 -0000 Delivered-To: apmail-storm-user-archive@storm.apache.org Received: (qmail 26402 invoked by uid 500); 2 Apr 2014 15:13:06 -0000 Mailing-List: contact user-help@storm.incubator.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@storm.incubator.apache.org Delivered-To: mailing list user@storm.incubator.apache.org Received: (qmail 26195 invoked by uid 99); 2 Apr 2014 15:13:03 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 02 Apr 2014 15:13:03 +0000 X-ASF-Spam-Status: No, hits=1.5 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of anujsays@gmail.com designates 209.85.212.174 as permitted sender) Received: from [209.85.212.174] (HELO mail-wi0-f174.google.com) (209.85.212.174) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 02 Apr 2014 15:12:58 +0000 Received: by mail-wi0-f174.google.com with SMTP id d1so7364618wiv.7 for ; Wed, 02 Apr 2014 08:12:36 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; bh=NbVz+FF1wIEbQLBc0BG2HgQxXh/33GeXhrIoCi1rn7A=; b=OyZ8V8kIOffE0ePiG9WaLjKfZxXQBaBUwUqjxyGjdpRk7FtK0TMqX4cU5ImbIPDb/P QOpw8KVS6DZ1lcPCIrBVGpenzsfOo/bvXYqDKPt1w3HJmq8JJ82Dr/s9HO9MGlvtIXfY Cn46WwWtGfLEigWD+BI7LoFQgQX9iK9ISNZ5sp95PkBg54tio1oRXsw54/j8e8zuUoQ+ RRO1kTZbRXDAF0JMiAUQwCQclZArhZ5P5zgFqBM/USe/xI0WVmeEwoavqSCXfvlgu1T1 hOWbrn2kLRSzJ/P4GquSRXzpwcpe9M0jKwxJ49nkv7t+BZjBcQeQSM7yVPS4F3I5sO4+ tX6w== MIME-Version: 1.0 X-Received: by 10.194.187.50 with SMTP id fp18mr1390670wjc.89.1396451556590; Wed, 02 Apr 2014 08:12:36 -0700 (PDT) Received: by 10.217.11.201 with HTTP; Wed, 2 Apr 2014 08:12:36 -0700 (PDT) In-Reply-To: References: Date: Wed, 2 Apr 2014 20:42:36 +0530 Message-ID: Subject: Re: Whole topology restarts when a worker crash From: Anuj Kumar To: user@storm.incubator.apache.org Content-Type: multipart/alternative; boundary=047d7bb03b4efa0ae404f610b6eb X-Virus-Checked: Checked by ClamAV on apache.org --047d7bb03b4efa0ae404f610b6eb Content-Type: text/plain; charset=ISO-8859-1 Great! On Wed, Apr 2, 2014 at 4:48 PM, Quentin de G. wrote: > OK, seems all this was a misunderstanding of how Trident works. > > I allowed 2 supevisor slots on each machine, but kept 1 worker per > machine. When I killed machine A, worker from machine A took the unused > slot on machine B. First worker slot didn't restarted. I then restarted > machine A and rebalanced the topology. Only one worker was moved. > This is the behaviour I expected. > > To put it in a nutshell : Always reserve empty slots for fault-tolerance. > Thanks again for the time you took Anuj :) > > --047d7bb03b4efa0ae404f610b6eb Content-Type: text/html; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable
Great!


On Wed, Apr 2, 2014 at 4:48 PM, Quentin de G. <quen= tin.de.gr@gmail.com> wrote:
OK= , seems all this was a misunderstanding of how Trident works.

= I allowed 2 supevisor slots on each machine, but kept 1 worker per machine.= When I killed machine A, worker from machine A took the unused slot on mac= hine B. First worker slot didn't restarted. I then restarted machine A = and rebalanced the topology. Only one worker was moved.
This is the behaviour I expected.

To put it in a nutshel= l : Always reserve empty slots for fault-tolerance.
Thanks again f= or the time you took Anuj :)


--047d7bb03b4efa0ae404f610b6eb--