From: "Quentin de G."
Date: Wed, 2 Apr 2014 09:08:51 +0200
Subject: Re: Whole topology restarts when a worker crash
To: user@storm.incubator.apache.org

Hello again, Anuj, and thanks for your time.

I just checked the worker process ID on 169.machine.com.

If I kill a worker on 168.machine.com, the worker PID on 169.machine.com does not change, and the tasks on 169.machine.com don't restart.
If I kill the supervisor on 168.machine.com, the worker PID on 169.machine.com changes and all tasks are restarted. I'm not sure whether this is normal behaviour.

Thanks.

On Tue, Apr 1, 2014 at 6:23 PM, Anuj Kumar wrote:

> Can you check the worker process ID on 169.machine.com? Does it
> change when you kill the worker on 168.machine.com?
>
> On Tue, Apr 1, 2014 at 9:41 PM, Quentin de G. wrote:
>
>> Sure, here's the URL of the gist:
>> https://gist.github.com/noKid/e1fdbc582973f5c3e5d6
>>
>> Screenshots:
>>   Before:
>>     Storm UI: http://postimg.org/image/dpsxjbjwv/
>>     Topology: http://postimg.org/image/664wedx1r/
>>     A topology component: http://postimg.org/image/nkp4mnu6n/
>>
>>   After:
>>     Same topology component: http://postimg.org/image/e2pdmm8i7/
>>
>> I only killed 168.machine.com, yet all the tasks (even those on
>> 169.machine.com) restarted :(
>>
>> On Tue, Apr 1, 2014 at 5:20 PM, Anuj Kumar wrote:
>>
>>> Can you share the gist of your Trident topology and some screenshots of
>>> the current state and the state after you kill the worker and it restarts?
>>>
>>> On Tue, Apr 1, 2014 at 8:25 PM, Quentin de G. wrote:
>>>
>>>> Yes.

--
Quentin de Grandmaison
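The PID check discussed in this thread (note the worker PIDs on each host, kill something on 168.machine.com, then look again) could be scripted roughly as follows. This is only a minimal sketch: it assumes the worker PIDs have already been collected per host by some other means (for example by running `jps` or `ps` on each machine), and the `restarted_hosts` helper and all PID values are hypothetical, made up for illustration rather than taken from the cluster.

```python
# Hypothetical helper: compare two snapshots of worker PIDs, keyed by host.
# Each snapshot maps a host name to the set of worker PIDs seen on that host.

def restarted_hosts(before, after):
    """Return the sorted list of hosts whose set of worker PIDs changed."""
    hosts = set(before) | set(after)
    return sorted(
        host for host in hosts
        if before.get(host, set()) != after.get(host, set())
    )


# Illustrative PIDs only; these numbers are invented for the example.
baseline = {"168.machine.com": {4101}, "169.machine.com": {5202}}

# Case 1: a worker is killed on 168; only 168 comes back with a new PID.
after_worker_kill = {"168.machine.com": {4377}, "169.machine.com": {5202}}

# Case 2: the supervisor is killed on 168; the PID on 169 changes as well.
# (That 168 also gets a new PID here is an assumption of this example.)
after_supervisor_kill = {"168.machine.com": {4410}, "169.machine.com": {5588}}

print(restarted_hosts(baseline, after_worker_kill))
print(restarted_hosts(baseline, after_supervisor_kill))
```

A host showing up in the result means its worker process was replaced between the two snapshots, which is the symptom reported above for 169.machine.com after the supervisor on 168.machine.com is killed.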