Return-Path: Delivered-To: apmail-geronimo-user-archive@www.apache.org Received: (qmail 47523 invoked from network); 3 Nov 2009 18:11:42 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 3 Nov 2009 18:11:42 -0000 Received: (qmail 81891 invoked by uid 500); 3 Nov 2009 18:11:42 -0000 Delivered-To: apmail-geronimo-user-archive@geronimo.apache.org Received: (qmail 81841 invoked by uid 500); 3 Nov 2009 18:11:41 -0000 Mailing-List: contact user-help@geronimo.apache.org; run by ezmlm Precedence: bulk list-help: list-unsubscribe: List-Post: Reply-To: user@geronimo.apache.org List-Id: Delivered-To: mailing list user@geronimo.apache.org Received: (qmail 81833 invoked by uid 99); 3 Nov 2009 18:11:41 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 03 Nov 2009 18:11:41 +0000 X-ASF-Spam-Status: No, hits=-2.6 required=5.0 tests=AWL,BAYES_00,HTML_MESSAGE X-Spam-Check-By: apache.org Received-SPF: neutral (athena.apache.org: local policy) Received: from [209.85.217.211] (HELO mail-gx0-f211.google.com) (209.85.217.211) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 03 Nov 2009 18:11:34 +0000 Received: by gxk3 with SMTP id 3so7866540gxk.15 for ; Tue, 03 Nov 2009 10:11:12 -0800 (PST) MIME-Version: 1.0 Received: by 10.101.179.24 with SMTP id g24mr678248anp.62.1257271872330; Tue, 03 Nov 2009 10:11:12 -0800 (PST) In-Reply-To: References: <7A8BE6EB-2942-45DE-8FAF-8FF3778C38CC@gmail.com> From: Quintin Beukes Date: Tue, 3 Nov 2009 20:10:52 +0200 Message-ID: <1f3854d50911031010r1b4a313u346c5334be937d8f@mail.gmail.com> Subject: Re: 2.2 in production To: user@geronimo.apache.org Content-Type: multipart/alternative; boundary=001636c92868561d8904777b6bcd --001636c92868561d8904777b6bcd Content-Type: text/plain; charset=UTF-8 Hey, This isn't related to this problem. Just figured I'd mention it. The hostnames for the members are printed as tcp://{-64, -88, 1, 60}:4000. I checked it out and this is because it makes the hostname like so: this.hostname = org.apache.catalina.tribes.util.Arrays.toString(host); Where "host" is a byte[]. So the 192 as a signed 8bit gives -64. Is it supposed to be like this or should I fix/report it? Quintin Beukes On Tue, Nov 3, 2009 at 5:35 PM, Trygve Hardersen wrote: > > > On Mon, Nov 2, 2009 at 5:39 PM, Kevan Miller wrote: >> >> >> Thanks Gianny. I'd like to see this included in the Geronimo 2.2 release. >> Can we look for a new WADI release, soon? Once we know the problem is fixed? >> >> Trygve, the sooner we get confirmation that your issue is resolved, the >> sooner we can start finalizing the 2.2 release. >> >> --kevan >> > > I've now built Geronimo using the 2.2-SNAPSHOT of WADI and installed it on > our test environment. No obvious issues so I'll go ahead and deploy this to > production either later this evening (EU time) or tomorrow. Then it needs to > run for a few days before I can confirm if the issue has really been > resolved. > > BTW I got this on our test system: > > AS-000: > 16:23:17,773 INFO [TcpFailureDetector] Received > memberDisappeared[org.apache.catalina.tribes.membership.MemberImpl[tcp://{-64, > -88, 1, 61}:4000,{-64, -88, 1, 61},4000, alive=1814258,id={50 18 86 10 111 > -47 79 83 -108 -4 82 -8 26 82 -79 -59 }, payload={-84 -19 0 5 115 114 0 50 > 111 ...(423)}, command={}, domain={74 79 84 84 65 95 87 65 68 ...(10)}, ]] > message. Will verify. > 16:23:17,897 INFO [TcpFailureDetector] Verification complete. Member still > alive[org.apache.catalina.tribes.membership.MemberImpl[tcp://{-64, -88, 1, > 61}:4000,{-64, -88, 1, 61},4000, alive=1814258,id={50 18 86 10 111 -47 79 83 > -108 -4 82 -8 26 82 -79 -59 }, payload={-84 -19 0 5 115 114 0 50 111 > ...(423)}, command={}, domain={74 79 84 84 65 95 87 65 68 ...(10)}, ]] > > AS-001: > 16:23:18,446 INFO [TcpFailureDetector] Received > memberDisappeared[org.apache.catalina.tribes.membership.MemberImpl[tcp://{-64, > -88, 1, 60}:4000,{-64, -88, 1, 60},4000, alive=2500759,id={107 -64 91 -23 > 109 93 75 116 -95 109 110 22 -85 53 -52 85 }, payload={-84 -19 0 5 115 114 0 > 50 111 ...(423)}, command={}, domain={74 79 84 84 65 95 87 65 68 ...(10)}, > ]] message. Will verify. > 16:23:18,456 INFO [TcpFailureDetector] Verification complete. Member still > alive[org.apache.catalina.tribes.membership.MemberImpl[tcp://{-64, -88, 1, > 60}:4000,{-64, -88, 1, 60},4000, alive=2500759,id={107 -64 91 -23 109 93 75 > 116 -95 109 110 22 -85 53 -52 85 }, payload={-84 -19 0 5 115 114 0 50 111 > ...(423)}, command={}, domain={74 79 84 84 65 95 87 65 68 ...(10)}, ]] > > And then: > > AS-000 > 16:30:02,576 INFO [ChannelInterceptorBase] memberDisappeared:tcp://{-64, > -88, 1, 61}:4000 > 16:30:02,577 INFO [BasicPartitionBalancerSingletonService] Queueing > partition rebalancing > 16:30:02,600 INFO [SimpleStateManager] > ============================= > New Partition Balancing > Partition Balancing > Size [24] > Partition[0] owned by [TribesPeer [AS-000; tcp://192.168.1.60:4000]]; > version [3]; mergeVersion [0] > Partition[1] owned by [TribesPeer [AS-000; tcp://192.168.1.60:4000]]; > version [3]; mergeVersion [0] > Partition[2] owned by [TribesPeer [AS-000; tcp://192.168.1.60:4000]]; > version [3]; mergeVersion [0] > Partition[3] owned by [TribesPeer [AS-000; tcp://192.168.1.60:4000]]; > version [3]; mergeVersion [0] > Partition[4] owned by [TribesPeer [AS-000; tcp://192.168.1.60:4000]]; > version [3]; mergeVersion [0] > Partition[5] owned by [TribesPeer [AS-000; tcp://192.168.1.60:4000]]; > version [3]; mergeVersion [0] > Partition[6] owned by [TribesPeer [AS-000; tcp://192.168.1.60:4000]]; > version [3]; mergeVersion [0] > Partition[7] owned by [TribesPeer [AS-000; tcp://192.168.1.60:4000]]; > version [3]; mergeVersion [0] > Partition[8] owned by [TribesPeer [AS-000; tcp://192.168.1.60:4000]]; > version [3]; mergeVersion [0] > Partition[9] owned by [TribesPeer [AS-000; tcp://192.168.1.60:4000]]; > version [3]; mergeVersion [0] > Partition[10] owned by [TribesPeer [AS-000; tcp://192.168.1.60:4000]]; > version [3]; mergeVersion [0] > Partition[11] owned by [TribesPeer [AS-000; tcp://192.168.1.60:4000]]; > version [3]; mergeVersion [0] > Partition[12] owned by [TribesPeer [AS-000; tcp://192.168.1.60:4000]]; > version [3]; mergeVersion [0] > Partition[13] owned by [TribesPeer [AS-000; tcp://192.168.1.60:4000]]; > version [3]; mergeVersion [0] > Partition[14] owned by [TribesPeer [AS-000; tcp://192.168.1.60:4000]]; > version [3]; mergeVersion [0] > Partition[15] owned by [TribesPeer [AS-000; tcp://192.168.1.60:4000]]; > version [3]; mergeVersion [0] > Partition[16] owned by [TribesPeer [AS-000; tcp://192.168.1.60:4000]]; > version [3]; mergeVersion [0] > Partition[17] owned by [TribesPeer [AS-000; tcp://192.168.1.60:4000]]; > version [3]; mergeVersion [0] > Partition[18] owned by [TribesPeer [AS-000; tcp://192.168.1.60:4000]]; > version [3]; mergeVersion [0] > Partition[19] owned by [TribesPeer [AS-000; tcp://192.168.1.60:4000]]; > version [3]; mergeVersion [0] > Partition[20] owned by [TribesPeer [AS-000; tcp://192.168.1.60:4000]]; > version [3]; mergeVersion [0] > Partition[21] owned by [TribesPeer [AS-000; tcp://192.168.1.60:4000]]; > version [3]; mergeVersion [0] > Partition[22] owned by [TribesPeer [AS-000; tcp://192.168.1.60:4000]]; > version [3]; mergeVersion [0] > Partition[23] owned by [TribesPeer [AS-000; tcp://192.168.1.60:4000]]; > version [3]; mergeVersion [0] > ============================= > > 16:30:02,888 WARN [TcpFailureDetector] Member added, even though we werent > notified:org.apache.catalina.tribes.membership.MemberImpl[tcp://{-64, -88, > 1, 61}:4000,{-64, -88, 1, 61},4000, alive=2221072,id={50 18 86 10 111 -47 79 > 83 -108 -4 82 -8 26 82 -79 -59 }, payload={-84 -19 0 5 115 114 0 50 111 > ...(423)}, command={}, domain={74 79 84 84 65 95 87 65 68 ...(10)}, ] > 16:30:02,889 INFO [ChannelInterceptorBase] memberAdded:tcp://{-64, -88, 1, > 61}:4000 > > AS-001 > Nothing.... > > There is practically no load on this network. Anyway I'll try this with > load and see what happens. > > Many thanks again! > > Trygve > --001636c92868561d8904777b6bcd Content-Type: text/html; charset=UTF-8 Content-Transfer-Encoding: quoted-printable Hey,

This isn't related to this problem. Just figure= d I'd mention it.

The hostnames for the member= s are printed as=C2=A0tcp://{-64, -88, 1, 60}:4000. I checked it out and th= is is because it makes the hostname like so:
this.hostname =3D org.apache.catalina.tribes.util.Arrays.toString= (host);

Where "host" is a byte[]. So the= 192 as a signed 8bit gives -64. Is it supposed to be like this or should I= fix/report it?

Quintin Beukes


On Tue, Nov 3, 2009 at 5:35 PM, Trygve H= ardersen <trygve@jo= tta.no> wrote:


On Mon, Nov 2, 2009 at= 5:39 PM, Kevan Miller <kevan.miller@gmail.com> wrote:<= blockquote class=3D"gmail_quote" style=3D"margin:0 0 0 .8ex;border-left:1px= #ccc solid;padding-left:1ex">

Thanks Gianny. I'd like to see this included in the Geronimo 2.2 releas= e. Can we look for a new WADI release, soon? Once we know the problem is fi= xed?

Trygve, the sooner we get confirmation that your issue is resolved, the soo= ner we can start finalizing the 2.2 release.

--kevan

I've now built Geronimo using = the 2.2-SNAPSHOT of WADI and installed it on our test environment. No obvio= us issues so I'll go ahead and deploy this to production either later t= his evening (EU time) or tomorrow. Then it needs to run for a few days befo= re I can confirm if the issue has really been resolved.

BTW I got this on our test system:

=
AS-000:
16:23:17,773 INFO =C2=A0[TcpFailureDetector] Re= ceived memberDisappeared[org.apache.catalina.tribes.membership.MemberImpl[t= cp://{-64, -88, 1, 61}:4000,{-64, -88, 1, 61},4000, alive=3D1814258,id=3D{5= 0 18 86 10 111 -47 79 83 -108 -4 82 -8 26 82 -79 -59 }, payload=3D{-84 -19 = 0 5 115 114 0 50 111 ...(423)}, command=3D{}, domain=3D{74 79 84 84 65 95 8= 7 65 68 ...(10)}, ]] message. Will verify.
16:23:17,897 INFO =C2=A0[TcpFailureDetector] Verification complete. Me= mber still alive[org.apache.catalina.tribes.membership.MemberImpl[tcp://{-6= 4, -88, 1, 61}:4000,{-64, -88, 1, 61},4000, alive=3D1814258,id=3D{50 18 86 = 10 111 -47 79 83 -108 -4 82 -8 26 82 -79 -59 }, payload=3D{-84 -19 0 5 115 = 114 0 50 111 ...(423)}, command=3D{}, domain=3D{74 79 84 84 65 95 87 65 68 = ...(10)}, ]]

AS-001:
16:23:18,446 INFO =C2=A0[T= cpFailureDetector] Received memberDisappeared[org.apache.catalina.tribes.me= mbership.MemberImpl[tcp://{-64, -88, 1, 60}:4000,{-64, -88, 1, 60},4000, al= ive=3D2500759,id=3D{107 -64 91 -23 109 93 75 116 -95 109 110 22 -85 53 -52 = 85 }, payload=3D{-84 -19 0 5 115 114 0 50 111 ...(423)}, command=3D{}, doma= in=3D{74 79 84 84 65 95 87 65 68 ...(10)}, ]] message. Will verify.
16:23:18,456 INFO =C2=A0[TcpFailureDetector] Verification complete. Me= mber still alive[org.apache.catalina.tribes.membership.MemberImpl[tcp://{-6= 4, -88, 1, 60}:4000,{-64, -88, 1, 60},4000, alive=3D2500759,id=3D{107 -64 9= 1 -23 109 93 75 116 -95 109 110 22 -85 53 -52 85 }, payload=3D{-84 -19 0 5 = 115 114 0 50 111 ...(423)}, command=3D{}, domain=3D{74 79 84 84 65 95 87 65= 68 ...(10)}, ]]

And then:

AS-000
16:30:02,576 INFO =C2=A0[ChannelInterceptorBase] memberDisappeared:tcp://= {-64, -88, 1, 61}:4000
16:30:02,577 INFO =C2=A0[BasicPartitionBal= ancerSingletonService] Queueing partition rebalancing
16:30:02,600 INFO =C2=A0[SimpleStateManager]=C2=A0
=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D=3D=3D=3D=3D=3D=3D
New Partition Balancing
Partition= Balancing
=C2=A0=C2=A0 =C2=A0Size [24]
=C2=A0=C2= =A0 =C2=A0Partition[0] owned by [TribesPeer [AS-000; tcp://192.168.1.60:400= 0]]; version [3]; mergeVersion [0]
=C2=A0=C2=A0 =C2=A0Partition[1] owned by [TribesPeer [AS-000; tcp://19= 2.168.1.60:4000]]; version [3]; mergeVersion [0]
=C2=A0=C2=A0 =C2= =A0Partition[2] owned by [TribesPeer [AS-000; tcp://192.168.1.60:4000]]; ve= rsion [3]; mergeVersion [0]
=C2=A0=C2=A0 =C2=A0Partition[3] owned by [TribesPeer [AS-000; tcp://19= 2.168.1.60:4000]]; version [3]; mergeVersion [0]
=C2=A0=C2=A0 =C2= =A0Partition[4] owned by [TribesPeer [AS-000; tcp://192.168.1.60:4000]]; ve= rsion [3]; mergeVersion [0]
=C2=A0=C2=A0 =C2=A0Partition[5] owned by [TribesPeer [AS-000; tcp://19= 2.168.1.60:4000]]; version [3]; mergeVersion [0]
=C2=A0=C2=A0 =C2= =A0Partition[6] owned by [TribesPeer [AS-000; tcp://192.168.1.60:4000]]; ve= rsion [3]; mergeVersion [0]
=C2=A0=C2=A0 =C2=A0Partition[7] owned by [TribesPeer [AS-000; tcp://19= 2.168.1.60:4000]]; version [3]; mergeVersion [0]
=C2=A0=C2=A0 =C2= =A0Partition[8] owned by [TribesPeer [AS-000; tcp://192.168.1.60:4000]]; ve= rsion [3]; mergeVersion [0]
=C2=A0=C2=A0 =C2=A0Partition[9] owned by [TribesPeer [AS-000; tcp://19= 2.168.1.60:4000]]; version [3]; mergeVersion [0]
=C2=A0=C2=A0 =C2= =A0Partition[10] owned by [TribesPeer [AS-000; tcp://192.168.1.60:4000]]; v= ersion [3]; mergeVersion [0]
=C2=A0=C2=A0 =C2=A0Partition[11] owned by [TribesPeer [AS-000; tcp://1= 92.168.1.60:4000]]; version [3]; mergeVersion [0]
=C2=A0=C2=A0 = =C2=A0Partition[12] owned by [TribesPeer [AS-000; tcp://192.168.1.60:4000]]= ; version [3]; mergeVersion [0]
=C2=A0=C2=A0 =C2=A0Partition[13] owned by [TribesPeer [AS-000; tcp://1= 92.168.1.60:4000]]; version [3]; mergeVersion [0]
=C2=A0=C2=A0 = =C2=A0Partition[14] owned by [TribesPeer [AS-000; tcp://192.168.1.60:4000]]= ; version [3]; mergeVersion [0]
=C2=A0=C2=A0 =C2=A0Partition[15] owned by [TribesPeer [AS-000; tcp://1= 92.168.1.60:4000]]; version [3]; mergeVersion [0]
=C2=A0=C2=A0 = =C2=A0Partition[16] owned by [TribesPeer [AS-000; tcp://192.168.1.60:4000]]= ; version [3]; mergeVersion [0]
=C2=A0=C2=A0 =C2=A0Partition[17] owned by [TribesPeer [AS-000; tcp://1= 92.168.1.60:4000]]; version [3]; mergeVersion [0]
=C2=A0=C2=A0 = =C2=A0Partition[18] owned by [TribesPeer [AS-000; tcp://192.168.1.60:4000]]= ; version [3]; mergeVersion [0]
=C2=A0=C2=A0 =C2=A0Partition[19] owned by [TribesPeer [AS-000; tcp://1= 92.168.1.60:4000]]; version [3]; mergeVersion [0]
=C2=A0=C2=A0 = =C2=A0Partition[20] owned by [TribesPeer [AS-000; tcp://192.168.1.60:4000]]= ; version [3]; mergeVersion [0]
=C2=A0=C2=A0 =C2=A0Partition[21] owned by [TribesPeer [AS-000; tcp://1= 92.168.1.60:4000]]; version [3]; mergeVersion [0]
=C2=A0=C2=A0 = =C2=A0Partition[22] owned by [TribesPeer [AS-000; tcp://192.168.1.60:4000]]= ; version [3]; mergeVersion [0]
=C2=A0=C2=A0 =C2=A0Partition[23] owned by [TribesPeer [AS-000; tcp://1= 92.168.1.60:4000]]; version [3]; mergeVersion [0]
=3D=3D=3D=3D=3D= =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D

16:30:02,888 WARN =C2=A0[TcpFailureDetector] Member = added, even though we werent notified:org.apache.catalina.tribes.membership= .MemberImpl[tcp://{-64, -88, 1, 61}:4000,{-64, -88, 1, 61},4000, alive=3D22= 21072,id=3D{50 18 86 10 111 -47 79 83 -108 -4 82 -8 26 82 -79 -59 }, payloa= d=3D{-84 -19 0 5 115 114 0 50 111 ...(423)}, command=3D{}, domain=3D{74 79 = 84 84 65 95 87 65 68 ...(10)}, ]
16:30:02,889 INFO =C2=A0[ChannelInterceptorBase] memberAdded:tcp://{-6= 4, -88, 1, 61}:4000

AS-001
Nothing....

There is practically no load on this network.= Anyway I'll try this with load and see what happens.

Many thanks again!

Trygve

--001636c92868561d8904777b6bcd--