Return-Path: Delivered-To: apmail-cassandra-user-archive@www.apache.org Received: (qmail 22253 invoked from network); 20 Apr 2010 02:41:25 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 20 Apr 2010 02:41:25 -0000 Received: (qmail 77368 invoked by uid 500); 20 Apr 2010 02:41:24 -0000 Delivered-To: apmail-cassandra-user-archive@cassandra.apache.org Received: (qmail 77335 invoked by uid 500); 20 Apr 2010 02:41:24 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 77327 invoked by uid 99); 20 Apr 2010 02:41:24 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 20 Apr 2010 02:41:24 +0000 X-ASF-Spam-Status: No, hits=-0.3 required=10.0 tests=AWL,FREEMAIL_FROM,RCVD_IN_DNSWL_NONE,SPF_PASS,T_TO_NO_BRKTS_FREEMAIL X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of jbellis@gmail.com designates 74.125.82.44 as permitted sender) Received: from [74.125.82.44] (HELO mail-ww0-f44.google.com) (74.125.82.44) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 20 Apr 2010 02:41:18 +0000 Received: by wwi18 with SMTP id 18so1091916wwi.31 for ; Mon, 19 Apr 2010 19:40:57 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:received:in-reply-to:references :from:date:received:message-id:subject:to:content-type :content-transfer-encoding; bh=i8B70J5Te7AhyUEGz3oDcrHMm+SsD0YCUkwtzm6/Zzc=; b=LL+57zY+UpxWG82GQwy/3WnLl/D157yEGKWvoNJH3AHgMqWgCmJwr/v9AhWY15S1Kv 4AjCwzyvmHak3lMZSe3WV9G9t4gVdMK5n3MG8V0loSJD4SWYBRqZhj6KEMd03hln0ezx Qtc2pyQwSSADiuUpK0YCfkOl8SPS0ftjVkxGc= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:from:date:message-id:subject:to :content-type:content-transfer-encoding; b=oC6NRw3hiLQUUE/ygVRPPd8rFjb5zvGO+p3cv1SAgpZOyiycIp6sO/uuO49fwXRlTN H/wct0z8zl4DIZ9MuuoDBPLKyIIQuiVxm/QSHtkc+gxjvLTyGPBNzGINeAunBfmi+6K4 FIu295cMDoyg5Dl5pJQ0+CYqVt5HG8MABrQFQ= MIME-Version: 1.0 Received: by 10.216.17.147 with HTTP; Mon, 19 Apr 2010 19:40:37 -0700 (PDT) In-Reply-To: References: <4BCCCF9E.80002@digg.com> From: Jonathan Ellis Date: Mon, 19 Apr 2010 21:40:37 -0500 Received: by 10.216.90.141 with SMTP id e13mr6144626wef.166.1271731257146; Mon, 19 Apr 2010 19:40:57 -0700 (PDT) Message-ID: Subject: Re: busy thread on IncomingStreamReader ? To: user@cassandra.apache.org Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable I don't see csArena-tmp-6-Index.db in the incoming files list. If it's not there, that means that it did break out of that while loop. Did you check both logs for exceptions? On Mon, Apr 19, 2010 at 9:36 PM, Ingram Chen wrote: > Ouch ! I talk too early ! > > We still suffer same problems after upgrade to 1.6.0_20. > > In JMX StreamingService, I see several wired incoming/outgoing transfer: > > In Host A, 192.168.2.87 > > StreamingService Status: > Done with transfer to /192.168.2.88 > > StreamingService StreamSources: > [/192.168.2.88] > > StreamingService StreamDestinations: > [/192.168.2.88] > > StreamingService getIncomingFiles=3D192.168.2.88 > [ > UserState: /var/lib/cassandra/data/UserState/multiMine-tmp-11-Index.db > 0/5718, > UserState: /var/lib/cassandra/data/UserState/multiMine-tmp-11-Filter.db > 0/325, > UserState: /var/lib/cassandra/data/UserState/multiMine-tmp-11-Data.db > 0/29831, > UserState: /var/lib/cassandra/data/UserState/csArena-tmp-13-Index.db > 0/47623, > > ... omit several 0 received pending files..... > > UserState: /var/lib/cassandra/data/UserState/battleCity2-tmp-19-Data.db > 0/355041, > > UserState: /var/lib/cassandra/data/UserState/mahjong-tmp-12-Data.db > 27711/2173906, > UserState: /var/lib/cassandra/data/UserState/darkChess-tmp-12-Data.db > 27711/18821998, > UserState: /var/lib/cassandra/data/UserState/battleCity2-tmp-6-Data.db > 27711/743037, > UserState: /var/lib/cassandra/data/UserState/big2-tmp-12-Index.db > 27711/189214, > UserState: /var/lib/cassandra/data/UserState/facebookPoker99-tmp-6-Data.d= b > 27711/1892375, > UserState: /var/lib/cassandra/data/UserState/facebookPoker99-tmp-6-Index.= db > 27711/143216, > UserState: /var/lib/cassandra/data/UserState/csArena-tmp-6-Data.db > 27711/201188, > UserState: /var/lib/cassandra/data/UserState/darkChess-tmp-12-Index.db > 27711/354923, > UserState: /var/lib/cassandra/data/UserState/big2-tmp-12-Data.db > 27711/1260768, > UserState: /var/lib/cassandra/data/UserState/mahjong-tmp-12-Index.db > 27711/332649, > UserState: /var/lib/cassandra/data/UserState/battleCity2-tmp-6-Index.db > 27711/39739 > ] > > lots of files stalled after receiving 27711 bytes. this strange number is > the length of first file to income, see Host B > > Host B, 192.168.2.88 > > StreamingService Status: > Receiving stream > > StreamingService StreamSources: > StreamSources: [/192.168.2.87] > > StreamingService StreamDestinations: > =A0[/192.168.2.87] > > StreamingService getOutgoingFiles=3D192.168.2.87 > [ > /var/lib/cassandra/data/UserState/stream/csArena-6-Index.db 27711/27711, > /var/lib/cassandra/data/UserState/stream/csArena-6-Filter.db 0/1165, > /var/lib/cassandra/data/UserState/stream/csArena-6-Data.db 0/201188, > > ... omit pending outgoing files .... > ] > > It seems that outgoing files does not terminate properly. and cause the > receiver goes into infinite loop to cause busy thread. From thread dump, = it > looks like fc.transferFrom() in IncomingStreamReader never return: > > =A0=A0=A0 while (bytesRead < pendingFile.getExpectedBytes()) { > =A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0 bytesRead +=3D fc.transferF= rom(socketChannel, bytesRead, > FileStreamTask.CHUNK_SIZE); > =A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0 pendingFile.update(bytesRea= d); > =A0=A0=A0 } > > > On Tue, Apr 20, 2010 at 05:48, Rob Coli wrote: >> >> On 4/17/10 6:47 PM, Ingram Chen wrote: >>> >>> after upgrading jdk from =A01.6.0_16 to =A01.6.0_20, the problem solved= . >> >> FYI, this sounds like it might be : >> >> https://issues.apache.org/jira/browse/CASSANDRA-896 >> >> http://bugs.sun.com/view_bug.do;jsessionid=3D60c39aa55d3666c0c84dd70eb82= 6?bug_id=3D6805775 >> >> Where garbage collection issues in JVM/JDKs before 7.b70 leads to GC >> storming which hoses performance. >> >> =3DRob > > > > >