From: Jeff Trawick
To: dev@apr.apache.org
Date: Wed, 10 Feb 2010 22:11:39 -0500
Subject: Re: [VOTE] release apr-1.3.12

On Wed, Feb 10, 2010 at 9:50 PM, Gregory Szorc wrote:
>> what does "pstack PID" display when it is hung?
>
> 3305:   ./testall testpoll
>  fedc9a45 portfs   (6, 37, 8110b98, 32, 32, 0)
>  fef898a6 apr_pollcb_poll (8110b88, ffffffff, ffffffff, 806626d, 8047b04, 8047b3c) + 82
>  08066408 trigger_pollcb (8047b3c, 0, 80772eb, 0, 80f55d8, 80f55d8) + 117
>  080557c1 abts_run_test (80f5b20, 80662f1, 0, 0, 807384d, 8088eac) + 56
>  08066904 testpoll (80f5b20, 4, fefcab34, 8047ba4, 16, 807512a) + 1fa
>  08056171 main     (8055170, 2, 8047bbc) + 20f
>  08055170 _start   (2, 8047ca8, 8047cb2, 0, 8047cbb, 8047cf3) + 80

Hmmm... If I comment out the send_msg() call so that there's no data
available yet, my backtrace looks like

25072: ./testall testpoll
 fee04157 portfs (6, 37, 81155a0, 32, 1, 0)
 fef87218 call_port_getn (37, 81155a0, 32, 8047180, ffffffff, ffffffff) + c8
 fef881a4 apr_pollcb_poll (8115590, ffffffff, ffffffff, 8066c40, 80471c0, 0) + 54
 08066dde trigger_pollcb (80471fc, 0) + fe
 08056331 abts_run_test (80fa528, 8066ce0, 0) + 71
 08067274 testpoll (80fa528, 0) + 184
 08056ee3 main (2, 8047284, 8047290, 8055c8f) + 213
 08055ced _start (2, 80473b8, 80473c2, 0, 80473cb, 804743d) + 7d

Interestingly, the fifth parm to portfs() in your backtrace is 32 ==
nalloc, and the fifth parm to portfs() in mine is 1.
Meanwhile, there's a bug fix in 1.3.12 to fix a hang in
apr_pollcb_poll() on Solaris by passing 1 instead of nalloc for the
number of events to wait for. I can't imagine how you wouldn't have
the fix or would be running the wrong libapr, but can you check with
pldd which libapr is getting loaded just in case? (I'm guessing the
absence of call_port_getn() in the backtrace is due to gcc inlining,
though apr <= 1.3.8 doesn't have that function.)

> I have access to other Solaris releases. I can always try to compile and
> test on them. I may also start going through the SVN commits and isolating
> the failure to a specific revision. Of course, this could all be related to
> my toolchain - I'm using GNU for everything but the linker. Still, a bug is
> a bug.

Sure; it might be quicker to use LD_LIBRARY_PATH to run the 1.3.12
testall testpoll against apr 1.3.9's libapr before sorting through
individual commits.

(so weird)
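
For anyone who wants to repeat the comparison of the two portfs() frames
mechanically, here is a rough sketch that pulls the argument list out of a
pstack frame line. The helper name frame_args and the regular expression are
made up for illustration and are not part of APR or pstack; the sketch also
assumes pstack prints its arguments in hexadecimal.

```python
import re

# One pstack frame: hex address, symbol name, parenthesised argument list.
# Anything after the closing parenthesis (e.g. "+ c8") is ignored.
FRAME_RE = re.compile(r"^\s*([0-9a-f]+)\s+(\w+)\s*\(([^)]*)\)")


def frame_args(line):
    """Return (symbol, [args as ints]) for a pstack frame line, else None.

    Hypothetical helper: arguments are assumed to be hexadecimal.
    """
    match = FRAME_RE.match(line)
    if match is None:
        return None
    raw = [arg.strip() for arg in match.group(3).split(",") if arg.strip()]
    return match.group(2), [int(arg, 16) for arg in raw]


# The two portfs() frames quoted in this thread.
reported = "fedc9a45 portfs   (6, 37, 8110b98, 32, 32, 0)"
reference = "fee04157 portfs (6, 37, 81155a0, 32, 1, 0)"

for label, line in (("reported", reported), ("reference", reference)):
    symbol, args = frame_args(line)
    # Index 4 is the fifth parameter discussed above; print it back in hex.
    print(label, symbol, "fifth arg = 0x%x" % args[4])
```

If the fifth argument in a fresh pstack of the hung process is still 32
rather than 1, that points at a libapr without the fix being loaded.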