Mailing-List: contact derby-user-help@db.apache.org; run by ezmlm
Precedence: bulk
Reply-To: "Derby Discussion" <derby-user@db.apache.org>
Received-SPF: pass (nike.apache.org: domain of charlie.hubbard@gmail.com
 designates 74.125.92.25 as permitted sender)
DomainKey-Signature: a=rsa-sha1; c=nofws;
        d=gmail.com; s=gamma;
        h=mime-version:in-reply-to:references:date:message-id:subject:from:to
         :content-type;
        b=fB0ysrbyVW2O6fFWuz1hMIEqu4spncSWb1PkmycNcqsWNcJh+UysIOF+RRGZ2/d9si
         mf/JCje0oVqZiFGkW+kRGSdKM+j7KumPFIFRnBd9z9OeckuOkAFcE5AGtusyUvTDqu/r
         DjRFdVVq/HPCTox3RuSYtKq7mYm+5AML7z12Y=
MIME-Version: 1.0
In-Reply-To: <4B7C77EC.8020109@amberpoint.com>
References: <b4dfc01b1002171310t412c36bdyfc9864dfc5c819e2@mail.gmail.com>
	 <4B7C77EC.8020109@amberpoint.com>
Date: Thu, 18 Feb 2010 14:42:32 -0500
Message-ID: <b4dfc01b1002181142p5e5517f9g57c69d269ed0e7da@mail.gmail.com>
Subject: Re: Deadlocked in Log2File
From: Charlie Hubbard <charlie.hubbard@gmail.com>
To: Derby Discussion <derby-user@db.apache.org>
Content-Type: multipart/alternative; boundary=001485f87c6203eb19047fe52b94

--001485f87c6203eb19047fe52b94
Content-Type: text/plain; charset=ISO-8859-1

We do not call SYSCS_UTIL.SYSCS_FREEZE_DATABASE or
SYSCS_UTIL.SYSCS_CHECKPOINT_DATABASE.

We do perform backups using CALL SYSCS_UTIL.SYSCS_BACKUP_DATABASE(?).
 However, in this particular test case the backup hasn't been run.  So, for
this particular test case the backup_database stored proc hasn't been
called.

This is a new application that uses derby so I can't say if this is new
behavior for this application or not.  We don't do any replication.  This
seems to happen fairly frequently if I try and process a very large data set
for my application (70,000 entries).  For smaller data set sizes it seems to
work out ok (7500 entries).

I encountered this from within my profiler so I decided to try this outside
the profiler, and I didn't encounter any locking issue.  If I do this within
my profiler I see it frequently.  For right now I think it's the profiler
unless I can reproduce this outside my profiler.

I'm using Derby 10.5.3.

Thanks,
Charlie

On Wed, Feb 17, 2010 at 6:12 PM, Bryan Pendleton
<bpendleton@amberpoint.com>wrote:

>  I'm sometimes getting deadlocks in Log2File where all my worker threads
>> are getting blocked, but I can't figure out which monitor they are blocked
>> on.
>>
>
> Thanks for sending the thread dump, it was very interesting.
>
> I agree with you, it's not clear what is blocking these threads,
> at least from the thread dump.
>
> I think this could be a Derby bug. How often does it happen? Is it
> a new behavior?
>
> Looking at the source code, LogToFile.flush() has some complicated
> synchronization logic which interacts with the backup/checkpoint methods.
>
> Does your application perform a backup?
>
> Does your application involve replication?
>
> Does your application ever call SYSCS_UTIL.SYSCS_FREEZE_DATABASE
> or SYSCS_UTIL.SYSCS_CHECKPOINT_DATABASE?
>
> If you are calling SYSCS_FREEZE_DATABASE, is it possible that you
> have some path through your system which fails to subsequently call
> SYSCS_UNFREEZE_DATABASE?
>
> thanks,
>
> bryan
>
>

--001485f87c6203eb19047fe52b94
Content-Type: text/html; charset=ISO-8859-1
Content-Transfer-Encoding: quoted-printable

We do not call=A0<span style=3D"font-family:arial, sans-serif;font-size:13p=
x;border-collapse:collapse">SYSCS_UTIL.SYSCS_FREEZE_DATABASE or=A0</span><s=
pan style=3D"font-family:arial, sans-serif;font-size:13px;border-collapse:c=
ollapse">SYSCS_UTIL.SYSCS_CHECKPOINT_DATABASE.</span><div>

<font face=3D"arial, sans-serif"><span style=3D"border-collapse:collapse"><=
br></span></font></div><div><font face=3D"arial, sans-serif"><span style=3D=
"border-collapse:collapse">We do perform backups using=A0CALL SYSCS_UTIL.SY=
SCS_BACKUP_DATABASE(?). =A0However, in this particular test case the backup=
 hasn&#39;t been run. =A0So, for this particular test case the backup_datab=
ase stored proc hasn&#39;t been called.</span></font></div>

<div><font face=3D"arial, sans-serif"><span style=3D"border-collapse:collap=
se"><br></span></font></div><div><font face=3D"arial, sans-serif"><span sty=
le=3D"border-collapse:collapse">This is a new application that uses derby s=
o I can&#39;t say if this is new behavior for this application or not. =A0W=
e don&#39;t do any replication. =A0This seems to happen fairly frequently i=
f I try and process a very large data set for my application (70,000 entrie=
s). =A0For smaller data set sizes it seems to work out ok (7500 entries).</=
span></font></div>
<div><font face=3D"arial, sans-serif"><span style=3D"border-collapse:collap=
se"><br></span></font></div><div><font face=3D"arial, sans-serif"><span sty=
le=3D"border-collapse:collapse">I encountered this from within my profiler =
so I decided to try this outside the profiler, and I didn&#39;t encounter a=
ny locking issue. =A0If I do this within my profiler I see it frequently. =
=A0For right now I think it&#39;s the profiler unless I can reproduce this =
outside my profiler.</span></font></div>
<div><font face=3D"arial, sans-serif"><span style=3D"border-collapse:collap=
se"><br></span></font></div><div><font face=3D"arial, sans-serif"><span sty=
le=3D"border-collapse:collapse">I&#39;m using Derby 10.5.3.</span></font></=
div>

<div><font face=3D"arial, sans-serif"><span style=3D"border-collapse:collap=
se"><br></span></font></div><div><font face=3D"arial, sans-serif"><span sty=
le=3D"border-collapse:collapse">Thanks,</span></font></div><div><font face=
=3D"arial, sans-serif"><span style=3D"border-collapse:collapse">Charlie<br>

</span></font><br><div class=3D"gmail_quote">On Wed, Feb 17, 2010 at 6:12 P=
M, Bryan Pendleton <span dir=3D"ltr">&lt;<a href=3D"mailto:bpendleton@amber=
point.com" target=3D"_blank">bpendleton@amberpoint.com</a>&gt;</span> wrote=
:<br>
<blockquote class=3D"gmail_quote" style=3D"margin:0 0 0 .8ex;border-left:1p=
x #ccc solid;padding-left:1ex">
<div><blockquote class=3D"gmail_quote" style=3D"margin:0 0 0 .8ex;border-le=
ft:1px #ccc solid;padding-left:1ex">
I&#39;m sometimes getting deadlocks in Log2File where all my worker threads=
 are getting blocked, but I can&#39;t figure out which monitor they are blo=
cked on. <br>
</blockquote>
<br></div>
Thanks for sending the thread dump, it was very interesting.<br>
<br>
I agree with you, it&#39;s not clear what is blocking these threads,<br>
at least from the thread dump.<br>
<br>
I think this could be a Derby bug. How often does it happen? Is it<br>
a new behavior?<br>
<br>
Looking at the source code, LogToFile.flush() has some complicated<br>
synchronization logic which interacts with the backup/checkpoint methods.<b=
r>
<br>
Does your application perform a backup?<br>
<br>
Does your application involve replication?<br>
<br>
Does your application ever call SYSCS_UTIL.SYSCS_FREEZE_DATABASE<br>
or SYSCS_UTIL.SYSCS_CHECKPOINT_DATABASE?<br>
<br>
If you are calling SYSCS_FREEZE_DATABASE, is it possible that you<br>
have some path through your system which fails to subsequently call<br>
SYSCS_UNFREEZE_DATABASE?<br>
<br>
thanks,<br><font color=3D"#888888">
<br>
bryan<br>
<br>
</font></blockquote></div><br></div>

--001485f87c6203eb19047fe52b94--