Mailing-List: contact user-java-help@ibatis.apache.org; run by ezmlm
Precedence: bulk
Reply-To: user-java@ibatis.apache.org
MIME-Version: 1.0
In-Reply-To: <16178eb10901202012v5238f874n37693ea9d269888b@mail.gmail.com>
References: <16178eb10901200543s160f8e78h62d2fd35487fb162@mail.gmail.com>
	 <536e8800901201133w692015d5ia8882b1c144a45b8@mail.gmail.com>
	 <16178eb10901201139u5f35c67bl74c41187e92e88c6@mail.gmail.com>
	 <d4ebe7ab0901201150v7bfd5503sed3523232357f16@mail.gmail.com>
	 <536e8800901201212k7505d48dgf2537076e3dccbbc@mail.gmail.com>
	 <16178eb10901202012v5238f874n37693ea9d269888b@mail.gmail.com>
Date: Tue, 20 Jan 2009 22:47:33 -0700
Message-ID: <d4ebe7ab0901202147x6d7b6f1bqb0ebcbbf7db9c42a@mail.gmail.com>
Subject: Re: [SURVEY] How many connections are in your pool?
From: Sundar Sankar <fatboysuns@gmail.com>
To: user-java@ibatis.apache.org
Content-Type: multipart/alternative; boundary=0016363b7e9c4385bc0460f7b1fb

--0016363b7e9c4385bc0460f7b1fb
Content-Type: text/plain; charset=ISO-8859-1
Content-Transfer-Encoding: 7bit

Thanks so much, Clinton. That was terrific!

-Sundar

On Tue, Jan 20, 2009 at 9:12 PM, Clinton Begin <clinton.begin@gmail.com> wrote:

> Absolutely.  In addition to general resource contention (CPU, Disk I/O
> etc.), you also have to consider lock contention against the database tables
> themselves.  Relational databases do not scale well in this regard.  Throw
> as much CPU power and hardware against your database as you like, as soon as
> you lock a table, the game is over, and everyone else has to wait.
>
> But to address fatboysuns' (and Rick's and Nathan's) question: "Isn't the
> number of connections in a pool related to the number of parallel users
> accessing the application rather than the number of CPU cores on the
> database server?"
>
> The answer is no. Historically, databases allowed hundreds of concurrent
> connections (Oracle defaults to 1000) because end users were originally
> expected to connect directly to the database.  SQL was originally
> intended as an end-user command-line interface to the database.  Eventually
> it was decided that it was too complex for end users, so we threw UIs at it
> (anyone ever use Oracle Forms?)... still 1 user per connection.
>
> But while they allowed 1000s of connections at a time, this did not imply
> that ALL 1000 could be active with a transaction or a query at the same
> time.  And back then, since the number of users was actually quite low, and
> the speed at which the transactions occurred was generally limited by how
> fast a person could enter data into a form, it was a fairly low risk.
>
> But now, with N-tier architecture and web applications that serve
> thousands of users, we cannot just bump the number of
> connections in the pool up to 1000 and be done with it.
>
> The number of effective transactions/queries allowed at any given time
> should be constrained artificially to avoid creating too much contention on
> the database resources.
>
> I usually use 2 or 3 times the number of CPUs in order to:
>
>  * allow for some low-level optimization of the threads, as they have to wait
> for disk I/O and modern hardware allows for pretty deep pipelines of queued
> "work",
>  * allow for some opportunistic parallel processing (especially in
> databases with LOTS of tables and mutually exclusive access to those
> tables),
>  * absorb latency if the Java app does additional processing between
> transaction steps (which should be avoided if at all possible).
>
> 2 - 3 times is reasonable, even up to 5 times.  If it were 10 times, I'd
> start to wonder....
>
> But over 100 times is terribly odd and I can't imagine how that could be
> good for performance.  It seems to me it's just an opportunity for tons of
> stale connections, wasted resources, deadlocking, and excessive resource
> contention.
>
> The best place to block is high in the app architecture.  On an 8-core app
> server and an 8-core database server, I might allow 48 concurrent threads on
> the app server (half of which will often be waiting for the DB at any given
> time) and 24 on the database server.
>
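
A minimal sketch of the rule of thumb above, assuming the 8-core example: 3x
the database cores gives 24 connections, and 48 app threads is read here as
twice that, since about half of them wait on the DB. The class and method
names (PoolSizing, DbGate) are hypothetical and not part of iBATIS or any pool
library; the Semaphore is just one way to "block high in the app
architecture".

```java
import java.util.concurrent.Semaphore;
import java.util.function.Supplier;

// Hypothetical helper: the multipliers are the rule of thumb from the thread,
// not values taken from any library.
final class PoolSizing {
    // Connections = database cores * multiplier (2 to 3, perhaps up to 5).
    static int dbConnections(int dbCores, int multiplier) {
        if (dbCores < 1 || multiplier < 1) {
            throw new IllegalArgumentException("cores and multiplier must be >= 1");
        }
        return dbCores * multiplier;
    }

    // Assumption: about half of the app threads wait on the DB at any time,
    // so allow twice as many app threads as DB connections.
    static int appThreads(int dbConnections) {
        return dbConnections * 2;
    }
}

// Hypothetical gate that blocks callers in the app tier before they
// reach the database, instead of queueing them inside the database.
final class DbGate {
    private final Semaphore permits;

    DbGate(int maxConcurrent) {
        this.permits = new Semaphore(maxConcurrent, true); // fair: FIFO waiters
    }

    <T> T call(Supplier<T> work) {
        permits.acquireUninterruptibly(); // wait here when the limit is reached
        try {
            return work.get();
        } finally {
            permits.release();
        }
    }

    int available() {
        return permits.availablePermits();
    }
}

class PoolSizingDemo {
    public static void main(String[] args) {
        int db = PoolSizing.dbConnections(8, 3);   // 8-core database server
        int app = PoolSizing.appThreads(db);
        DbGate gate = new DbGate(db);
        String result = gate.call(() -> "query result placeholder");
        System.out.println(db + " DB connections, " + app + " app threads: " + result);
    }
}
```

The gate only limits how many callers run at once; the pool's own maximum
would still need to be set to the same number.
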
> Cheers,
> Clinton
>
>
> On Tue, Jan 20, 2009 at 1:12 PM, Nicholoz Koka Kiknadze <
> kiknadze@gmail.com> wrote:
>
>> Hi Sundar,
>>
>> I am not a hardware expert, but I suspect that even with modern DMA
>> access etc., if you ask your CPU to process N database transactions (initiated
>> by different users) in parallel, it may take longer than asking
>> it to do them sequentially. So it is quite possible that pools with more
>> connections than CPUs induce performance penalties. In other words, the time
>> your pool waits for a connection to become available is just caused
>> by your hardware (CPU) being busy, so why add extra latency with extra pool
>> code...
>>
>> Again, of course this logic cannot be applied to long-running transactions
>> where the CPU idles in the midst of a transaction waiting for e.g. extra user
>> input.
>>
>>
>> On Tue, Jan 20, 2009 at 2:50 PM, Sundar Sankar <fatboysuns@gmail.com> wrote:
>>
>>> Hi Clinton,
>>> I apologize in advance if I am missing something or not getting
>>> something right. As far as I understand, isn't the number of
>>> connections in a pool related to the number of parallel users that
>>> access the application rather than the number of CPU cores on the
>>> database server?
>>>
>>> Regards
>>> S
>>>
>>>
>>> On Tue, Jan 20, 2009 at 12:39 PM, Clinton Begin <clinton.begin@gmail.com
>>> > wrote:
>>>
>>>> It sounds like you're still using a "pool", but your max, min, idle, and
>>>> active connections are all equal (i.e. 16).  Otherwise, how do you allocate
>>>> connections to the incoming requests?
>>>>
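
One hypothetical way a fixed set of always-open connections could be handed to
incoming requests is a blocking queue, sketched below. This is an assumption
for illustration only, not a description of the application discussed above;
the FixedConnections class is invented, and the connection type is left
generic so the sketch runs without a JDBC driver.

```java
import java.util.List;
import java.util.concurrent.ArrayBlockingQueue;
import java.util.concurrent.BlockingQueue;
import java.util.function.Function;

// Hypothetical holder for a fixed number of always-open connections
// (for example, one per processor). C stands in for java.sql.Connection.
final class FixedConnections<C> {
    private final BlockingQueue<C> idle;

    FixedConnections(List<C> connections) {
        // Capacity equals the number of connections; fair ordering for waiters.
        this.idle = new ArrayBlockingQueue<>(connections.size(), true, connections);
    }

    // Borrow a connection, run the work, and always hand the connection back.
    <R> R with(Function<C, R> work) {
        C connection;
        try {
            connection = idle.take(); // blocks while every connection is busy
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            throw new IllegalStateException("interrupted while waiting", e);
        }
        try {
            return work.apply(connection);
        } finally {
            idle.add(connection); // cannot overflow: capacity == connection count
        }
    }

    int idleCount() {
        return idle.size();
    }
}
```

Whatever it is called, this is in effect a minimal pool whose minimum and
maximum sizes are equal, which is the point the question above makes.
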
>>>> Cheers,
>>>> Clinton
>>>>
>>>>
>>>> On Tue, Jan 20, 2009 at 12:33 PM, Nicholoz Koka Kiknadze <
>>>> kiknadze@gmail.com> wrote:
>>>>
>>>>> Ours is an application that requires guaranteed response times under 50
>>>>> ms, so:
>>>>>
>>>>> 1) We stopped using any kind of pool, so that
>>>>> 2) the number of constantly open connections equals the number of
>>>>> processors (16).
>>>>>
>>>>> 3) I know you were asking about pools, but I still dared to respond with
>>>>> this no-pool variant because I think what you are asking can be
>>>>> reformulated as: is there any use for a DB pool in a short-lived
>>>>> transaction scenario, or is it better to have one connection per CPU?
>>>>> Testing our app made us drop the pool with the TimesTen (in-memory)
>>>>> database. Now I have started to suspect that using a DB pool (I've mostly
>>>>> used DBCP) in other, less demanding projects (but again without
>>>>> long-running transactions) was just saving development time (letting the
>>>>> pool handle concurrency issues) without any substantial performance gain.
>>>>> I wonder what others think...
>>>>>
>>>>>
>>>>>
>>>>>
>>>>> On Tue, Jan 20, 2009 at 8:43 AM, Clinton Begin <
>>>>> clinton.begin@gmail.com> wrote:
>>>>>
>>>>>> Hi all,
>>>>>>
>>>>>> I've been studying a few large enterprise applications and have
>>>>>> noticed an interesting trend... many of these apps have HUNDREDS of
>>>>>> connections (like 600) available or even open in their connection pools...
>>>>>>
>>>>>> Survey Questions:
>>>>>>
>>>>>>   1. How many connections do you have available in your pool?
>>>>>>   2. And if you know, how many CPU cores are available on your
>>>>>> database server (or cluster)?
>>>>>>   3. If you have more than 2x or 3x as many connections as CPUs, do you
>>>>>> have a reason that you could share?
>>>>>>
>>>>>> Cheers,
>>>>>> Clinton
>>>>>>
>>>>>
>>>>>
>>>>
>>>
>>
>

--0016363b7e9c4385bc0460f7b1fb--