Return-Path: Delivered-To: apmail-commons-issues-archive@locus.apache.org Received: (qmail 389 invoked from network); 29 Feb 2008 06:11:56 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 29 Feb 2008 06:11:56 -0000 Received: (qmail 27740 invoked by uid 500); 29 Feb 2008 06:11:47 -0000 Delivered-To: apmail-commons-issues-archive@commons.apache.org Received: (qmail 27669 invoked by uid 500); 29 Feb 2008 06:11:47 -0000 Mailing-List: contact issues-help@commons.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: issues@commons.apache.org Delivered-To: mailing list issues@commons.apache.org Received: (qmail 27644 invoked by uid 99); 29 Feb 2008 06:11:47 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 28 Feb 2008 22:11:47 -0800 X-ASF-Spam-Status: No, hits=-2000.0 required=10.0 tests=ALL_TRUSTED X-Spam-Check-By: apache.org Received: from [140.211.11.140] (HELO brutus.apache.org) (140.211.11.140) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 29 Feb 2008 06:11:08 +0000 Received: from brutus (localhost [127.0.0.1]) by brutus.apache.org (Postfix) with ESMTP id 34E3D234C047 for ; Thu, 28 Feb 2008 22:10:51 -0800 (PST) Message-ID: <2129864603.1204265451215.JavaMail.jira@brutus> Date: Thu, 28 Feb 2008 22:10:51 -0800 (PST) From: "Phil Steitz (JIRA)" To: issues@commons.apache.org Subject: [jira] Commented: (DBCP-44) [dbcp] Evictor thread in GenericObjectPool has potential for deadlock MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-Virus-Checked: Checked by ClamAV on apache.org [ https://issues.apache.org/jira/browse/DBCP-44?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12573607#action_12573607 ] Phil Steitz commented on DBCP-44: --------------------------------- This is a new issue, though the basic problem of contention between the Evictor and client threads is the same. Turning off testWhileIdle or just setting timeBetweenEvictionRuns to the default (-1) so the evictor does not run should prevent this. Looks to me like what is going on here is 1) Client thread ("SocketRequest-8") calls close on a connection from the pool. This locks the PoolableConnection. 2) The returned connection gets added back into the pool 3) The evictor starts and locks the pool 4) The evictor tries to validate the newly returned connection, but needs to acquire its lock 5) returnObject (actually addObjectToPool) needs to acquire the pool lock to complete ***deadlock*** This is new because pool 1.4 reduced synchronization scope in returnObject, which used to be fully synchronized (opening the door for 3). We could revert to full synchronization of returnObject in pool, but that was a source of performance problems and I think the root cause here is really how DBCP uses the GOP Evictor without controlling client locks and we need to find a solution for that problem. Reverting to pool 1.3 will also eliminate this particular deadlock. > [dbcp] Evictor thread in GenericObjectPool has potential for deadlock > --------------------------------------------------------------------- > > Key: DBCP-44 > URL: https://issues.apache.org/jira/browse/DBCP-44 > Project: Commons Dbcp > Issue Type: Bug > Affects Versions: 1.2.1 > Environment: Operating System: All > Platform: All > Reporter: Eric Layton > Fix For: 1.3 > > Attachments: DBCP-44.patch, deadlock.txt, deadlock_post_patch.txt, testConcurrency.java > > > The GenericObjectPool Evictor thread can potentially cause a deadlock between > its connection factory and java.sql.DriverManager. The deadlock occurs when the > Evictor thread is trying to make enough connections to bring the pool's idle > connections up to what's specified in minIdle, at the same time a connection is > being requested through DriverManager.getConnection(). See the attached stack > trace dump: > Found one Java-level deadlock: > ============================= > "Thread-0": > waiting to lock monitor 0x0809a994 (object 0x698c2b48, a java.lang.Class), > which is held by "main" > "main": > waiting to lock monitor 0x0809aad4 (object 0x65df7030, a > org.apache.commons.dbcp.PoolableConnectionFactory), > which is held by "Thread-0" > > Java stack information for the threads listed above: > =================================================== > "Thread-0": > at java.sql.DriverManager.getConnection(DriverManager.java:158) > - waiting to lock <0x698c2b48> (a java.lang.Class) > at > org.apache.commons.dbcp.DriverManagerConnectionFactory.createConnection(DriverManagerConnectionFactory.java:48) > at > org.apache.commons.dbcp.PoolableConnectionFactory.makeObject(PoolableConnectionFactory.java:290) > - locked <0x65df7030> (a org.apache.commons.dbcp.PoolableConnectionFactory) > at > org.apache.commons.pool.impl.GenericObjectPool.addObject(GenericObjectPool.java:996) > at > org.apache.commons.pool.impl.GenericObjectPool.ensureMinIdle(GenericObjectPool.java:978) > at > org.apache.commons.pool.impl.GenericObjectPool.access$000(GenericObjectPool.java:124) > at > org.apache.commons.pool.impl.GenericObjectPool$Evictor.run(GenericObjectPool.java:1090) > at java.lang.Thread.run(Thread.java:595) > "main": > at > org.apache.commons.dbcp.PoolableConnectionFactory.makeObject(PoolableConnectionFactory.java:290) > - waiting to lock <0x65df7030> (a > org.apache.commons.dbcp.PoolableConnectionFactory) > at > org.apache.commons.pool.impl.GenericObjectPool.borrowObject(GenericObjectPool.java:771) > at org.apache.commons.dbcp.PoolingDriver.connect(PoolingDriver.java:175) > at java.sql.DriverManager.getConnection(DriverManager.java:525) > - locked <0x698c2b48> (a java.lang.Class) > at java.sql.DriverManager.getConnection(DriverManager.java:193) > - locked <0x698c2b48> (a java.lang.Class) > at Deadlock.main(Deadlock.java:83) > > Found 1 deadlock. > The deadlock occurs when GenericObjectPool.addObject() calls synchronized method > PoolableConnectionFactory.makeObject(), meanwhile static synchronized > DriverManager.getConnection() is called. makeObject() waits for the lock on > DriverManager to be released, whereas DriverManager waits for the lock on the > PoolableConnectionFactory instance to be released. > The Java program below, based on ManualPoolingDriverExample.java provided with > DBCP, duplicates the deadlock for me within several seconds of being run: > import java.sql.*; > import org.apache.commons.pool.*; > import org.apache.commons.pool.impl.*; > import org.apache.commons.dbcp.*; > > /** > * Duplicate DBCP pool deadlocking. > * > * Compile with: > * /usr/java/jdk1.5.0/bin/javac > * -classpath commons-collections.jar:commons-dbcp-1.2.1.jar:commons-pool-1.2.jar > * Deadlock.java > * > * Run with: > * /usr/java/jdk1.5.0/bin/java > * -classpath > commons-collections.jar:commons-dbcp-1.2.1.jar:commons-pool-1.2.jar:ojdbc14.jar:xerces.jar:. > * Deadlock > * > * Locks still occur when compiled and run with J2SDK 1.4.1_03. > */ > public class Deadlock { > > private static final int ACTIVE = 10; > > private static void init() { > System.out.println("Loading drivers"); > > try { > Class.forName("oracle.jdbc.driver.OracleDriver"); > Class.forName("org.apache.commons.dbcp.PoolingDriver"); > } catch (ClassNotFoundException e) { > e.printStackTrace(); > } > > System.out.println("Setting up pool"); > > try { > GenericObjectPool.Config config = new > GenericObjectPool.Config(); > config.maxActive = ACTIVE; > config.minIdle = 2; // Idle limits are low to allow more > possibility of locking. > config.maxIdle = 4; // Locking only occurs when there > are 0 idle connections in the pool. > config.maxWait = 5000L; > config.testOnBorrow = true; > config.testOnReturn = false; > config.testWhileIdle = true; > // Locking still occurs whether these tests are > performed or not. > config.whenExhaustedAction = > GenericObjectPool.WHEN_EXHAUSTED_BLOCK; > // Locking still occurs regardless of the > exhausted action. > config.timeBetweenEvictionRunsMillis = 3000L; // The > Evictor thread is involved in the lock, so run it quite often. > config.minEvictableIdleTimeMillis = 6000L; > config.numTestsPerEvictionRun = 3; > > ObjectPool op = new GenericObjectPool(null, config); > > ConnectionFactory cf = new > DriverManagerConnectionFactory("jdbc:oracle:thin:@oracle8server:1521:sid", > "username", "password"); > PoolableConnectionFactory pcf = new > PoolableConnectionFactory(cf, op, null, "SELECT 1 FROM DUAL", false, true); > // Locking still occurs whether there is a > validation query or not. > > PoolingDriver pd = > (PoolingDriver)DriverManager.getDriver("jdbc:apache:commons:dbcp:"); > pd.registerPool("PoolName", op); > > } catch (SQLException e) { > e.printStackTrace(); > } > > System.out.println("Done"); > } > > > public static void main(String[] args) { > init(); > > Connection[] c = new Connection[ACTIVE]; > > try { > printPoolStatus(); > > // Loop forever to create a high load. > while (true) { > // Create a number of connections. > for (int i = 0; i < ACTIVE; ++i) { > c[i] = > DriverManager.getConnection("jdbc:apache:commons:dbcp:PoolName"); > printPoolStatus(); > } > // Then immmediately drop them. > for (int i = 0; i < ACTIVE; ++i) { > try { > if (c[i] != null) { > c[i].close(); > printPoolStatus(); > c[i] = null; > } > } catch (SQLException e) { > e.printStackTrace(); > } > } > } > > } catch (SQLException e) { > e.printStackTrace(); > > } finally { > // Close down any open connetions. > for (int i = 0; i < ACTIVE; ++i) { > try { > if (c[i] != null) c[i].close(); > } catch (SQLException e) { } > } > > System.out.println("Closing pool"); > try { > PoolingDriver pd = > (PoolingDriver)DriverManager.getDriver("jdbc:apache:commons:dbcp:"); > pd.closePool("PoolName"); > } catch (SQLException e) { > e.printStackTrace(); > } > System.out.println("Pool closed"); > } > > } > > > /** > * Display pool status. Locks still occur even if this method is never > called. > */ > private static void printPoolStatus() throws SQLException { > PoolingDriver pd = > (PoolingDriver)DriverManager.getDriver("jdbc:apache:commons:dbcp:"); > ObjectPool op = pd.getConnectionPool("PoolName"); > > System.out.println("Active / idle: " + op.getNumActive() + " / " > + op.getNumIdle()); > } > > } > The patch I initially suggest is as follows (sorry for not providing a diff): > In org.apache.commons.dbcp.PoolableConnectionFactory.java we have: > synchronized public Object makeObject() throws Exception { > Connection conn = _connFactory.createConnection(); > if(null != _stmtPoolFactory) { > KeyedObjectPool stmtpool = _stmtPoolFactory.createPool(); > conn = new PoolingConnection(conn,stmtpool); > stmtpool.setFactory((PoolingConnection)conn); > } > return new PoolableConnection(conn,_pool,_config); > } > I suggest changing that to this: > public Object makeObject() throws Exception { > Connection conn = _connFactory.createConnection(); > synchronized (this) { > if(null != _stmtPoolFactory) { > KeyedObjectPool stmtpool = _stmtPoolFactory.createPool(); > conn = new PoolingConnection(conn,stmtpool); > stmtpool.setFactory((PoolingConnection)conn); > } > return new PoolableConnection(conn,_pool,_config); > } > } > Note the move of the synchronized block from the entire method to within the > method after the connection (obtained ultimately by DriverManager) is retrieved. > I'm afraid I don't know enough about DBCP and pooling to know the full > ramification of this change and other calls to makeObject(). -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.