Date: Tue, 28 Jul 2009 15:00:15 +0530
Subject: Deadlock in cluster deployment
From: aasoj j
To: users@jackrabbit.apache.org

Hi,

We have a 3-host Jackrabbit cluster deployment (version 1.5.4). New data nodes are added to the repository frequently (500 new nodes in different parts of the tree every 15 minutes), and a large number of reads are serviced (50 per second). After the cluster has been running for a few hours, we observe that most of the threads end up in a wait state:

"http-0.0.0.0-4080-37" Id=3098 in WAITING on lock=EDU.oswego.cs.dl.util.concurrent.WriterPreferenceReadWriteLock$ReaderLock@42ae9b4d
    at java.lang.Object.wait(Native Method)
    at java.lang.Object.wait(Object.java:485)
    at EDU.oswego.cs.dl.util.concurrent.WriterPreferenceReadWriteLock$ReaderLock.acquire(Unknown Source)
    at org.apache.jackrabbit.core.state.DefaultISMLocking$ReadLockImpl.<init>(DefaultISMLocking.java:84)
    at org.apache.jackrabbit.core.state.DefaultISMLocking$ReadLockImpl.<init>(DefaultISMLocking.java:78)
    at org.apache.jackrabbit.core.state.DefaultISMLocking.acquireReadLock(DefaultISMLocking.java:44)
    at org.apache.jackrabbit.core.state.SharedItemStateManager.acquireReadLock(SharedItemStateManager.java:1409)
    at org.apache.jackrabbit.core.state.SharedItemStateManager.hasItemState(SharedItemStateManager.java:286)
    at org.apache.jackrabbit.core.state.XAItemStateManager.hasItemState(XAItemStateManager.java:295)
    at org.apache.jackrabbit.core.state.SessionItemStateManager.getItemState(SessionItemStateManager.java:181)
    at org.apache.jackrabbit.core.HierarchyManagerImpl.getItemState(HierarchyManagerImpl.java:150)
    at org.apache.jackrabbit.core.HierarchyManagerImpl.resolvePath(HierarchyManagerImpl.java:113)
    at org.apache.jackrabbit.core.HierarchyManagerImpl.resolvePath(HierarchyManagerImpl.java:325)
    at org.apache.jackrabbit.core.CachingHierarchyManager.resolvePath(CachingHierarchyManager.java:135)
    at org.apache.jackrabbit.core.HierarchyManagerImpl.resolvePath(HierarchyManagerImpl.java:363)
    at org.apache.jackrabbit.core.ItemManager.getItem(ItemManager.java:435)
    at org.apache.jackrabbit.core.SessionImpl.getItem(SessionImpl.java:817)

This is usually observed after new nodes are added to the repository on one of the hosts.

We also observed the following NPE once:

ERROR org.apache.jackrabbit.core.cluster.ClusterNode:292 Unexpected error while syncing of journal: null
java.lang.NullPointerException
    at org.apache.jackrabbit.core.state.XAItemStateManager.stateModified(XAItemStateManager.java:580)
    at org.apache.jackrabbit.core.state.StateChangeDispatcher.notifyStateModified(StateChangeDispatcher.java:111)
    at org.apache.jackrabbit.core.state.SharedItemStateManager.stateModified(SharedItemStateManager.java:400)
    at org.apache.jackrabbit.core.state.ItemState.notifyStateUpdated(ItemState.java:244)
    at org.apache.jackrabbit.core.state.ChangeLog.persisted(ChangeLog.java:285)
    at org.apache.jackrabbit.core.state.SharedItemStateManager.doExternalUpdate(SharedItemStateManager.java:1186)
    at org.apache.jackrabbit.core.state.SharedItemStateManager.externalUpdate(SharedItemStateManager.java:1109)
    at org.apache.jackrabbit.core.RepositoryImpl$WorkspaceInfo.externalUpdate(RepositoryImpl.java:2098)
    at org.apache.jackrabbit.core.cluster.ClusterNode.process(ClusterNode.java:852)
    at org.apache.jackrabbit.core.cluster.ChangeLogRecord.process(ChangeLogRecord.java:384)
    at org.apache.jackrabbit.core.cluster.ClusterNode.consume(ClusterNode.java:798)
    at org.apache.jackrabbit.core.journal.AbstractJournal.doSync(AbstractJournal.java:213)
    at org.apache.jackrabbit.core.journal.AbstractJournal.sync(AbstractJournal.java:188)
    at org.apache.jackrabbit.core.cluster.ClusterNode.sync(ClusterNode.java:315)
    at org.apache.jackrabbit.core.cluster.ClusterNode.run(ClusterNode.java:286)
    at java.lang.Thread.run(Thread.java:619)

These issues seem similar to the ones below:

1. http://www.nabble.com/-jira--Created%3A-%28JCR-2168%29-Avoid-premature-publication-of-XAItemStateManager-td24145090.html#a24200238
2. https://issues.apache.org/jira/browse/JCR-2171

Please advise.

Thanks in advance.
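
For anyone trying to confirm the same symptom, here is a minimal sketch of how the "most of the threads end up in a wait state" observation could be quantified from inside the JVM. It uses only the standard java.lang.management API; the class name WaitingThreadSummary and the method countWaitersByLockClass are hypothetical names chosen for illustration and are not part of Jackrabbit. It assumes the lock name has the usual "className@hash" form seen in the dump above.

```java
import java.lang.management.ManagementFactory;
import java.lang.management.ThreadInfo;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical diagnostic helper, not part of Jackrabbit.
final class WaitingThreadSummary {

    private WaitingThreadSummary() {
    }

    /**
     * Counts live threads in WAITING or TIMED_WAITING state, grouped by the
     * class name of the object they are waiting on.
     */
    static Map<String, Integer> countWaitersByLockClass() {
        Map<String, Integer> counts = new TreeMap<>();
        ThreadInfo[] infos =
                ManagementFactory.getThreadMXBean().dumpAllThreads(false, false);
        for (ThreadInfo info : infos) {
            if (info == null) {
                continue; // defensive: skip entries for threads that are gone
            }
            Thread.State state = info.getThreadState();
            if (state != Thread.State.WAITING && state != Thread.State.TIMED_WAITING) {
                continue;
            }
            String lockName = info.getLockName();
            if (lockName == null) {
                continue; // waiting, but not on a reportable lock object
            }
            // Assumed format: "className@identityHashCode"; keep the class part.
            int at = lockName.lastIndexOf('@');
            String lockClass = at > 0 ? lockName.substring(0, at) : lockName;
            counts.merge(lockClass, 1, Integer::sum);
        }
        return counts;
    }

    public static void main(String[] args) {
        for (Map.Entry<String, Integer> e : countWaitersByLockClass().entrySet()) {
            System.out.println(e.getValue() + " thread(s) waiting on " + e.getKey());
        }
    }
}
```

Called periodically from within the affected web application, a growing count for EDU.oswego.cs.dl.util.concurrent.WriterPreferenceReadWriteLock$ReaderLock would match the dump above. Counting waiters is used here rather than ThreadMXBean.findDeadlockedThreads(), because that call looks for cycles on object monitors and ownable synchronizers, and threads sitting in Object.wait() inside a library lock are probably not reported by it.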