Return-Path: Delivered-To: apmail-jackrabbit-users-archive@locus.apache.org Received: (qmail 57284 invoked from network); 16 Jul 2007 12:36:40 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 16 Jul 2007 12:36:40 -0000 Received: (qmail 63932 invoked by uid 500); 16 Jul 2007 12:36:40 -0000 Delivered-To: apmail-jackrabbit-users-archive@jackrabbit.apache.org Received: (qmail 63868 invoked by uid 500); 16 Jul 2007 12:36:40 -0000 Mailing-List: contact users-help@jackrabbit.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: users@jackrabbit.apache.org Delivered-To: mailing list users@jackrabbit.apache.org Received: (qmail 63844 invoked by uid 99); 16 Jul 2007 12:36:40 -0000 Received: from herse.apache.org (HELO herse.apache.org) (140.211.11.133) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 16 Jul 2007 05:36:40 -0700 X-ASF-Spam-Status: No, hits=-0.0 required=10.0 tests=SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (herse.apache.org: domain of stefan.guggisberg@gmail.com designates 209.85.134.188 as permitted sender) Received: from [209.85.134.188] (HELO mu-out-0910.google.com) (209.85.134.188) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 16 Jul 2007 05:36:36 -0700 Received: by mu-out-0910.google.com with SMTP id w1so1350399mue for ; Mon, 16 Jul 2007 05:36:15 -0700 (PDT) DKIM-Signature: a=rsa-sha1; c=relaxed/relaxed; d=gmail.com; s=beta; h=domainkey-signature:received:received:message-id:date:from:to:subject:in-reply-to:mime-version:content-type:content-transfer-encoding:content-disposition:references; b=rF8e3EEAhFaoGVRkBXB87IzBGZ4Dv5Kz18OGiVqZdHsd0WRjdN6frZ2cKsbhSf2ULCM6ycclEekHjI7YpCJECctlR7Iw/P9hhg2SIoXFSuWd+BVj7xoJlLvTmI9G4x12Dih/MVVQ1b3FgAtipSNH+/xxy8lq2t052qwaHfpkulI= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=beta; h=received:message-id:date:from:to:subject:in-reply-to:mime-version:content-type:content-transfer-encoding:content-disposition:references; b=UA+DM15G/idsiiyfHEOo2CEuyeYznO9EfFj//lNmVYFgmmsvlmiKf1hIBdZDTvifDlbzVC/HQiH+G3j9j0eQaZ9uAd5W7tMDViO3n86AtC5PFdYdPZ7MhoxCJO2O8ayAj+j+/AHgVy3oZkRrFcQdE086ymE9pfvlcCx1dBWobjo= Received: by 10.82.127.14 with SMTP id z14mr4641309buc.1184589374912; Mon, 16 Jul 2007 05:36:14 -0700 (PDT) Received: by 10.82.158.11 with HTTP; Mon, 16 Jul 2007 05:36:14 -0700 (PDT) Message-ID: <90a8d1c00707160536p23bae5e2te7d3d3f6e7b8312b@mail.gmail.com> Date: Mon, 16 Jul 2007 14:36:14 +0200 From: "Stefan Guggisberg" To: users@jackrabbit.apache.org Subject: Re: Saving of nodes takes too long/Indexing configuration In-Reply-To: <227621ad0707160117yd9c0005k1561de528f144517@mail.gmail.com> MIME-Version: 1.0 Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit Content-Disposition: inline References: <227621ad0707140143w2f8e0a12hd02d8fac7b6b59fa@mail.gmail.com> <510143ac0707150308t40ac6c68o496fd3ea10556df0@mail.gmail.com> <227621ad0707152331u45942942x99d943a319863714@mail.gmail.com> <227621ad0707160117yd9c0005k1561de528f144517@mail.gmail.com> X-Virus-Checked: Checked by ClamAV on apache.org hi, On 7/16/07, Sridhar Raman wrote: > Also, how do I switch to bundle persistence? Currently, this is the > configuration in my workspace.xml file: > > > > > > > > > > > > > How do I change it to include the bundle persistance for Derby? while switching to BundleDbPersistenceManager would certainly provide a certain performance gain i doubt that it would solve your issue. you're using an embedded derby db which should provide a decent perfomance. i just ran a quick test using DerbyPersistenceManager: saving 1000 nodes with 5 string properties each takes about 3 seconds on a 1.9ghz intel macbook pro (i.e. ~12s./4000 nodes). you mentioned that in your case it takes ~32 minutes (!) to save 4000 nodes. please tell us more on your data model. are you storing large binary properties? how many properties (and of what type) are you storing per node? can you provide a simple test case? cheers stefan > > Thanks, > Sridhar > > On 7/16/07, Sridhar Raman wrote: > > > > I use DerbyPersistenceManager and LocalFileSystem. So would I be able to > > switch to bundle persistence in this case, and would it be helpful? > > > > On 7/15/07, Jukka Zitting wrote: > > > > > > Hi, > > > > > > On 7/14/07, Sridhar Raman wrote: > > > > I use Jackrabbit extensively, and one problem that I seem to run into > > > a lot > > > > of times is when I import data, and save the nodes. For saving 4000 > > > nodes, > > > > it almost takes 32 mins to execute the session.save() command. Any > > > way of > > > > fixing this? > > > > > > > > Is it probably because all my data is getting indexed? Could I > > > somehow > > > > specify only specific properties/types to be indexed? > > > > > > I much more suspect that the time is spent talking to the persistence > > > store. Are you using an external database for persistence? > > > > > > The traditional database persistence managers issue a separate SQL > > > statement (causing a network roundtrip to the database) for each node > > > *and* property being saved, which can quickly end up taking a lot of > > > time especially if the network roundtrip to a database server takes > > > more than a few milliseconds. > > > > > > Good solutions to this problem are either to switch to the bundle > > > persistence (which uses just a single statement for a node and all > > > it's properties) included in Jackrabbit 1.3 and/or using an embedded > > > database like the default Derby. > > > > > > BR, > > > > > > Jukka Zitting > > > > > > > >