Return-Path: Delivered-To: apmail-jackrabbit-users-archive@locus.apache.org Received: (qmail 83821 invoked from network); 5 Oct 2007 07:42:08 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 5 Oct 2007 07:42:08 -0000 Received: (qmail 141 invoked by uid 500); 5 Oct 2007 07:41:55 -0000 Delivered-To: apmail-jackrabbit-users-archive@jackrabbit.apache.org Received: (qmail 122 invoked by uid 500); 5 Oct 2007 07:41:55 -0000 Mailing-List: contact users-help@jackrabbit.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: users@jackrabbit.apache.org Delivered-To: mailing list users@jackrabbit.apache.org Received: (qmail 113 invoked by uid 99); 5 Oct 2007 07:41:55 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 05 Oct 2007 00:41:55 -0700 X-ASF-Spam-Status: No, hits=2.0 required=10.0 tests=HTML_MESSAGE,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: local policy) Received: from [217.12.11.79] (HELO smtp010.mail.ukl.yahoo.com) (217.12.11.79) by apache.org (qpsmtpd/0.29) with SMTP; Fri, 05 Oct 2007 07:41:55 +0000 Received: (qmail 91657 invoked from network); 5 Oct 2007 07:40:33 -0000 DomainKey-Signature: a=rsa-sha1; q=dns; c=nofws; s=s1024; d=yahoo.co.uk; h=Received:X-YMail-OSG:From:To:Subject:Date:Message-ID:MIME-Version:Content-Type:X-Mailer:thread-index:Content-Language; b=TsdIs+9HeiEoLP/CFEMAXI3sbKpbkoeAN5ZJUi31Fo7PIFmu5OPO2d7utpA8E45DeTaX9O9rwnz+OSiid3pZwgXpp1F818hFtylY/WH0GlG5gON7zMAE3ctC5yei36u9JZ1lsAYT3DvbRRgQ0VWk9Le7tZfzau+7Jqs44Njb2xU= ; Received: from unknown (HELO miqsbarriba) (sbarriba@81.149.172.165 with login) by smtp010.mail.ukl.yahoo.com with SMTP; 5 Oct 2007 07:40:31 -0000 X-YMail-OSG: jaoygYcVM1nUJhTvGJF8vUXsPHLtyx3RzAI7r.cYugY2HJls7M8zm5NOhqyEYQas7z_151skQEDicRZX2qrsQI_jSZAwRbvLbTXbGDy3Q8fig5oP From: "sbarriba" To: Subject: Memory usage issues of importml/exportsysview Date: Fri, 5 Oct 2007 08:40:29 +0100 Message-ID: <035a01c80723$020a2c10$061e8430$@co.uk> MIME-Version: 1.0 Content-Type: multipart/alternative; boundary="----=_NextPart_000_035B_01C8072B.63CE9410" X-Mailer: Microsoft Office Outlook 12.0 thread-index: AcgHIwAYgWdQ3iagQUS5qRSrkIzeyA== Content-Language: en-gb X-Virus-Checked: Checked by ClamAV on apache.org ------=_NextPart_000_035B_01C8072B.63CE9410 Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7bit Hi all, During a recent thread Hot Backup Tools were discussed - see http://www.mail-archive.com/users@jackrabbit.apache.org/msg04255.html. As an outcome of that we're doing 2 things: 1) "Low-level" backup o Backing up the database o Backing up the repository file system 2) "High-level" backup o Running exportsysview on each workspace When migrating between environments or restoring backups solution 2) is very useful although the XML files are getting very large where the content has lots of binaries etc. The main issue is that the memory requirements of "importxml" increase linearly with the size of the XML file. I presume this is due to either a) the memory required to parse the file, and/or b) the memory required to hold the transient state of the import. We're now needing to use a 1GB heap size for some imports and obviously this will hit a crunch point. Any suggestions on how to resolve this memory issue? For example, could the "importxml" not use a SAX event model to avoid parsing the XML into a complete DOM etc (note I don't know the internals of importxml as it stands). All suggestions welcome. Regards, Shaun ------=_NextPart_000_035B_01C8072B.63CE9410--