Return-Path: Delivered-To: apmail-lucene-solr-dev-archive@locus.apache.org Received: (qmail 75197 invoked from network); 17 Nov 2008 03:05:14 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 17 Nov 2008 03:05:14 -0000 Received: (qmail 20112 invoked by uid 500); 17 Nov 2008 03:05:21 -0000 Delivered-To: apmail-lucene-solr-dev-archive@lucene.apache.org Received: (qmail 20080 invoked by uid 500); 17 Nov 2008 03:05:21 -0000 Mailing-List: contact solr-dev-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: solr-dev@lucene.apache.org Delivered-To: mailing list solr-dev@lucene.apache.org Received: (qmail 20069 invoked by uid 99); 17 Nov 2008 03:05:21 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 16 Nov 2008 19:05:21 -0800 X-ASF-Spam-Status: No, hits=-2000.0 required=10.0 tests=ALL_TRUSTED X-Spam-Check-By: apache.org Received: from [140.211.11.140] (HELO brutus.apache.org) (140.211.11.140) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 17 Nov 2008 03:04:08 +0000 Received: from brutus (localhost [127.0.0.1]) by brutus.apache.org (Postfix) with ESMTP id 74AC7234C256 for ; Sun, 16 Nov 2008 19:04:49 -0800 (PST) Message-ID: <731791095.1226891089476.JavaMail.jira@brutus> Date: Sun, 16 Nov 2008 19:04:49 -0800 (PST) From: "Bill Au (JIRA)" To: solr-dev@lucene.apache.org Subject: [jira] Commented: (SOLR-857) Memory Leak during the indexing of large xml files In-Reply-To: <216351507.1226676764346.JavaMail.jira@brutus> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-Virus-Checked: Checked by ClamAV on apache.org [ https://issues.apache.org/jira/browse/SOLR-857?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12648070#action_12648070 ] Bill Au commented on SOLR-857: ------------------------------ HashMap$Entry is on top of your list there. I would look for a large HashMap in the heap dump. > Memory Leak during the indexing of large xml files > -------------------------------------------------- > > Key: SOLR-857 > URL: https://issues.apache.org/jira/browse/SOLR-857 > Project: Solr > Issue Type: Bug > Affects Versions: 1.3 > Environment: Verified on Ubuntu 8.0.4 (1.7GB RAM, 2.4GHz dual core) and Windows XP (2GB RAM, 2GHz pentium) both with a Java5 SDK > Reporter: Ruben Jimenez > Attachments: OQ_SOLR_00001.xml.zip, schema.xml, solr256MBHeap.jpg > > > While indexing a set of SOLR xml files that contain 5000 document adds within them and are about 30MB each, SOLR 1.3 seems to continually use more and more memory until the heap is exhausted, while the same files are indexed without issue with SOLR 1.2. > Steps used to reproduce. > 1 - Download SOLR 1.3 > 2 - Modify example schema.xml to match fields required > 3 - start example server with following command java -Xms512m -Xmx1024m -XX:MaxPermSize=128m -jar start.jar > 4 - Index files as follow java -Xmx128m -jar .../examples/exampledocs/post.jar *.xml > Directory with xml files contains about 100 xml files each of about 30MB each. While indexing after about the 25th file SOLR 1.3 runs out of memory, while SOLR 1.2 is able to index the entire set of files without any problems. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.