From mpogue@apache.org Thu May 11 20:25:26 2000 Return-Path: Mailing-List: contact general-help@xml.apache.org; run by ezmlm Delivered-To: mailing list general@xml.apache.org Received: (qmail 91670 invoked from network); 11 May 2000 20:25:26 -0000 Received: from mg03.austin.ibm.com (HELO mailgate3.austin.ibm.com) (192.35.232.20) by locus.apache.org with SMTP; 11 May 2000 20:25:26 -0000 Received: from netmail1.austin.ibm.com (netmail1.austin.ibm.com [9.53.250.96]) by mailgate3.austin.ibm.com (AIX4.3/8.9.3/8.9.3) with ESMTP id PAA40494 for ; Thu, 11 May 2000 15:07:22 -0500 Received: from popmail.austin.ibm.com (popmail.austin.ibm.com [9.53.247.178]) by netmail1.austin.ibm.com (8.8.5/8.8.5) with ESMTP id PAA62022 for ; Thu, 11 May 2000 15:25:25 -0500 Received: from apache.org (socks2.almaden.ibm.com [9.1.40.50]) by popmail.austin.ibm.com (AIX4.2/UCB 8.7/8.7-client1.01) with ESMTP id PAA26572 for ; Thu, 11 May 2000 15:25:22 -0500 (CDT) Message-ID: <391B1797.932F650B@apache.org> Date: Thu, 11 May 2000 13:27:03 -0700 From: Mike Pogue Organization: xml.apache.org X-Mailer: Mozilla 4.72 [en] (Windows NT 5.0; U) X-Accept-Language: en MIME-Version: 1.0 To: general@xml.apache.org Subject: Re: DOM size References: <3.0.32.20000511085445.014a0370@pop.intergate.ca> <391AE040.118704E4@apache.org> <391AF513.86402F6A@selectacast.net> <391AFC30.8EF5DBB6@apache.org> <391B064B.AA0AEDC5@selectacast.net> Content-Type: text/plain; charset=us-ascii Content-Transfer-Encoding: 7bit X-Spam-Rating: locus.apache.org 1.6.2 0/1000/N So, does this figure include code + data tables, including lazily evaluated data? (It sounds like you're looking at the memory taken by the whole JVM?) One way to check this would be to do 600 documents, and see how the numbers compare. Also, try doubling the size of each individual document, and see what happens. Either way, the answers will tell us where we should look... Mike P.S. Which DOM did you use for the test? Did you traverse the resulting DOM? Joseph Shraibman wrote: > > Mike Pogue wrote: > > > How did you measure the memory consumption? Did you also account for > > gc? > > > > Mike > > > > I viewed the size in kpm. And I explicitly did System.gc(); System.runFinalization(); > System.gc(); > > > > > Joseph Shraibman wrote: > > > > > > 500 simple Documents in Xerces-J using version 1.0.2 takes up 53668K. In version > > > 1.0.4 it is 52584K. A document should not take up 100k. > > > > > > Documents were created from the String: > > > > > > > > > Mike Pogue wrote: > > > > > > > Also note that both Xerces-J and Xerces-C have recently had memory optimizations > > > > done to the DOM (courtesy of Arnaud LeHors). In some cases, nodes got 30% smaller, > > > > at virtually no cost in performance. > > > > > > > > So, be sure to measure using the latest 1.0.4 Xerces-J. Xerces-C will be doing a > > > > build with these changes in it shortly... > > > > > > > > Mike > > > > > > > > Tim Bray wrote: > > > > > > > > > > At 08:57 AM 5/11/00 +0200, Dennis Thrysoe - netnord wrote: > > > > > >Does anybody have a ballpark figure on how much memory a DOM would eat up if > > > > > >it was an entire 16 mb XML file. (The implementation is IBM's XML4J 2.0.15 - > > > > > >but I'm just looking for a qualified guess). > > > > > > > > > > N * 16M, where N ranges between 1.5 for a fine-tuned readonly DOM > > > > > implementation in C [done by David Megginson but never released I think], to > > > > > over 20 for some perl implementations. Also it depends (a lot) on how > > > > > dense the tags are. -Tim > > > > > > > > > > --------------------------------------------------------------------- > > > > > In case of troubles, e-mail: webmaster@xml.apache.org > > > > > To unsubscribe, e-mail: general-unsubscribe@xml.apache.org > > > > > For additional commands, e-mail: general-help@xml.apache.org > > > > > > > > --------------------------------------------------------------------- > > > > In case of troubles, e-mail: webmaster@xml.apache.org > > > > To unsubscribe, e-mail: general-unsubscribe@xml.apache.org > > > > For additional commands, e-mail: general-help@xml.apache.org > > > > > > --------------------------------------------------------------------- > > > In case of troubles, e-mail: webmaster@xml.apache.org > > > To unsubscribe, e-mail: general-unsubscribe@xml.apache.org > > > For additional commands, e-mail: general-help@xml.apache.org > > > > --------------------------------------------------------------------- > > In case of troubles, e-mail: webmaster@xml.apache.org > > To unsubscribe, e-mail: general-unsubscribe@xml.apache.org > > For additional commands, e-mail: general-help@xml.apache.org > > --------------------------------------------------------------------- > In case of troubles, e-mail: webmaster@xml.apache.org > To unsubscribe, e-mail: general-unsubscribe@xml.apache.org > For additional commands, e-mail: general-help@xml.apache.org