poi-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From bugzi...@apache.org
Subject DO NOT REPLY [Bug 33681] - Cannot load 500kb file with usermodel or event model
Date Thu, 28 Jul 2005 04:24:55 GMT
DO NOT REPLY TO THIS EMAIL, BUT PLEASE POST YOUR BUG·
RELATED COMMENTS THROUGH THE WEB INTERFACE AVAILABLE AT
<http://issues.apache.org/bugzilla/show_bug.cgi?id=33681>.
ANY REPLY MADE TO THIS MESSAGE WILL NOT BE COLLECTED AND·
INSERTED IN THE BUG DATABASE.

http://issues.apache.org/bugzilla/show_bug.cgi?id=33681





------- Additional Comments From jheight@subcorp.com.au  2005-07-28 06:24 -------
Ok i know what is happening here. SST records it seems *CAN* have duplicate 
entries @#^&%!

This means that the current implementation which adds spaces onto the string to
prevent duplicate entries being put into the Binary Tree is called continually!
Which of course is going to raise an exceptionally large number of exceptions &
re-create a large number of strings, effectively killing the whole performance.

As part of the patch to http://issues.apache.org/bugzilla/show_bug.cgi?id=31906
I have implemented the RichText stuff in UnicodeRecord, including comparision
for uniqueness based on the rich text stuff.  After taking out the code in the
SSTDeserializer.addToStringTable which ensured uniqueness, i now get exceptions
being raised with duplicates.

So in actual fact we should allow for duplicates in the SST record. I will look
into this over the next few days.

Jason

-- 
Configure bugmail: http://issues.apache.org/bugzilla/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug, or are watching the assignee.

---------------------------------------------------------------------
To unsubscribe, e-mail: poi-dev-unsubscribe@jakarta.apache.org
Mailing List:    http://jakarta.apache.org/site/mail2.html#poi
The Apache Jakarta POI Project: http://jakarta.apache.org/poi/


Mime
View raw message