poi-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From bugzi...@apache.org
Subject [Bug 54790] Word Document loading strategy is memory hungry and causes OutOfMemoryError
Date Sat, 06 Apr 2013 15:21:42 GMT
https://issues.apache.org/bugzilla/show_bug.cgi?id=54790

--- Comment #2 from Dmitry <dma_k@mail.ru> ---
To be more precise:
- Opening fails with -Xmx800MB
- Opening succeeded with -Xmx900MB

Expected:
- Opening succeeds with -Xmx300MB

I repeat: DOC file size is 70MB. Potentially I can cut or put it as is to
fileshare.

> And to use TextPiece just as some lightweigh proxy to DocumentStream going to be very
ineffective (due to required character encoding-deconding process).

Deferred encoding-deconding is not a problem: the only flag is
"unicode=true|false". The problem is that DocumentStream is cut into millions
of tiny char buffers.

> Also, disabling preserveTextTable means the whole text is reconstructed into single buffer
(StringBuilder).

OOM happens before whole text is reconstructed. I would agree for x3 memory
consumption, that is 70MB -> 210MB heap. But x10 is too much. And yes,
"preserveTextTable" is disabled by default as far as I can see, unless it is
enabled by system property.

-- 
You are receiving this mail because:
You are the assignee for the bug.

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@poi.apache.org
For additional commands, e-mail: dev-help@poi.apache.org


Mime
View raw message