poi-user mailing list archives

From Sam Li <sam...@gmail.com>
Subject Extracting all text from a document, including textboxes
Date Mon, 16 Apr 2012 10:37:01 GMT
I'm currently unable to extract all the text from Office 2007 Office Open XML documents;
specifically, the text inside textboxes is missing. What I really need is just a word count,
but the word counter isn't very accurate. Any ideas on how to solve this problem? I know that
text in textboxes can be extracted fine from regular .doc files. I'm only having trouble with
.docx files.