poi-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From nacho210 <ican...@truelogic.com.ar>
Subject errors in text extraction
Date Mon, 19 May 2008 22:18:57 GMT

i´m using hwpf to extract text from word documents. 

WordExtractor extractor = new WordExtractor(fis);

String body = extractor.getText();

Returns invalid characters like: \u0013 \u0014 \u000b

any suggestion on what the problem might be?

View this message in context: http://www.nabble.com/errors-in-text-extraction-tp17329385p17329385.html
Sent from the POI - User mailing list archive at Nabble.com.

To unsubscribe, e-mail: user-unsubscribe@poi.apache.org
For additional commands, e-mail: user-help@poi.apache.org

View raw message