poi-user mailing list archives

From Nick Burch <nick.bu...@alfresco.com>
Subject Re: WordExtractor.getText() returns ^U on word docs.
Date Mon, 11 Jan 2010 16:30:03 GMT
On Mon, 11 Jan 2010, maxSchlein wrote:
> I tried what you suggested:
>
>          WordExtractor wordExt = new WordExtractor(is);
>          String bodyText = WordExtractor.stripFields(wordExt.getText());
>
> But the ^U is still in the text.

Can you create a new bug in Bugzilla and upload a sample file that shows
this behaviour? In the meantime, you'll need to go with Mark's suggestion
of manually removing them.
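
As a rough sketch of what removing them by hand might look like, assuming
the stray character is U+0015 (which displays as ^U) and that dropping
every occurrence is acceptable for your documents. The class and method
names here are made up for illustration and are not part of POI:

```java
// Hypothetical helper, not part of POI: strips the control character
// that shows up as ^U from already-extracted text.
final class ControlCharStripper {

    // Assumption: the unwanted character is U+0015, as the ^U in the
    // subject line suggests.
    private static final char FIELD_MARKER = '\u0015';

    private ControlCharStripper() {
    }

    /** Returns the text with every U+0015 character removed. */
    static String stripFieldMarkers(String text) {
        if (text == null) {
            return null;
        }
        StringBuilder cleaned = new StringBuilder(text.length());
        for (int i = 0; i < text.length(); i++) {
            char c = text.charAt(i);
            if (c != FIELD_MARKER) {
                cleaned.append(c);
            }
        }
        return cleaned.toString();
    }
}
```

You would call it on the result of the extractor, for example
ControlCharStripper.stripFieldMarkers(wordExt.getText()), in place of or
after the stripFields call.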

Cheers
Nick
