poi-user mailing list archives

From Rajeev Mohanraj <rajeevab...@gmail.com>
Subject Re: How to extract special character and symbols from the word document
Date Wed, 03 Aug 2011 08:36:24 GMT
Nick Burch <nick.burch <at> alfresco.com> writes:

> 
> On Mon, 1 Aug 2011, Rajeev Mohanraj wrote:
> > I am also facing a similar problem. POI does not read the special
> > characters and symbols from a Word document. For example, my Word document
> > contains µĪĦĜăĂ; when I read it with POI, I get ?????? instead.
> > How can I get the exact special characters? Please help me out.
> 
> Looks like you've set an incorrect encoding on your output. This comes up
> a lot, mostly with people trying to use Excel; see the archives for
> details on how to resolve it for your given platform.
> 
> Nick
> 
> 
> 
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: user-unsubscribe <at> poi.apache.org
> For additional commands, e-mail: user-help <at> poi.apache.org


Hi Nick,

I have already set the encoding to UTF-8. I am using POI's Word-to-HTML
conversion, and I set the output encoding to UTF-8, but the special
characters µĪĦĜăĂ still come out as ??????. Do I also need to read the
content as UTF-8?
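
For reference, here is a minimal sketch of where output like ?????? usually comes from. It uses only the JDK and makes no POI calls; the class name EncodingDemo and the method roundTrip are hypothetical names chosen for illustration. The assumption is that the text is correct in memory and is lost at the point where it is turned into bytes: a writer whose charset cannot represent a character substitutes '?' for it.

```java
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.OutputStreamWriter;
import java.io.UncheckedIOException;
import java.io.Writer;
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

public class EncodingDemo {

    // Writes text through a Writer bound to the given charset, then decodes
    // the resulting bytes with the same charset. Characters the charset
    // cannot represent are replaced by the encoder (with '?' for US-ASCII).
    static String roundTrip(String text, Charset charset) {
        ByteArrayOutputStream bytes = new ByteArrayOutputStream();
        try (Writer writer = new OutputStreamWriter(bytes, charset)) {
            writer.write(text);
        } catch (IOException e) {
            // Not expected for an in-memory stream; rethrow unchecked.
            throw new UncheckedIOException(e);
        }
        return new String(bytes.toByteArray(), charset);
    }

    public static void main(String[] args) {
        // The six characters from the message, written as Unicode escapes so
        // the example does not depend on the encoding of this source file.
        String sample = "\u00B5\u012A\u0126\u011C\u0103\u0102";

        // An explicit UTF-8 writer preserves every character.
        System.out.println(roundTrip(sample, StandardCharsets.UTF_8).equals(sample));

        // A charset that lacks these characters yields question marks.
        System.out.println(roundTrip(sample, StandardCharsets.US_ASCII));
    }
}
```

If the same pattern holds in the conversion code, the place to check is every Writer, OutputStream wrapper, or String-to-bytes call between the converter and the file, since any one of them falling back to the platform default charset is enough to produce the question marks.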



