lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Steven Rowe <sar...@syr.edu>
Subject Re: Lucene indexing for pdf files
Date Fri, 31 Aug 2007 16:07:15 GMT
Hi Madhu,

Madhu wrote:
> i am indexing pdf document using pdfbox 7.4, its working fine for some pdf
> files. for japanese pdf files its giving the below exception.
> 
> caught a class java.io.IOException
>  with message: Unknown encoding for 'UniJIS-UCS2-H'
> 
> Can any one help me , how to set the encoding while reading pdf files.

This question will get much better and quicker answers from PDFBox
mailing lists/forums.  The SF forums look much more active than the
mailing lists:

   http://sourceforge.net/forum/?group_id=78314

Steve

-- 
Steve Rowe
Center for Natural Language Processing
http://www.cnlp.org/tech/lucene.asp

---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org


Mime
View raw message