pdfbox-users mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From John Hewson <j...@jahewson.com>
Subject Re: extract chinese letters from pdf
Date Wed, 29 Apr 2015 05:36:47 GMT

> On 28 Apr 2015, at 20:21, Gang Fu <gangfu1982@gmail.com> wrote:
> 
> Hi,
> 
>  
> I want to parse the PDF file with both Chinese and English letters. Which encoding should
I use?
> 
> The sample file is attached. 
> 

UTF-8. You want to use the trunk version of PDFBox (2.0) too.

Our mailing list removes binary attachments, so you’ll have to post your PDF file somewhere
public so that we can see it.

— John
> Thank you very much!
> 
>  
> Best,
> 
> Gang
> 
> 
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: users-unsubscribe@pdfbox.apache.org
> For additional commands, e-mail: users-help@pdfbox.apache.org


Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message