pdfbox-users mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Renaud Billen <renaudbil...@nic.be>
Subject Extraction of chinese characters
Date Tue, 06 Jan 2015 10:59:47 GMT

fresh new user of pdfbox, I’ve got some problems extracting the text of pdfs with Chinese
characters in it.

I use pdfbox from the command line with the command : java -jar C:/pdfbox-app.jar ExtractText
C:/Test_Pdfbox.pdf C:/Test_Pdfbox.txt

Result text only contains question marks..

Here is the document : 

Thanks for your help,
View raw message