pdfbox-users mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Czech, Christian" <c.cz...@elo.com>
Subject Slovakian characters
Date Tue, 17 Jul 2012 12:42:12 GMT
Hello,

I have a PDF document with Slovakian characters e.g. "Nápovda pro klienta". How can I extract
it correctly?

My code:


PDDocument document = null;
document = PDDocument.load(pdfFile, true);
PDFTextStripper stripper = null;
stripper = new PDFTextStripper("ISO-8859-2");
stripper.getText(document);

I always get this result: "N\?pověda pro klienta"

Thanks
Christian



________________________________

ELO Digital Office GmbH
Firmensitz: Heilbronner Strasse 150, 70191 Stuttgart
Fon: +49 711 806089-0, Fax: +49 711 806089-19, Web: www.elo.com
Geschäftsführer: Karl Heinz Mosbach, Matthias Thiele
BW-Bank, Konto-Nr. 2089782, BLZ 600 501 01
Registergericht Stuttgart HRB 15059 - USt-IdNr.: DE812471516

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message