pdfbox-users mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Czech, Christian" <c.cz...@elo.com>
Subject text extraction - czech characters
Date Thu, 14 Jun 2012 16:02:01 GMT
Hello,

I have problem with czech characters in PDF.

My code:

PDFTextStripper stripper = null;
stripper = new PDFTextStripper(encoding);
stripper.setStartPage(startPage);
stripper.setEndPage(endPage);
try {
stripper.writeText(document, outputWriter);
.......

http://download.eloit.de/czech/Dokument (PDF text).pdf
http://download.eloit.de/czech/Dokument (PDF text).txt

Can somebody help me?
Thanks

Christian


________________________________

ELO Digital Office GmbH
Firmensitz: Heilbronner Strasse 150, 70191 Stuttgart
Fon: +49 711 806089-0, Fax: +49 711 806089-19, Web: www.elo.com
Gesch?ftsf?hrer: Karl Heinz Mosbach, Matthias Thiele
BW-Bank, Konto-Nr. 2089782, BLZ 600 501 01
Registergericht Stuttgart HRB 15059 - USt-IdNr.: DE812471516

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message