pdfbox-users mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Wilson dos Santos Batista Junior <wilsonbatist...@gmail.com>
Subject Fwd: problem with text extract
Date Wed, 04 Feb 2009 23:37:30 GMT
Dear pdfBox users,
I am trying extract texts of pdf documents with pdfbox, but I have a problem
with some pdf. The result of a extract is the attached figure. These ocurred
when tx-pdf is the productor. What I can do?
Thanks!



//I am using the next simple lines
PDFTextStripper stripper = new PDFTextStripper();
docText = stripper.getText(new PDDocument(cosDoc));

Mime
  • Unnamed multipart/mixed (inline, None, 0 bytes)
View raw message