pdfbox-users mailing list archives

From Wouter De Borger <wouter.debor...@inmanta.com>
Subject Make PDFBox fail on bad pdf
Date Thu, 30 Mar 2017 09:59:32 GMT
Hi All,

When a PDF has a bad encoding, PDFBox produces garbage text (as explained in
the FAQ: https://pdfbox.apache.org/2.0/faq.html#gibberish).

Can I make PDFBox fail in this case instead of producing garbage?

(I'm working on a system that can also do OCR, so at the slightest sign of
trouble I would like PDFBox to fail so that I can fall back to OCR.)
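
To make the intended fallback concrete, here is a minimal sketch in plain
Java. It does not use any PDFBox API. The class name ExtractionFallback, the
methods looksLikeGibberish and extractOrOcr, and the 5% threshold are all
hypothetical names and values chosen for illustration. The check only catches
obvious damage (replacement characters, control characters, private-use or
unassigned code points) and would miss text that decodes to well-formed but
wrong letters.

```java
import java.util.function.Supplier;

// Hypothetical helper, not part of PDFBox: decides whether already-extracted
// text looks damaged enough to justify falling back to OCR.
final class ExtractionFallback {

    // Assumed threshold: more than 5% suspect characters counts as garbage.
    static final double MAX_SUSPECT_RATIO = 0.05;

    // Returns true when the text is empty or contains too many code points
    // that normally do not appear in readable text.
    static boolean looksLikeGibberish(String text) {
        int total = 0;
        int suspect = 0;
        for (int i = 0; i < text.length(); ) {
            int cp = text.codePointAt(i);
            i += Character.charCount(cp);
            if (Character.isWhitespace(cp)) {
                continue; // whitespace says nothing about encoding quality
            }
            total++;
            int type = Character.getType(cp);
            if (cp == 0xFFFD                        // Unicode replacement character
                    || type == Character.CONTROL
                    || type == Character.PRIVATE_USE
                    || type == Character.UNASSIGNED
                    || type == Character.SURROGATE) { // unpaired surrogate
                suspect++;
            }
        }
        if (total == 0) {
            return true; // no visible text at all: treat as a failed extraction
        }
        return (double) suspect / total > MAX_SUSPECT_RATIO;
    }

    // Returns the extracted text when it looks sane, otherwise the OCR result.
    // The OCR step is passed in as a Supplier so it only runs when needed.
    static String extractOrOcr(String extracted, Supplier<String> ocr) {
        return looksLikeGibberish(extracted) ? ocr.get() : extracted;
    }
}
```

In my system the first argument would be whatever text PDFBox returned, and
the supplier would invoke the OCR step. What I am really asking is whether
PDFBox can signal the problem itself, so that I do not have to rely on a
heuristic like this one.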

