pdfbox-users mailing list archives

From "OYEBISI, Daniel" <doyeb...@bdoc.com>
Subject PDType0Font toUnicode Mapping
Date Mon, 18 Jul 2016 09:08:40 GMT
Hi,

While extracting text from a PDF (screenshot linked below), I came across a "No Unicode mapping"
warning. The extracted text does not contain the Wingdings 3 characters that are present in
the PDF. I have been trying to debug this PDF for some time, but I cannot work out what is
causing the problem.

Could someone please explain why PDFBox is unable to extract these symbols correctly?

The links related to this PDF are below:

PDF file on Dropbox
https://www.dropbox.com/s/57cvb36h4x2v96k/page2.pdf?dl=0

Screenshot (Text extraction)
https://www.dropbox.com/s/ftb3tuwvq3npg8o/page2%20no%20unicode%20mapping.PNG?dl=0

