pdfbox-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Brian Carrier (JIRA)" <j...@apache.org>
Subject [jira] Created: (PDFBOX-419) Provide info on number of characters in document that were mapped and decoded.
Date Wed, 04 Feb 2009 19:02:04 GMT
Provide info on number of characters in document that were mapped and decoded.
------------------------------------------------------------------------------

                 Key: PDFBOX-419
                 URL: https://issues.apache.org/jira/browse/PDFBOX-419
             Project: PDFBox
          Issue Type: New Feature
          Components: Text extraction
            Reporter: Brian Carrier
            Priority: Minor


For various reasons, some text cannot be extracted from PDF files. A "?" is saved in the text
output for those cases, but this does not allow an automated system to determine how much
of the document that  PDFBox was able to process. There should be a way for the caller to
determine how much of the file PDFBox could process. 

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message