pdfbox-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Brian Carrier (JIRA)" <j...@apache.org>
Subject [jira] Resolved: (PDFBOX-419) Provide info on number of characters in document that were mapped and decoded.
Date Wed, 04 Feb 2009 19:53:59 GMT

     [ https://issues.apache.org/jira/browse/PDFBOX-419?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Brian Carrier resolved PDFBOX-419.
----------------------------------

    Resolution: Fixed
      Assignee: Brian Carrier

Fixed by keeping track of data and adding methods to access them.

Sending        PDFStreamEngine.java
Transmitting file data .
Committed revision 740843.

> Provide info on number of characters in document that were mapped and decoded.
> ------------------------------------------------------------------------------
>
>                 Key: PDFBOX-419
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-419
>             Project: PDFBox
>          Issue Type: New Feature
>          Components: Text extraction
>            Reporter: Brian Carrier
>            Assignee: Brian Carrier
>            Priority: Minor
>
> For various reasons, some text cannot be extracted from PDF files. A "?" is saved in
the text output for those cases, but this does not allow an automated system to determine
how much of the document that  PDFBox was able to process. There should be a way for the caller
to determine how much of the file PDFBox could process. 

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message