jackrabbit-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "William Woodward (JIRA)" <j...@apache.org>
Subject [jira] Created: (JCR-2388) Upgrade PDFBox to version 0.8.0
Date Fri, 06 Nov 2009 18:13:32 GMT
Upgrade PDFBox to version 0.8.0

                 Key: JCR-2388
                 URL: https://issues.apache.org/jira/browse/JCR-2388
             Project: Jackrabbit Content Repository
          Issue Type: Improvement
          Components: jackrabbit-text-extractors
    Affects Versions: 2.0-beta1
            Reporter: William Woodward
             Fix For: 2.0-beta2

The most recent version of PDFBox fixes a bug in their PDFParser class that caused a null
pointer when attempting to extract text from documents created w/ Acrobat Pro version 9. see:
https://issues.apache.org/jira/browse/PDFBOX-361. Since this is the first Apache incubator
release they have also changed the package names. Therefore, simply getting the new PDFBox
in not an option because the Jackrabbit text extractor references the old package names.

This is a MAJOR problem for us since our user community recently updated to Acrobat 9 (and
we have no control over this decision). Our users produce time sensitive reports. Without
an updated Jackrabbit (w/ updated PDFBox) we can no longer extract and index text from the
user's PDFs.

Thank you for your consideration in this matter,

Bill Woodward

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.

View raw message