Return-Path: Delivered-To: apmail-jackrabbit-dev-archive@www.apache.org Received: (qmail 20858 invoked from network); 6 Nov 2009 18:13:55 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 6 Nov 2009 18:13:55 -0000 Received: (qmail 16041 invoked by uid 500); 6 Nov 2009 18:13:54 -0000 Delivered-To: apmail-jackrabbit-dev-archive@jackrabbit.apache.org Received: (qmail 15971 invoked by uid 500); 6 Nov 2009 18:13:54 -0000 Mailing-List: contact dev-help@jackrabbit.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@jackrabbit.apache.org Delivered-To: mailing list dev@jackrabbit.apache.org Received: (qmail 15962 invoked by uid 99); 6 Nov 2009 18:13:54 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 06 Nov 2009 18:13:54 +0000 X-ASF-Spam-Status: No, hits=-10.5 required=5.0 tests=AWL,BAYES_00,RCVD_IN_DNSWL_HI X-Spam-Check-By: apache.org Received: from [140.211.11.140] (HELO brutus.apache.org) (140.211.11.140) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 06 Nov 2009 18:13:52 +0000 Received: from brutus (localhost [127.0.0.1]) by brutus.apache.org (Postfix) with ESMTP id 7C85E234C052 for ; Fri, 6 Nov 2009 10:13:32 -0800 (PST) Message-ID: <1446055153.1257531212508.JavaMail.jira@brutus> Date: Fri, 6 Nov 2009 18:13:32 +0000 (UTC) From: "William Woodward (JIRA)" To: dev@jackrabbit.apache.org Subject: [jira] Created: (JCR-2388) Upgrade PDFBox to version 0.8.0 MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 Upgrade PDFBox to version 0.8.0 ------------------------------- Key: JCR-2388 URL: https://issues.apache.org/jira/browse/JCR-2388 Project: Jackrabbit Content Repository Issue Type: Improvement Components: jackrabbit-text-extractors Affects Versions: 2.0-beta1 Reporter: William Woodward Fix For: 2.0-beta2 The most recent version of PDFBox fixes a bug in their PDFParser class that caused a null pointer when attempting to extract text from documents created w/ Acrobat Pro version 9. see: https://issues.apache.org/jira/browse/PDFBOX-361. Since this is the first Apache incubator release they have also changed the package names. Therefore, simply getting the new PDFBox in not an option because the Jackrabbit text extractor references the old package names. This is a MAJOR problem for us since our user community recently updated to Acrobat 9 (and we have no control over this decision). Our users produce time sensitive reports. Without an updated Jackrabbit (w/ updated PDFBox) we can no longer extract and index text from the user's PDFs. Thank you for your consideration in this matter, Bill Woodward Developer -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.