Date: Fri, 3 Dec 2004 09:46:01 -0500 (EST)
From: Ben Litchfield
To: Lucene Users List
Subject: Re: PDF Indexing Error
In-Reply-To: <031701c4d944$b0f0c870$7703d00a@hypermedia.com>
References: <01e901c4d8c2$6451e440$7703d00a@hypermedia.com> <031701c4d944$b0f0c870$7703d00a@hypermedia.com>
List-Id: "Lucene Users List"

I don't think that is a good solution, as there are many bug fixes and
enhancements in the current version and you would never be able to upgrade.

The message that you are seeing, "You do not have permission to extract
text", is not a bug but intended functionality of PDFBox. PDFBox honors the
security settings in a PDF: if you don't have permission to extract the text,
then PDFBox won't allow you to do it, just as Acrobat will not allow you to do
it.

PDFBox supports *modification* of PDF documents as well as text extraction.

Ben

On Fri, 3 Dec 2004, Luke Shannon wrote:

> Hi Ben;
>
> Actually I think I did update PDFBox. I will put it back to the version I
> previously had.
>
> Luke
>
> ----- Original Message -----
> From: "Ben Litchfield"
> To: "Lucene Users List"
> Sent: Thursday, December 02, 2004 8:20 PM
> Subject: Re: PDF Indexing Error
>
> >
> > This error is because of security settings that have been applied to the
> > PDF document which disallow text extraction.
> >
> > Not sure why you would all of a sudden get this error, unless you upgraded
> > recently. Older versions of PDFBox did not fully support PDF security.
> >
> > Ben
> >
> > On Thu, 2 Dec 2004, Luke Shannon wrote:
> >
> > > Hello All;
> > >
> > > Perhaps this should be on the PDFBox forum but I was curious if anyone has
> > > seen this error parsing PDF documents using packages other than PDFBox.
> > >
> > > /usr/tomcat/fb_hub/GM/Administration/Document/java/java_io.pdf
> > > java.io.IOException: You do not have permission to extract text
> > >
> > > The weird thing is it gave this error on a document I have indexed a million
> > > times over the last 3 weeks.
> > >
> > > Thanks,
> > >
> > > Luke

---------------------------------------------------------------------
To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org
For additional commands, e-mail: lucene-user-help@jakarta.apache.org