Return-Path: Delivered-To: apmail-jackrabbit-dev-archive@www.apache.org Received: (qmail 94252 invoked from network); 25 Feb 2007 13:39:29 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 25 Feb 2007 13:39:29 -0000 Received: (qmail 5312 invoked by uid 500); 25 Feb 2007 13:39:36 -0000 Delivered-To: apmail-jackrabbit-dev-archive@jackrabbit.apache.org Received: (qmail 5267 invoked by uid 500); 25 Feb 2007 13:39:35 -0000 Mailing-List: contact dev-help@jackrabbit.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@jackrabbit.apache.org Delivered-To: mailing list dev@jackrabbit.apache.org Received: (qmail 5256 invoked by uid 99); 25 Feb 2007 13:39:35 -0000 Received: from herse.apache.org (HELO herse.apache.org) (140.211.11.133) by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 25 Feb 2007 05:39:35 -0800 X-ASF-Spam-Status: No, hits=0.0 required=10.0 tests= X-Spam-Check-By: apache.org Received: from [140.211.11.4] (HELO brutus.apache.org) (140.211.11.4) by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 25 Feb 2007 05:39:26 -0800 Received: from brutus (localhost [127.0.0.1]) by brutus.apache.org (Postfix) with ESMTP id 66DC1714041 for ; Sun, 25 Feb 2007 05:39:06 -0800 (PST) Message-ID: <24157096.1172410746418.JavaMail.jira@brutus> Date: Sun, 25 Feb 2007 05:39:06 -0800 (PST) From: "Jukka Zitting (JIRA)" To: dev@jackrabbit.apache.org Subject: [jira] Updated: (JCR-764) PdfTextFilter may leave parsed document open in case of errors In-Reply-To: <6977513.1172263685765.JavaMail.jira@brutus> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-Virus-Checked: Checked by ClamAV on apache.org [ https://issues.apache.org/jira/browse/JCR-764?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jukka Zitting updated JCR-764: ------------------------------ Component/s: indexing Fix Version/s: 1.2.3 Assignee: Jukka Zitting Priority: Minor (was: Trivial) Affects Version/s: 1.0 1.0.1 1.1 1.1.1 1.2.1 > PdfTextFilter may leave parsed document open in case of errors > -------------------------------------------------------------- > > Key: JCR-764 > URL: https://issues.apache.org/jira/browse/JCR-764 > Project: Jackrabbit > Issue Type: Bug > Components: indexing > Affects Versions: 1.0, 1.0.1, 1.1, 1.1.1, 1.2.1, 1.2.2 > Reporter: fabrizio giustina > Assigned To: Jukka Zitting > Priority: Minor > Fix For: 1.2.3 > > Attachments: textfilter_close.diff > > > In case of errors in a parsed PDF document jackrabbit may fail to properly close the parsed document. PDFBox will write a stack trace to system out at finalize to warn agains this. > this is the resulting log: > WARN org.apache.jackrabbit.core.query.LazyReader LazyReader.java(read:82) 20.02.2007 15:42:50 exception initializing reader org.apache.jackrabbit.core.query.PdfTextFilter$1: java.io.IOException: Error: Expected hex number, actual=' 2' > java.lang.Throwable: Warning: You did not close the PDF Document > at org.pdfbox.cos.COSDocument.finalize(COSDocument.java:384) > at java.lang.ref.Finalizer.invokeFinalizeMethod(Native Method) > at java.lang.ref.Finalizer.runFinalizer(Finalizer.java:83) > at java.lang.ref.Finalizer.access$100(Finalizer.java:14) > at java.lang.ref.Finalizer$FinalizerThread.run(Finalizer.java:160) > this may happens because the parse() method at > parser = new PDFParser(new BufferedInputStream(in)); > parser.parse(); > immediately creates a document, but it can throw an exception while processing the file. > PdfTextFilter should check if parser still holds a document and close it appropriately. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.