Mailing-List: contact tika-dev-help@lucene.apache.org; run by ezmlm
Reply-To: tika-dev@lucene.apache.org
Message-ID: <29981479.136231271321689673.JavaMail.jira@thor>
Date: Thu, 15 Apr 2010 04:54:49 -0400 (EDT)
From: "Jukka Zitting (JIRA)"
To: tika-dev@lucene.apache.org
Subject: [jira] Resolved: (TIKA-243) Fire event at start- and end of archive parsing
In-Reply-To: <1918097570.1244056747328.JavaMail.jira@brutus>

     [ https://issues.apache.org/jira/browse/TIKA-243?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Jukka Zitting resolved TIKA-243.
--------------------------------

      Assignee: Jukka Zitting
    Resolution: Won't Fix

Resolving as Won't Fix, since nowadays it's possible to pass a custom parser instance through the parse context that'll be called for each component document within an archive. This is the recommended way to implement custom handling of nested documents.

> Fire event at start- and end of archive parsing
> -----------------------------------------------
>
>                 Key: TIKA-243
>                 URL: https://issues.apache.org/jira/browse/TIKA-243
>             Project: Tika
>          Issue Type: Improvement
>          Components: parser
>    Affects Versions: 0.3
>            Reporter: Jan Goyvaerts
>            Assignee: Jukka Zitting
>            Priority: Minor
>
> Archive parsers fire start- and end-document events, but these events are suppressed, probably because only one pair of start- and end-document events is allowed.
> Getting these events, or equivalents, in one way or another is needed to index the files contained in an archive separately. When Lucene has a hit on a file inside an archive, the user must be given the choice to download just that file rather than the entire archive. Otherwise the user is left to figure out how to decompress the archive (assuming a suitable decompressor is installed) and locate the required file.

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators: https://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see: http://www.atlassian.com/software/jira
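
For readers following up on the resolution comment above, here is a rough, JDK-only sketch of the delegation pattern it describes: a handler is placed in a context object, and the archive walker calls it once per component document. `Context`, `EmbeddedHandler`, `parseArchive`, and the sample file names are hypothetical stand-ins made up for this illustration and are not Tika classes. In Tika itself the handler would be a custom parser instance passed through the parse context, as the comment says.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.zip.ZipEntry;
import java.util.zip.ZipInputStream;
import java.util.zip.ZipOutputStream;

class NestedDocumentSketch {

    /** Hypothetical stand-in for a per-component parser; not a Tika interface. */
    interface EmbeddedHandler {
        void handle(String name, InputStream content) throws IOException;
    }

    /** Hypothetical stand-in for a parse context: a map keyed by type. */
    static class Context {
        private final Map<Class<?>, Object> values = new HashMap<>();

        <T> void set(Class<T> key, T value) {
            values.put(key, value);
        }

        <T> T get(Class<T> key) {
            return key.cast(values.get(key));
        }
    }

    /** Walks a zip stream and hands each file entry to the handler found in the context. */
    static void parseArchive(InputStream archive, Context context) throws IOException {
        EmbeddedHandler handler = context.get(EmbeddedHandler.class);
        try (ZipInputStream zip = new ZipInputStream(archive)) {
            ZipEntry entry;
            while ((entry = zip.getNextEntry()) != null) {
                if (handler != null && !entry.isDirectory()) {
                    // The handler reads only the current entry and must not close the stream.
                    handler.handle(entry.getName(), zip);
                }
            }
        }
    }

    /** Builds a small in-memory zip with two made-up entries for the demo. */
    static byte[] sampleZip() throws IOException {
        ByteArrayOutputStream bytes = new ByteArrayOutputStream();
        try (ZipOutputStream zip = new ZipOutputStream(bytes)) {
            zip.putNextEntry(new ZipEntry("a.txt"));
            zip.write("hello".getBytes(StandardCharsets.UTF_8));
            zip.closeEntry();
            zip.putNextEntry(new ZipEntry("docs/b.txt"));
            zip.write("abc".getBytes(StandardCharsets.UTF_8));
            zip.closeEntry();
        }
        return bytes.toByteArray();
    }

    /** Registers a handler that records "name=size" per entry, then parses the sample zip. */
    static List<String> run() throws IOException {
        List<String> seen = new ArrayList<>();
        EmbeddedHandler handler =
                (name, content) -> seen.add(name + "=" + content.readAllBytes().length);
        Context context = new Context();
        context.set(EmbeddedHandler.class, handler);
        parseArchive(new ByteArrayInputStream(sampleZip()), context);
        return seen;
    }

    public static void main(String[] args) throws IOException {
        System.out.println(run());
    }
}
```

The handler is where per-file indexing could hook in: it sees each component's name and content separately, so a search hit can point at the individual file rather than the whole archive, which is what the original request was after.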