Return-Path: Delivered-To: apmail-jakarta-lucene-user-archive@www.apache.org Received: (qmail 3176 invoked from network); 5 Nov 2003 08:51:42 -0000 Received: from daedalus.apache.org (HELO mail.apache.org) (208.185.179.12) by minotaur-2.apache.org with SMTP; 5 Nov 2003 08:51:42 -0000 Received: (qmail 31348 invoked by uid 500); 5 Nov 2003 08:51:21 -0000 Delivered-To: apmail-jakarta-lucene-user-archive@jakarta.apache.org Received: (qmail 31316 invoked by uid 500); 5 Nov 2003 08:51:21 -0000 Mailing-List: contact lucene-user-help@jakarta.apache.org; run by ezmlm Precedence: bulk List-Unsubscribe: List-Subscribe: List-Help: List-Post: List-Id: "Lucene Users List" Reply-To: "Lucene Users List" Delivered-To: mailing list lucene-user@jakarta.apache.org Received: (qmail 31303 invoked from network); 5 Nov 2003 08:51:20 -0000 Received: from unknown (HELO gic-msg-lin-01.genotec.ch) (212.80.184.211) by daedalus.apache.org with SMTP; 5 Nov 2003 08:51:20 -0000 Received: from gic-msg-lin-01.genotec.ch (localhost [127.0.0.1]) by gic-msg-lin-01.virge.genotec.ch (Postfix) with ESMTP id 702A81A089 for ; Wed, 5 Nov 2003 09:51:32 +0100 (CET) Received: from it1024 (unknown [80.238.139.246]) by gic-msg-lin-01.genotec.ch (Postfix) with ESMTP id 9E7F51997A for ; Wed, 5 Nov 2003 09:51:31 +0100 (CET) From: =?us-ascii?Q?Marcel_Stor?= To: Subject: Index entire filesystem Date: Wed, 5 Nov 2003 09:51:29 +0100 Message-ID: MIME-Version: 1.0 Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7bit X-Priority: 3 (Normal) X-MSMail-Priority: Normal X-Mailer: Microsoft Outlook, Build 10.0.4024 X-MimeOLE: Produced By Microsoft MimeOLE V6.00.2800.1106 Importance: Normal X-Spam-Rating: daedalus.apache.org 1.6.2 0/1000/N X-Spam-Rating: minotaur-2.apache.org 1.6.2 0/1000/N Hi all, I'm thinkin' about writing a search tool for my filesystem. I know such things exist already but programming it myself is much more fun ;-) So, I would have Lucene crawl through my filesystem and pass each file to an appropriate indexer (PDF -> PDFbox, etc.). Yes, I run a Windows system and would depend on the file ending to distinguish the file type. Is this a good idea in general? Is there a list of available indexer for the the different file types? Any other comments are also welcome. Regards, Marcel --------------------------------------------------------------------- To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org For additional commands, e-mail: lucene-user-help@jakarta.apache.org