Return-Path: Delivered-To: apmail-jakarta-lucene-user-archive@www.apache.org Received: (qmail 42684 invoked from network); 10 Nov 2004 16:55:44 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (209.237.227.199) by minotaur-2.apache.org with SMTP; 10 Nov 2004 16:55:44 -0000 Received: (qmail 77354 invoked by uid 500); 10 Nov 2004 16:54:17 -0000 Delivered-To: apmail-jakarta-lucene-user-archive@jakarta.apache.org Received: (qmail 77257 invoked by uid 500); 10 Nov 2004 16:54:16 -0000 Mailing-List: contact lucene-user-help@jakarta.apache.org; run by ezmlm Precedence: bulk List-Unsubscribe: List-Subscribe: List-Help: List-Post: List-Id: "Lucene Users List" Reply-To: "Lucene Users List" Delivered-To: mailing list lucene-user@jakarta.apache.org Received: (qmail 77161 invoked by uid 99); 10 Nov 2004 16:54:15 -0000 X-ASF-Spam-Status: No, hits=0.1 required=10.0 tests=DNS_FROM_RFC_ABUSE X-Spam-Check-By: apache.org Received-SPF: pass (hermes.apache.org: local policy) Received: from [216.136.173.241] (HELO web12704.mail.yahoo.com) (216.136.173.241) by apache.org (qpsmtpd/0.28) with SMTP; Wed, 10 Nov 2004 08:54:11 -0800 Received: (qmail 41445 invoked by uid 60001); 10 Nov 2004 16:54:07 -0000 Comment: DomainKeys? See http://antispam.yahoo.com/domainkeys DomainKey-Signature: a=rsa-sha1; q=dns; c=nofws; s=s1024; d=yahoo.com; b=M8pcp/Q6oa+PcY0MJgSc48f3jitxSWo8vQv+C06SCXSfdP82QaqszkGWjQVh0jkzXKOD8hx+kge66hN1sE1RaZ7yMVfOi0h0iYy04wSCPGFPQ7sjghLVZT0IfDfGVJRhTDsDn78IMBN3ZGAfwMbVETKWVI6uabvT5M1qA+mD5lM= ; Message-ID: <20041110165406.41443.qmail@web12704.mail.yahoo.com> Received: from [216.194.17.194] by web12704.mail.yahoo.com via HTTP; Wed, 10 Nov 2004 08:54:06 PST Date: Wed, 10 Nov 2004 08:54:06 -0800 (PST) From: Otis Gospodnetic Subject: Re: Indexing MS Files To: Lucene Users List In-Reply-To: <02ff01c4c742$e0393130$7703d00a@hypermedia.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii X-Virus-Checked: Checked X-Spam-Rating: minotaur-2.apache.org 1.6.2 0/1000/N That's one place to start. The other one would be textmining.org, at least for Word files. I used both POI and Textmining API in Lucene in Action, and the latter was much simpler to use. You can also find some comments about both libs in lucene-user archives. People tend to like Textmining API better. Otis --- Luke Shannon wrote: > I need to index Word, Excel and Power Point files. > > Is this the place to start? > > http://jakarta.apache.org/poi/ > > Is there something better? > > Thanks, > > Luke --------------------------------------------------------------------- To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org For additional commands, e-mail: lucene-user-help@jakarta.apache.org