From lucene-user-return-1750-qmlist-jakarta-archive-lucene-user=jakarta.apache.org@jakarta.apache.org Thu May 23 00:12:30 2002 Return-Path: Delivered-To: apmail-jakarta-lucene-user-archive@apache.org Received: (qmail 41553 invoked from network); 23 May 2002 00:12:30 -0000 Received: from unknown (HELO nagoya.betaversion.org) (192.18.49.131) by daedalus.apache.org with SMTP; 23 May 2002 00:12:30 -0000 Received: (qmail 20480 invoked by uid 97); 23 May 2002 00:12:35 -0000 Delivered-To: qmlist-jakarta-archive-lucene-user@jakarta.apache.org Received: (qmail 20464 invoked by uid 97); 23 May 2002 00:12:35 -0000 Mailing-List: contact lucene-user-help@jakarta.apache.org; run by ezmlm Precedence: bulk List-Unsubscribe: List-Subscribe: List-Help: List-Post: List-Id: "Lucene Users List" Reply-To: "Lucene Users List" Delivered-To: mailing list lucene-user@jakarta.apache.org Received: (qmail 20447 invoked by uid 98); 23 May 2002 00:12:34 -0000 X-Antivirus: nagoya (v4198 created Apr 24 2002) Message-Id: <4.3.2.7.0.20020522200352.02509a80@mail.hq.nasa.gov> X-Sender: baltner@mail.hq.nasa.gov X-Mailer: QUALCOMM Windows Eudora Version 4.3.2 Date: Wed, 22 May 2002 20:13:46 -0400 To: lucene-user@jakarta.apache.org From: Bruce Altner Subject: powerpoint: sometimes it works...sometimes it doesn't Mime-Version: 1.0 Content-Type: text/plain; charset="us-ascii"; format=flowed X-Spam-Rating: daedalus.apache.org 1.6.2 0/1000/N X-Spam-Rating: daedalus.apache.org 1.6.2 0/1000/N Greetings: I am brand new to Lucene so please forgive me if the following is too naive for polite replies... I have built a web-based application to schedule and archive brown bag talks. The system uses a database for the scheduling and searching by title, author, topic, abstract, etc. but I want to add full text searching of the powerpoint files actually presented during the seminars. So I ran a quick index (using the demo API) on the ppt file of a past talk I'd given and Lucene handled it very well, finding hits 95% of the time. I was quite impressed and excited about the possibilities but then I indexed a more recent talk and Lucene failed completely, never once finding a term. Any idea why it would work on one ppt file but not on another? The first was created using powerpoint from Office 97 and the latter (failed) example from Office 2000 so that's a strong possibility but I wanted to run this by folks on the list for opinions. Thanks! Bruce PS My brown bag app is my second go-round with the jakarta Turbine framework. Ai'nt open source great! -- To unsubscribe, e-mail: For additional commands, e-mail: