From: "Natarajan.T"
To: "'Lucene Users List'"
Subject: RE: Use of Convertes or Parser
Date: Thu, 22 Jul 2004 09:27:38 +0530
Message-ID: <000301c46fa0$081dd5c0$8714a8c0@ssl>
In-Reply-To: <20040721160251.36951.qmail@web12708.mail.yahoo.com>

Ok Thanks.

-----Original Message-----
From: Otis Gospodnetic [mailto:otis_gospodnetic@yahoo.com]
Sent: Wednesday, July 21, 2004 9:33 PM
To: Lucene Users List
Subject: Re: Use of Convertes or Parser

Lucene cannot parse the document formats you mentioned. You need
third-party parsers for that. For example, POI will parse Excel and
MS Word documents, and PDFBox will parse PDF.

Otis

--- "Natarajan.T" wrote:
> Hi Guys,
>
> I have a small query: if the Lucene 1.4 APIs can directly index all
> documents (PPT, PDF, Word, etc.), why do we need converters or
> parsers?
>
> Thanks,
> Natarajan.

---------------------------------------------------------------------
To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org
For additional commands, e-mail: lucene-user-help@jakarta.apache.org
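
The point in the reply above, that text extraction happens outside Lucene
and only the resulting plain text is indexed, could be sketched roughly as
follows. This is only an illustration under stated assumptions: the
ExtractorRegistry class and TextExtractor interface are hypothetical names
invented here, not part of Lucene, POI, or PDFBox, and the sketch uses
current Java syntax rather than what was available in 2004. Only a
plain-text extractor is wired in; a real setup would register extractors
that delegate to POI or PDFBox.

```java
import java.nio.charset.StandardCharsets;
import java.util.HashMap;
import java.util.Locale;
import java.util.Map;

// Hypothetical helper (not a Lucene class): picks a parser by file extension.
class ExtractorRegistry {

    // Hypothetical interface: turns raw file bytes into plain text.
    interface TextExtractor {
        String extract(byte[] content) throws Exception;
    }

    private final Map<String, TextExtractor> byExtension = new HashMap<>();

    // Associates a file extension (case-insensitive) with an extractor.
    void register(String extension, TextExtractor extractor) {
        byExtension.put(extension.toLowerCase(Locale.ROOT), extractor);
    }

    // Returns the plain text of a file, or fails if no parser is registered.
    String extractText(String fileName, byte[] content) throws Exception {
        int dot = fileName.lastIndexOf('.');
        String ext = dot < 0 ? "" : fileName.substring(dot + 1).toLowerCase(Locale.ROOT);
        TextExtractor extractor = byExtension.get(ext);
        if (extractor == null) {
            throw new IllegalArgumentException("No parser registered for: " + fileName);
        }
        return extractor.extract(content);
    }

    public static void main(String[] args) throws Exception {
        ExtractorRegistry registry = new ExtractorRegistry();

        // Plain text needs no third-party parser.
        registry.register("txt", bytes -> new String(bytes, StandardCharsets.UTF_8));

        // Assumption: extractors for "pdf", "doc", or "xls" would be registered
        // here, each wrapping the relevant third-party library (PDFBox, POI).

        String text = registry.extractText(
                "notes.txt", "hello lucene".getBytes(StandardCharsets.UTF_8));

        // The extracted string is what would then be handed to Lucene for indexing.
        System.out.println(text);
    }
}
```

With this arrangement the indexing code only ever sees strings, so
supporting a new format means registering one more extractor rather than
changing how documents are added to the index.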