lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Otis Gospodnetic <otis_gospodne...@yahoo.com>
Subject Re: Use of Convertes or Parser
Date Wed, 21 Jul 2004 16:02:51 GMT
Lucene cannot parse those document formats that you mentioned.  You
need 3rd party parsers to do that.  For example, POI will parse Excel
and MS Word docs, PDFBox will parse PDF.

Otis

--- "Natarajan.T" <natarajant@crimsonlogic.co.in> wrote:
> Hi Guys,
>  
> I have a small query, ie. Lucene 1.4 APIs directly indexing all the
> documents(PPT,PDF,WORD,etc.) then why we go for Converters or
> Parsers.
>  
>  
> Thanks,
> Natarajan.
>  
> 


---------------------------------------------------------------------
To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org
For additional commands, e-mail: lucene-user-help@jakarta.apache.org


Mime
View raw message