lucene-java-user mailing list archives

From Grant Ingersoll <gsing...@apache.org>
Subject Re: How to index pdf, html, doc and other MIME types in lucene
Date Wed, 31 Dec 2008 13:58:47 GMT
See Tika:  http://lucene.apache.org/tika

-Grant
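
For readers landing on this thread, the pointer above can be sketched roughly as follows: Tika extracts plain text from each file regardless of format, and that text is added to a Lucene index as an ordinary field. This is only a minimal sketch, not a definitive implementation. It assumes a recent Lucene (5 or later) and Tika with its parser modules on the classpath; the class name `TikaIndexer`, the method `indexFiles`, and the field names `path`, `mime`, and `contents` are made up for illustration.

```java
import java.io.IOException;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.ArrayList;
import java.util.List;

import org.apache.lucene.analysis.standard.StandardAnalyzer;
import org.apache.lucene.document.Document;
import org.apache.lucene.document.Field;
import org.apache.lucene.document.StringField;
import org.apache.lucene.document.TextField;
import org.apache.lucene.index.IndexWriter;
import org.apache.lucene.index.IndexWriterConfig;
import org.apache.lucene.store.Directory;
import org.apache.lucene.store.FSDirectory;
import org.apache.tika.Tika;
import org.apache.tika.exception.TikaException;

// Hypothetical helper class; the name and field names are illustrative only.
public class TikaIndexer {

    // Extracts text from each file with Tika and adds one Lucene document per file.
    // Returns the number of files that were indexed.
    static int indexFiles(Directory directory, List<Path> files) throws IOException {
        Tika tika = new Tika(); // facade that auto-detects the file type
        int indexed = 0;
        IndexWriterConfig config = new IndexWriterConfig(new StandardAnalyzer());
        try (IndexWriter writer = new IndexWriter(directory, config)) {
            for (Path file : files) {
                String text;
                try {
                    // Detects the format and returns the extracted plain text.
                    text = tika.parseToString(file.toFile());
                } catch (TikaException e) {
                    // Skip files Tika cannot parse rather than aborting the run.
                    System.err.println("Skipping " + file + ": " + e.getMessage());
                    continue;
                }
                Document doc = new Document();
                // Stored, untokenized fields for identifying the hit later.
                doc.add(new StringField("path", file.toString(), Field.Store.YES));
                doc.add(new StringField("mime", tika.detect(file.toFile()), Field.Store.YES));
                // Tokenized, unstored field that full-text queries run against.
                doc.add(new TextField("contents", text, Field.Store.NO));
                writer.addDocument(doc);
                indexed++;
            }
        }
        return indexed;
    }

    // Usage: java TikaIndexer <index-dir> <file> [<file> ...]
    public static void main(String[] args) throws IOException {
        if (args.length < 2) {
            System.err.println("Usage: TikaIndexer <index-dir> <file> [<file> ...]");
            return;
        }
        List<Path> files = new ArrayList<>();
        for (int i = 1; i < args.length; i++) {
            files.add(Paths.get(args[i]));
        }
        try (Directory directory = FSDirectory.open(Paths.get(args[0]))) {
            System.out.println("Indexed " + indexFiles(directory, files) + " file(s)");
        }
    }
}
```

With only the Tika core jar on the classpath, type detection works but the extracted text will likely be empty, so the parser modules are needed for real PDF or Word content. The facade also caps the length of the returned string by default; `setMaxStringLength` on the `Tika` object adjusts that.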

On Dec 31, 2008, at 8:48 AM, Aaron Schon wrote:

> Search the list archives; this has been asked and answered several times.
>
>
>
> ----- Original Message ----
> From: NageswaraRao M <mnrao13@gmail.com>
> To: java-user@lucene.apache.org
> Sent: Wednesday, December 31, 2008 8:44:50 AM
> Subject: How to index pdf, html, doc and other MIME types in lucene
>
> Hi,
>
>     Please let me know how I can index files of different MIME types
> (pdf, html, doc, etc.) using Lucene.
>
> thanks
> Nagesh
>
>
>
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
> For additional commands, e-mail: java-user-help@lucene.apache.org
>

--------------------------
Grant Ingersoll

Lucene Helpful Hints:
http://wiki.apache.org/lucene-java/BasicsOfPerformance
http://wiki.apache.org/lucene-java/LuceneFAQ