lucene-java-user mailing list archives

From "Sertic Mirko, Bedag" <Mirko.Ser...@bedag.ch>
Subject AW: Parsing MSWord
Date Wed, 12 Nov 2008 08:25:47 GMT
Hi

You can also use a tool called "antiword" to extract the text from a .doc file and then
pass that text to Lucene for indexing.

See here : http://en.wikipedia.org/wiki/Antiword
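
A rough sketch of how this could look from Java, in case it helps. It assumes the
antiword binary is installed and on the PATH, that it is called as "antiword <file>"
and writes the plain text to standard output, and that the output encoding matches
the platform default charset. It also assumes Java 11 or later. The class and method
names (AntiwordExtractor, antiwordCommand, runAndCapture) are made up for this
example. The Lucene indexing step is only described in a comment, because the exact
Document/Field calls depend on the Lucene version.

```java
import java.io.IOException;
import java.io.InputStream;
import java.nio.charset.Charset;
import java.nio.file.Path;
import java.util.List;

public class AntiwordExtractor {

    // Builds the command line for antiword. Assumption: "antiword" is on the PATH
    // and takes the .doc file as its only argument.
    public static List<String> antiwordCommand(Path docFile) {
        return List.of("antiword", docFile.toString());
    }

    // Runs an external command and returns everything it wrote to standard output.
    public static String runAndCapture(List<String> command)
            throws IOException, InterruptedException {
        ProcessBuilder builder = new ProcessBuilder(command);
        // Forward the child's error output to our own stderr so the child
        // cannot block on a full, unread stderr pipe.
        builder.redirectError(ProcessBuilder.Redirect.INHERIT);
        Process process = builder.start();

        String output;
        try (InputStream in = process.getInputStream()) {
            // Assumption: the tool writes text in the platform default charset.
            output = new String(in.readAllBytes(), Charset.defaultCharset());
        }

        int exitCode = process.waitFor();
        if (exitCode != 0) {
            throw new IOException("Command failed with exit code " + exitCode + ": " + command);
        }
        return output;
    }

    public static void main(String[] args) throws Exception {
        if (args.length != 1) {
            System.err.println("Usage: java AntiwordExtractor <file.doc>");
            return;
        }
        String text = runAndCapture(antiwordCommand(Path.of(args[0])));
        // At this point 'text' would be added to a Lucene Document as the value
        // of a tokenized field and handed to an IndexWriter.
        System.out.println(text);
    }
}
```

Keeping the extraction step in a separate process means Lucene only ever sees plain
text, so the same indexing code can be reused for other document formats.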

Regards
Mirko

-----Original Message-----
From: dipesh [mailto:dipshrestha@gmail.com]
Sent: Wednesday, 12 November 2008 04:38
To: java-user@lucene.apache.org
Subject: Parsing MSWord

Hello,
I wanted to know if there are classes in Lucene that support parsing MSWord
documents.
Many thanks,
Dipesh

----------------------------------------
"Help Ever Hurt Never"- Baba