lucene-dev mailing list archives

From "Kelvin Tan" <kel...@relevanz.com>
Subject Re: Contributor Document class repository proposal
Date Sat, 01 Dec 2001 03:22:12 GMT
[snip]
> PDF is hot.  MS Word would be awesome (anyone done this?).  Folks would
> salivate just to be able to download Lucene and point it at a directory
> tree and have it indexed without having to write any code - and only when
> they needed something custom would they then dig deeper and learn its API.
>

Have done something which seems to work *ok* on most MS Word, Excel and
PowerPoint files. Runs through the binary files and extracts text...
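
For illustration only: one naive way to "run through a binary file and
extract text" is to collect runs of printable ASCII bytes, much like the
Unix strings tool. This is a sketch under that assumption, not the code
described in this message. The class name BinaryTextSketch, the method
extractRuns and the minimum run length are all hypothetical.

```java
import java.nio.file.Files;
import java.nio.file.Paths;
import java.util.ArrayList;
import java.util.List;

public class BinaryTextSketch {

    /**
     * Returns every run of at least minLen consecutive printable ASCII
     * bytes (0x20 to 0x7E) found in data, in order of appearance.
     * Assumption: text is stored as 8-bit characters. Text stored as
     * UTF-16 would be broken up by its zero bytes and mostly missed.
     */
    static List<String> extractRuns(byte[] data, int minLen) {
        List<String> runs = new ArrayList<>();
        StringBuilder current = new StringBuilder();
        for (byte b : data) {
            int c = b & 0xFF; // treat the byte as unsigned
            if (c >= 0x20 && c <= 0x7E) {
                current.append((char) c);
            } else {
                // A non-printable byte ends the current run.
                if (current.length() >= minLen) {
                    runs.add(current.toString());
                }
                current.setLength(0);
            }
        }
        // Flush a run that reaches the end of the data.
        if (current.length() >= minLen) {
            runs.add(current.toString());
        }
        return runs;
    }

    public static void main(String[] args) throws Exception {
        if (args.length == 0) {
            System.err.println("usage: java BinaryTextSketch <file>");
            return;
        }
        byte[] data = Files.readAllBytes(Paths.get(args[0]));
        // A minimum run length of 4 is an arbitrary choice to cut noise.
        for (String run : extractRuns(data, 4)) {
            System.out.println(run);
        }
    }
}
```

The extracted runs could then be joined into a single string and added
to a Lucene Document as a text field. A scan like this will also pick up
non-text debris such as font names and internal stream names, so it
fits the "seems to work ok on most files" caveat rather than a
format-aware parser.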

Kelvin



--
To unsubscribe, e-mail:   <mailto:lucene-dev-unsubscribe@jakarta.apache.org>
For additional commands, e-mail: <mailto:lucene-dev-help@jakarta.apache.org>

