lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Bill Janssen <>
Subject Re: Google Desktop Could be Better
Date Sun, 17 Oct 2004 11:09:18 GMT
Bill Tschumy writes:
> I've looked at pdfBox, but the jar file is so big that I 
> hate to burden my users by incorporating it.


My system (see uses
pdftotext underneath.  I've been very satisfied with that.  Another
Java solution would be to use Multivalent
(  Multivalent, by the way, advertises
the following:

"Extract text from all formats. Full-text search with Lucene."


To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message