lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "MOYSE Gilles (Cetelem)" <gilles.mo...@cetelem.fr>
Subject RE: Does the Lucene search engine work with PDF's?
Date Mon, 20 Oct 2003 07:34:34 GMT
You can also use the TextMining.org toolbox, which provides classes to
extract text from PDF and DOC files, using the Jakarta POI project. They are
all free, under Apache Licence. 

The URL
:http://www.textmining.org/modules.php?op=modload&name=News&file=article&sid
=6&mode=thread&order=0&thold=0).
(URL tested today) 

You can try the JGuru page : http://www.jguru.com/faq/view.jsp?EID=1074237

Gilles Moyse


-----Message d'origine-----
De : Andre Hughes [mailto:ahughes@emanagelaw.com]
Envoyé : samedi 18 octobre 2003 00:05
À : lucene-user@jakarta.apache.org
Objet : Does the Lucene search engine work with PDF's?


Hello,
Can the Lucene search engine index and search though PDF documents?
What are the file format limits for Lucene search engine.
 
Thanks in Advance,
 
Andre'

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message