lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Ulf Dittmer <...@ulfdittmer.com>
Subject Re: indexing pdfs
Date Thu, 08 Mar 2007 10:12:46 GMT
For DOC files you can use the Jakarta POI library. Text extraction is  
outlined here: http://jakarta.apache.org/poi/hwpf/quick-guide.html

Ulf

On 08.03.2007, at 10:37, ashwin kumar wrote:

> hi can some one help me by giving any sample programs for indexing  
> pdfs and .doc files


---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org


Mime
View raw message