poi-general mailing list archives

From Nick Burch <n...@torchbox.com>
Subject Re: using POI with Lucene/Solr for document search
Date Mon, 08 Dec 2008 18:29:17 GMT
On Thu, 4 Dec 2008, Steve Ruzila wrote:
> I'm working on a project that would use Lucene/Solr as the backend for 
> searching through thousands of MS Office and PDF files. I want to be 
> able to do keyword searches on these files. I'm not quite clear as to 
> how POI works with Lucene and what relationship the Tika project has 
> with it....since Tika seems to use POI.

You might find it easier to just use Tika and let it handle the POI
specifics.

Otherwise, we do have lots of dedicated text extractors in POI, and
you'd probably want to use those directly.
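
Either way the flow is the same: pull plain text out of each file, then hand
that text to the search backend. Here is a rough sketch of that flow in plain
Java. Everything in it is a made-up placeholder, not POI, Tika, or Lucene API:
the `Function<byte[], String>` stands in for whichever extractor you pick, and
the map-based index stands in for Lucene/Solr.

```java
import java.nio.charset.StandardCharsets;
import java.util.HashMap;
import java.util.Locale;
import java.util.Map;
import java.util.Set;
import java.util.TreeSet;
import java.util.function.Function;

// Illustrative only: shows the "extract text, then index it" split.
public class ExtractThenIndexSketch {
    // Hypothetical stand-in for the search backend (Lucene/Solr in practice):
    // maps each lower-cased token to the ids of the documents containing it.
    private final Map<String, Set<String>> postings = new HashMap<>();

    // Hypothetical stand-in for the extraction step (Tika or a POI extractor).
    private final Function<byte[], String> extractor;

    public ExtractThenIndexSketch(Function<byte[], String> extractor) {
        this.extractor = extractor;
    }

    // Extract plain text from the raw file, then index every token in it.
    public void add(String docId, byte[] rawFile) {
        String text = extractor.apply(rawFile);
        for (String token : text.toLowerCase(Locale.ROOT).split("[^\\p{L}\\p{N}]+")) {
            if (!token.isEmpty()) {
                postings.computeIfAbsent(token, k -> new TreeSet<>()).add(docId);
            }
        }
    }

    // Single-keyword lookup; returns a sorted copy of the matching ids.
    public Set<String> search(String keyword) {
        Set<String> hits = postings.get(keyword.toLowerCase(Locale.ROOT));
        return hits == null ? new TreeSet<>() : new TreeSet<>(hits);
    }

    public static void main(String[] args) {
        // Placeholder extractor: treats the bytes as UTF-8 text. A real one
        // would parse the Office or PDF format instead.
        ExtractThenIndexSketch index = new ExtractThenIndexSketch(
                bytes -> new String(bytes, StandardCharsets.UTF_8));
        index.add("report.doc", "Quarterly budget report".getBytes(StandardCharsets.UTF_8));
        index.add("notes.pdf", "Meeting notes about the budget".getBytes(StandardCharsets.UTF_8));
        System.out.println(index.search("budget"));
        System.out.println(index.search("meeting"));
    }
}
```

The point of keeping the extractor behind one function is that the indexing
side never needs to know which file format, or which extraction library,
produced the text.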

Nick
