poi-general mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Nick Burch <n...@torchbox.com>
Subject Re: using POI with Lucene/Solr for document search
Date Mon, 08 Dec 2008 18:29:17 GMT
On Thu, 4 Dec 2008, Steve Ruzila wrote:
> I'm working on a project that would use Lucene/Solr as the backend for 
> searching through thousands of MS Office and PDF files. I want to be 
> able to do keyword searches on these files. I'm not quite clear as to 
> how POI works with Lucene and what relationship the Tika project has 
> with it....since Tika seems to use POI.

You might find it easier to just use tika, and let it handle the poi 

Otherwise, we do have lots of dedicated text extractors in poi. You'd 
probably want to use those


To unsubscribe, e-mail: general-unsubscribe@poi.apache.org
For additional commands, e-mail: general-help@poi.apache.org

View raw message