lucene-solr-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Apache Wiki <wikidi...@apache.org>
Subject [Solr Wiki] Update of "ExtractingRequestHandler" by GrantIngersoll
Date Sun, 16 Nov 2008 16:24:04 GMT
Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Solr Wiki" for change notification.

The following page has been changed by GrantIngersoll:
http://wiki.apache.org/solr/ExtractingRequestHandler

------------------------------------------------------------------------------
  }}}
  
  NOTE, you can override this by implementing your own !SolrContentHandler as described below.
+ 
+ == When To Use ==
+ 
+ The !ExtractingRequestHandler can be used any time you have the need to index both the metadata
and text of binary documents like Word, PDF, etc.  It doesn't, however, make sense to use
it if you are only interested in indexing the metadata about documents, since it will be much
faster to determine the metadata on the client side and then send that as a normal Solr document.
 In fact, it might make sense for someone to write a piece for SolrJ that uses Tika on the
client-side to construct Solr documents.
  
  = Getting Started =
  

Mime
View raw message