lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From vineet yadav <vineet.yadav.i...@gmail.com>
Subject Calling custom request handler with data import
Date Fri, 30 Jan 2015 14:07:22 GMT
Hi,
I am using data import handler to import data from mysql, and I want to
identify name entities from it. So I am using following example(
http://www.searchbox.com/named-entity-recognition-ner-in-solr/). where I am
using stanford ner to identify name entities. I am using following
requesthandler

<requestHandler name="/dataimport"
class="org.apache.solr.handler.dataimport.DataImportHandler">
<lst name="defaults">
 <str name="config">data-import.xml</str>
 </lst>
</requestHandler>

for importing data from mysql and

<requestHandler name="/ner" class="com.searchbox.ner.NerHandler" />
  <updateRequestProcessorChain name="mychain" >
   <processor class="com.searchbox.ner.NerProcessorFactory" >
     <lst name="queryFields">
       <str name="queryField">content</str>
     </lst>
   </processor>
   <processor class="solr.LogUpdateProcessorFactory" />
   <processor class="solr.RunUpdateProcessorFactory" />
 </updateRequestProcessorChain>
 <requestHandler name="/update" class="solr.UpdateRequestHandler">
       <lst name="defaults">
         <str name="update.chain">mychain</str>
       </lst>
  </requestHandler>

for identifying name entities.NER request handler identifies name entities
from content field, but store extracted entities in solr fields.

NER request handler was working when I am using nutch with solr. But When I
am importing data from mysql, ner request handler is not invoked. So
entities are not stored in solr for imported documents. Can anybody tell me
how to call custom request handler in data import handler.

Otherwise if I can invoke ner request handler externally, so that it can
index person, organization and location in solr for imported document. It
is also fine. Any suggestion are welcome.

Thanks
Vineet Yadav

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message