lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Michael Della Bitta <>
Subject Re: Need additional data processing in Data Import Handler prior to indexing
Date Tue, 29 Oct 2013 21:09:19 GMT
Hi Dileepa,

You can write your own Transformers in Java. If it doesn't make sense to
run Stanbol calls in a Transformer, maybe setting up a web service that
grabs a record out of MySQL, sends the data to Stanbol, and displays the
results could be used in conjunction with HttpDataSource rather than

Michael Della Bitta

Applications Developer

o: +1 646 532 3062  | c: +1 917 477 7906

appinions inc.

“The Science of Influence Marketing”

18 East 41st Street

New York, NY 10017

t: @appinions <> | g+:<>
w: <>

On Tue, Oct 29, 2013 at 4:47 PM, Dileepa Jayakody <
> wrote:

> Hi All,
> I'm a newbie to Solr, and I have a requirement to import data from a mysql
> database; enhance  the imported content to identify Persons mentioned  and
> index it as a separate field in Solr along with the other fields defined
> for the original db query.
> I'm using Apache Stanbol [1] for the content enhancement requirement.
> I can get enhancement results for 'Person' type data in the content as the
> enhancement result.
> The data flow will be;
> mysql-db > Solr data-import handler > Stanbol enhancer > Solr index
> For the above requirement I need to perform additional processing at the
> data-import handler prior to indexing to send a request to Stanbol and
> process the enhancement response. I found some related examples on
> modifying mysql data import handler to customize the query results in
> db-data-config.xml by using a transformer script.
> As per my requirement, In the data-import-handler I need to send a request
> to Stanbol and process the response prior to indexing. But I'm not sure if
> this can be achieved using a simple javascript.
> Is there any other better way of achieving my requirement? Maybe writing a
> custom filter in Solr?
> Please share your thoughts. Appreciate any pointers as I'm a beginner for
> Solr.
> Thanks,
> Dileepa
> [1]

  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message