lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Pulkit Singhal (Issue Comment Edited) (JIRA)" <j...@apache.org>
Subject [jira] [Issue Comment Edited] (SOLR-1499) SolrEntityProcessor - DIH EntityProcessor that queries an external Solr via SolrJ
Date Tue, 11 Oct 2011 00:30:29 GMT

    [ https://issues.apache.org/jira/browse/SOLR-1499?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13124522#comment-13124522
] 

Pulkit Singhal edited comment on SOLR-1499 at 10/11/11 12:30 AM:
-----------------------------------------------------------------

The updated patch is for lucene-solr trunk is attached. Sorry for naming it badly but apparently
I can't edit the file name after attaching it: SOLR-1499.rev1181269.buggy.patch

I need to message multivalued fields, is there any guidance around that? I know its not tested
but how should one go about experimenting with it?

FYI: To prove the patch works, I got a basic sanity-test to work where the data-config.xml
file in my bbyopen2 core got its data from the initital bbyopen core:
  1 <dataConfig>
  2   <document>
  3     <entity name="sep"
  4             processor="SolrEntityProcessor"
  5             url="http://localhost:8983/solr/bbyopen"
  6             query="sku:1000159"
  7             format="javabin"
  8             transformer="TemplateTransformer">
  9       <field column="sku" template="COPYOF-${sep.sku}"/>
 10     </entity>
 11    </document>
 12 </dataConfig>
                
      was (Author: pulkitsinghal@gmail.com):
    The updated patch is for lucene-solr trunk is attached.

I need to message multivalued fields, is there any guidance around that? I know its not tested
but how should one go about experimenting with it?

FYI: To prove the patch works, I got a basic sanity-test to work where the data-config.xml
file in my bbyopen2 core got its data from the initital bbyopen core:
  1 <dataConfig>
  2   <document>
  3     <entity name="sep"
  4             processor="SolrEntityProcessor"
  5             url="http://localhost:8983/solr/bbyopen"
  6             query="sku:1000159"
  7             format="javabin"
  8             transformer="TemplateTransformer">
  9       <field column="sku" template="COPYOF-${sep.sku}"/>
 10     </entity>
 11    </document>
 12 </dataConfig>
                  
> SolrEntityProcessor - DIH EntityProcessor that queries an external Solr via SolrJ
> ---------------------------------------------------------------------------------
>
>                 Key: SOLR-1499
>                 URL: https://issues.apache.org/jira/browse/SOLR-1499
>             Project: Solr
>          Issue Type: New Feature
>          Components: contrib - DataImportHandler
>            Reporter: Lance Norskog
>             Fix For: 3.5, 4.0
>
>         Attachments: SOLR-1499.patch, SOLR-1499.patch, SOLR-1499.patch, SOLR-1499.patch,
SOLR-1499.patch, SOLR-1499.patch, SOLR-1499.rev1181269.buggy.patch
>
>
> The SolrEntityProcessor queries an external Solr instance. The Solr documents returned
are unpacked and emitted as DIH fields.
> The SolrEntityProcessor uses the following attributes:
> * solr='http://localhost:8983/solr/sms'
> ** This gives the URL of the target Solr instance.
> *** Note: the connection to the target Solr uses the binary SolrJ format.
> * query='Jefferson&sort=id+asc'
> ** This gives the base query string use with Solr. It can include any standard Solr request
parameter. This attribute is processed under the variable resolution rules and can be driven
in an inner stage of the indexing pipeline.
> * rows='10'
> ** This gives the number of rows to fetch per request..
> ** The SolrEntityProcessor always fetches every document that matches the request..
> * fields='id,tag'
> ** This selects the fields to be returned from the Solr request.
> ** These must also be declared as <field> elements.
> ** As with all fields, template processors can be used to alter the contents to be passed
downwards.
> * timeout='30'
> ** This limits the query to 5 seconds. This can be used as a fail-safe to prevent the
indexing session from freezing up. By default the timeout is 5 minutes.
> Limitations:
> * Solr errors are not handled correctly.
> * Loop control constructs have not been tested.
> * Multi-valued returned fields have not been tested.
> The unit tests give examples of how to use it as the root entity and an inner entity.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


Mime
View raw message