manifoldcf-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Karl Wright (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (CONNECTORS-1503) UpdateProcessor SolrCloud and ManifoldCF
Date Wed, 25 Apr 2018 10:30:00 GMT

    [ https://issues.apache.org/jira/browse/CONNECTORS-1503?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16452026#comment-16452026
] 

Karl Wright commented on CONNECTORS-1503:
-----------------------------------------

Hi [~Moltroon],

I'm still just trying to understand how you have things set up.

(1) The parameters you use to configure the Tika extractor are not affected by how you configure
the Solr output connector.

(2) It sounds like you have a complex pipeline, which probably includes the Tika Extractor,
and the Metadata Adjuster too.  It sounds furthermore like the Metadata Adjuster comes after
the Tika Extractor.  Is this in fact the case?

(3) If that is your setup, then the right handler to use is not the Extracting Update Handler.
 It is the standard Update Handler.

(4) It sounds like the "processor" argument works with the Extracting Update Handler but does
not work with the standard Update Handler.

Is this summary correct?  If it is, I can look into why the processor argument is not working
for that handler.  Please verify.




> UpdateProcessor SolrCloud and ManifoldCF
> ----------------------------------------
>
>                 Key: CONNECTORS-1503
>                 URL: https://issues.apache.org/jira/browse/CONNECTORS-1503
>             Project: ManifoldCF
>          Issue Type: Bug
>          Components: Solr 6.x component
>    Affects Versions: ManifoldCF 2.9.1
>         Environment: SolrCloud 6.6
> ManifoldCF 2.9.1
>            Reporter: Maxence SAUNIER
>            Assignee: Shinichiro Abe
>            Priority: Major
>         Attachments: 20170421-1740.png, jira_update_processor.png, manifoldcf_arguments_uniqFields.png,
manifoldcf_output_conf.zip
>
>
> Hello,
> [Link to Apache mail archive|http://mail-archives.apache.org/mod_mbox/manifoldcf-user/201804.mbox/%3C079e01d3d7da%24807b8f60%248172ae20%24%40citya.com%3E]
> When we used Argument option in ManifoldCF for SolrCloud, ManifoldCF add they arguments
on the POST request and not on the url parameters. So, for add a (pre)processor or a post-processor
with the url, it's not possible.
> [SolrConfig updateRequestProcessorChain|https://lucene.apache.org/solr/guide/6_6/config-api.html#ConfigAPI-Whatabout_updateRequestProcessorChain_]
> [call UpdateRequestProcessors|https://lucene.apache.org/solr/guide/6_6/update-request-processors.html#UpdateRequestProcessors-Processor_Post-ProcessorRequestParameters]
> [Conf image|https://image.ibb.co/cZC8bn/jira_update_processor.png]
> Solr response:
> org.apache.solr.common.SolrException: ERROR: [doc=file://///srvics01/ways_holding/gestion_ged/gerance/3573/201102081135_ENVOIDEVISPP.doc]
unknown field 'processor'



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Mime
View raw message