manifoldcf-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Karl Wright (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (CONNECTORS-1503) UpdateProcessor SolrCloud and ManifoldCF
Date Tue, 24 Apr 2018 18:06:00 GMT

    [ https://issues.apache.org/jira/browse/CONNECTORS-1503?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16450313#comment-16450313
] 

Karl Wright commented on CONNECTORS-1503:
-----------------------------------------

Hi [~Moltroon],

This exception comes from Tika, and clearly also from the Tika running in Solr, as you know.
 You can disable the exception, as you know.  But if your documents have no content, what
are you indexing?  Are you trying to index metadata only?  Or do you expect there to be content
but there isn't any getting sent to Solr?

You can perhaps learn more about what's being sent to Solr by looking at the Solr log [INFO]
messages -- which should tell you the content length (among other things).  If you are seeing
a zero content length there, then something is wrong in how you have set up your pipeline
in ManifoldCF.  If the content length is *not* zero, then something is wrong with how you
have set up Solr.



> UpdateProcessor SolrCloud and ManifoldCF
> ----------------------------------------
>
>                 Key: CONNECTORS-1503
>                 URL: https://issues.apache.org/jira/browse/CONNECTORS-1503
>             Project: ManifoldCF
>          Issue Type: Bug
>          Components: Solr 6.x component
>    Affects Versions: ManifoldCF 2.9.1
>         Environment: SolrCloud 6.6
> ManifoldCF 2.9.1
>            Reporter: Maxence SAUNIER
>            Assignee: Shinichiro Abe
>            Priority: Major
>         Attachments: 20170421-1740.png, jira_update_processor.png, manifoldcf_arguments_uniqFields.png,
manifoldcf_output_conf.zip
>
>
> Hello,
> [Link to Apache mail archive|http://mail-archives.apache.org/mod_mbox/manifoldcf-user/201804.mbox/%3C079e01d3d7da%24807b8f60%248172ae20%24%40citya.com%3E]
> When we used Argument option in ManifoldCF for SolrCloud, ManifoldCF add they arguments
on the POST request and not on the url parameters. So, for add a (pre)processor or a post-processor
with the url, it's not possible.
> [SolrConfig updateRequestProcessorChain|https://lucene.apache.org/solr/guide/6_6/config-api.html#ConfigAPI-Whatabout_updateRequestProcessorChain_]
> [call UpdateRequestProcessors|https://lucene.apache.org/solr/guide/6_6/update-request-processors.html#UpdateRequestProcessors-Processor_Post-ProcessorRequestParameters]
> [Conf image|https://image.ibb.co/cZC8bn/jira_update_processor.png]
> Solr response:
> org.apache.solr.common.SolrException: ERROR: [doc=file://///srvics01/ways_holding/gestion_ged/gerance/3573/201102081135_ENVOIDEVISPP.doc]
unknown field 'processor'



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Mime
View raw message