manifoldcf-dev mailing list archives

From "Karl Wright (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (CONNECTORS-1533) Solr Connector is unable to ingest documents
Date Tue, 25 Sep 2018 11:54:00 GMT

    [ https://issues.apache.org/jira/browse/CONNECTORS-1533?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16627214#comment-16627214 ]

Karl Wright commented on CONNECTORS-1533:
-----------------------------------------

The "better fix" turned out not to be better, so I had to revisit it.

The fix now takes into account the situation where URLs get too long for normal PUT-style
operations; the code attempts to use multipart in that situation.  I will still need [~julienFL]
to check out trunk, try it, and confirm that his document with lots of metadata
gets properly indexed.  I'm not worried about deletes now because they all go through the
default pathway, and I've confirmed that extracting update handler requests all do the right
thing and use multipart.  Standard update requests use PUT except when the URL is too long.
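
For anyone following along, the selection rule described above can be sketched roughly as follows. This is only an illustration and is not the code in ModifiedHttpSolrClient: the class name, the Style enum, the helper methods, and the 4096-character limit are all hypothetical, chosen to show the idea of falling back to multipart when the parameter-laden URL gets too long.

```java
import java.io.UnsupportedEncodingException;
import java.net.URLEncoder;
import java.util.Map;

// Hypothetical sketch; none of these names come from the connector source.
final class RequestStyleSketch {
    enum Style { PUT_WITH_URL_PARAMS, MULTIPART }

    // Assumed limit for illustration only; the real threshold may differ.
    static final int MAX_URL_LENGTH = 4096;

    // URL-encode a single key or value as UTF-8.
    static String encode(String s) {
        try {
            return URLEncoder.encode(s, "UTF-8");
        } catch (UnsupportedEncodingException e) {
            throw new IllegalStateException("UTF-8 is always supported", e);
        }
    }

    // Append all parameters (e.g. document metadata) to the base URL as a query string.
    static String buildUrl(String baseUrl, Map<String, String> params) {
        StringBuilder sb = new StringBuilder(baseUrl);
        char separator = '?';
        for (Map.Entry<String, String> e : params.entrySet()) {
            sb.append(separator)
              .append(encode(e.getKey()))
              .append('=')
              .append(encode(e.getValue()));
            separator = '&';
        }
        return sb.toString();
    }

    // Use URL parameters unless the resulting URL exceeds the assumed limit,
    // in which case the parameters would be sent as multipart form parts instead.
    static Style chooseStyle(String baseUrl, Map<String, String> params) {
        return buildUrl(baseUrl, params).length() > MAX_URL_LENGTH
            ? Style.MULTIPART
            : Style.PUT_WITH_URL_PARAMS;
    }
}
```

With this sketch, a document carrying a handful of short metadata fields stays on the PUT path, while one whose encoded metadata pushes the URL past the assumed limit is routed to multipart.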

r1841918
r1841919 (release branch)



> Solr Connector is unable to ingest documents
> --------------------------------------------
>
>                 Key: CONNECTORS-1533
>                 URL: https://issues.apache.org/jira/browse/CONNECTORS-1533
>             Project: ManifoldCF
>          Issue Type: Bug
>          Components: Lucene/SOLR connector
>    Affects Versions: ManifoldCF 2.11
>            Reporter: Julien Massiera
>            Assignee: Karl Wright
>            Priority: Major
>             Fix For: ManifoldCF 2.11
>
>         Attachments: 2018-09-23-012800.png, CONNECTORS-1533.patch
>
>
> The "r69acbd9 - Fix solr connector content deletion bug" has introduced another bug:
> It is now impossible to ingest documents into Solr 7.4.0, we obtain the following error: Error from server at http://localhost:8983/solr/FileShare: missing content stream
> The fact is, the requestWriter.getContentWriter(request) object is equal to null only on commit requests. So the new lines of code introduced by the fix, which are based on the test of this object, result in a null Collection<ContentStream> streams object and so the update request is failing.
> Concerned class: org.apache.manifoldcf.agents.output.solr.ModifiedHttpSolrClient



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
