manifoldcf-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Julien Massiera (JIRA)" <j...@apache.org>
Subject [jira] [Comment Edited] (CONNECTORS-1533) Solr Connector is unable to ingest documents
Date Mon, 24 Sep 2018 10:26:00 GMT

    [ https://issues.apache.org/jira/browse/CONNECTORS-1533?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16625612#comment-16625612
] 

Julien Massiera edited comment on CONNECTORS-1533 at 9/24/18 10:25 AM:
-----------------------------------------------------------------------

[~kwright@metacarta.com],

I tested what you wanted, based on trunk :
 HttpPoster.java l291 replaced by this one : 
 solrServer = new HttpSolrClient.Builder(httpSolrServerUrl).withHttpClient(localClient).withResponseParser(new
XMLResponseParser()).allowCompression(allowCompression).build();

HttpPoster.java l173 replaced by this one : 
 final CloudSolrClient cloudSolrServer = new CloudSolrClient.Builder().withZkHost(zookeeperHosts).withLBHttpSolrClient(new
LBHttpSolrClient.Builder().withHttpClient(HttpClientUtil.createClient(null)).build()).build();

Result : document ingestions and deletions are working, Result Code OK for all documents and
they are present in the Solr index after the crawl and removed after the job deletion.

EDIT : Also tested with documents resulting in zero length after Tika parsing !


was (Author: julienfl):
[~kwright@metacarta.com],

I tested what you wanted, based on trunk :
HttpPoster.java l291 replaced by this one : 
solrServer = new HttpSolrClient.Builder(httpSolrServerUrl).withHttpClient(localClient).withResponseParser(new
XMLResponseParser()).allowCompression(allowCompression).build();

HttpPoster.java l173 replaced by this one : 
final CloudSolrClient cloudSolrServer = new CloudSolrClient.Builder().withZkHost(zookeeperHosts).withLBHttpSolrClient(new
LBHttpSolrClient.Builder().withHttpClient(HttpClientUtil.createClient(null)).build()).build();

Result : document ingestions and deletions are working, Result Code OK for all documents and
they are present in the Solr index after the crawl and removed after the job deletion.

> Solr Connector is unable to ingest documents
> --------------------------------------------
>
>                 Key: CONNECTORS-1533
>                 URL: https://issues.apache.org/jira/browse/CONNECTORS-1533
>             Project: ManifoldCF
>          Issue Type: Bug
>          Components: Lucene/SOLR connector
>    Affects Versions: ManifoldCF 2.11
>            Reporter: Julien Massiera
>            Assignee: Karl Wright
>            Priority: Major
>             Fix For: ManifoldCF 2.11
>
>         Attachments: 2018-09-23-012800.png, CONNECTORS-1533.patch
>
>
> The "r69acbd9 - Fix solr connector content deletion bug" has introduced another bug :

> It is now impossible to ingest documents into Solr 7.4.0, we obtain the following error
: Error from server at http://localhost:8983/solr/FileShare: missing content stream
> The fact is, the requestWriter.getContentWriter(request) object is equal to null only
on commit requests. So the new lines of code introduced by the fix, which are based on the
test of this object, result in a null Collection<ContentStream> streams object and
so the update request is failing.
> Concerned class : org.apache.manifoldcf.agents.output.solr.ModifiedHttpSolrClient



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Mime
View raw message