manifoldcf-dev mailing list archives

From "Hans Van Goethem (JIRA)" <j...@apache.org>
Subject [jira] [Created] (CONNECTORS-1546) Optimize Elasticsearch performance by removing 'forcemerge'
Date Tue, 16 Oct 2018 13:20:00 GMT
Hans Van Goethem created CONNECTORS-1546:
--------------------------------------------

             Summary: Optimize Elasticsearch performance by removing 'forcemerge'
                 Key: CONNECTORS-1546
                 URL: https://issues.apache.org/jira/browse/CONNECTORS-1546
             Project: ManifoldCF
          Issue Type: Improvement
          Components: Elastic Search connector
            Reporter: Hans Van Goethem


After crawling with ManifoldCF, forcemerge is applied to optimize the Elasticsearch index.
This optimization makes Elasticsearch faster for read operations but not for write operations.
On the contrary, write performance gets worse after every forcemerge.


Can you remove this forcemerge from ManifoldCF to optimize performance for recurrent crawling
to Elasticsearch?

If someone needs this forcemerge, it can be applied manually against Elasticsearch directly.
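
For illustration, a manual forcemerge could be triggered roughly as in the sketch below, which posts to Elasticsearch's `_forcemerge` endpoint for a single index. The base URL `http://localhost:9200`, the index name `my-index`, and both helper function names are assumptions made for this example; they are not part of ManifoldCF or of this issue.

```python
import urllib.parse
import urllib.request
from typing import Optional


def build_forcemerge_request(
    base_url: str, index: str, max_num_segments: Optional[int] = None
) -> urllib.request.Request:
    """Build (but do not send) a POST request to <base_url>/<index>/_forcemerge.

    base_url and index are placeholders supplied by the caller (assumed values).
    """
    url = f"{base_url.rstrip('/')}/{urllib.parse.quote(index, safe='')}/_forcemerge"
    if max_num_segments is not None:
        # Optional query parameter: merge down to at most this many segments.
        url += "?" + urllib.parse.urlencode({"max_num_segments": max_num_segments})
    return urllib.request.Request(url, method="POST")


def force_merge(
    base_url: str, index: str, max_num_segments: Optional[int] = None
) -> str:
    """Send the forcemerge request and return the raw response body as text."""
    request = build_forcemerge_request(base_url, index, max_num_segments)
    with urllib.request.urlopen(request) as response:
        return response.read().decode("utf-8")
```

Calling `force_merge("http://localhost:9200", "my-index", 1)` would then run the merge on demand, for example once after a large initial crawl, instead of after every crawl.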



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
