manifoldcf-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Karl Wright (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (CONNECTORS-1554) Job stuck during crawl documents on folder
Date Tue, 06 Nov 2018 16:02:00 GMT

    [ https://issues.apache.org/jira/browse/CONNECTORS-1554?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16676928#comment-16676928
] 

Karl Wright commented on CONNECTORS-1554:
-----------------------------------------

Hi [~bisontim], you are using file synchronization, as I feared.

This is deprecated.  You really want to be using Zookeeper synchronization.

Furthermore, your process of cleaning the locks is wrong.  The Tomcat web apps you are using
do not include the agents process, and therefore you are cleaning the locks out from under
a running agents process!  That's never going to work.  The proper process is:

(1) shutdown tomcat
(2) shutdown agents process
(3) clean locks
(4) start agents process
(5) start tomcat

You do not need to shut down solr or postgresql for this; in fact, that's counterproductive.


> Job stuck during crawl documents on folder
> ------------------------------------------
>
>                 Key: CONNECTORS-1554
>                 URL: https://issues.apache.org/jira/browse/CONNECTORS-1554
>             Project: ManifoldCF
>          Issue Type: Bug
>          Components: Active Directory authority, File system connector, Tika extractor
>    Affects Versions: ManifoldCF 2.11
>         Environment: Ubuntu Server 18.04
> ManifoldCF 2.11
> Solr 7.5.0
> Tika Server 1.19.1
>            Reporter: Mario Bisonti
>            Assignee: Karl Wright
>            Priority: Major
>             Fix For: ManifoldCF 2.11
>
>         Attachments: SimpleHistory.png, manifoldcf.log
>
>
> Hallo.
> When I start a job that index a Windows Share, it stucks after a 15 minutes near.
>  
> I see error in ManifoldCF.log as you can see in the attachment
>  
> I attached "Simple History" with the last documents crawled.
> Thanks a lot.
> Mario
> [^manifoldcf.log]!SimpleHistory.png!
>  



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Mime
View raw message