manifoldcf-dev mailing list archives

From "Karl Wright (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (CONNECTORS-1554) Job stuck during crawl documents on folder
Date Tue, 06 Nov 2018 14:27:01 GMT

    [ https://issues.apache.org/jira/browse/CONNECTORS-1554?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16676820#comment-16676820 ]

Karl Wright commented on CONNECTORS-1554:
-----------------------------------------

Hi [~bisontim], I note the following in your log:

{code}
ERROR 2018-11-06T14:31:47,730 (Agents thread) - Exception tossed: Service 'A' of type 'AGENT_org.apache.manifoldcf.crawler.system.CrawlerAgent is not active
org.apache.manifoldcf.core.interfaces.ManifoldCFException: Service 'A' of type 'AGENT_org.apache.manifoldcf.crawler.system.CrawlerAgent is not active
	at org.apache.manifoldcf.core.lockmanager.BaseLockManager.endServiceActivity(BaseLockManager.java:462) ~[mcf-core.jar:?]
	at org.apache.manifoldcf.core.lockmanager.LockManager.endServiceActivity(LockManager.java:172) ~[mcf-core.jar:?]
	at org.apache.manifoldcf.agents.system.AgentsDaemon.checkAgents(AgentsDaemon.java:289) ~[mcf-agents.jar:?]
	at org.apache.manifoldcf.agents.system.AgentsDaemon$AgentsThread.run(AgentsDaemon.java:209) [mcf-agents.jar:?]
{code}

This makes me concerned that you might not be shutting down the agents process cleanly.  If
you are using file-based synchronization, this could lead to stuck locks, which would explain
the behavior you are seeing quite well.  Can you confirm that you are using ZooKeeper?  Thanks
in advance.
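
For reference, a ZooKeeper-based setup would normally show up in properties.xml
roughly as sketched below.  This is written from memory of the multiprocess
ZooKeeper example configuration, so treat the property names as something to
verify against the deployment documentation; the connect string and timeout
values are placeholder assumptions, not values taken from your installation.

{code}
<!-- Sketch only: check names and values against your own properties.xml -->
<!-- Selects ZooKeeper-based synchronization instead of file-based locks -->
<property name="org.apache.manifoldcf.lockmanagerclass" value="org.apache.manifoldcf.core.lockmanager.ZooKeeperLockManager"/>
<!-- Assumed host:port of the ZooKeeper instance -->
<property name="org.apache.manifoldcf.zookeeper.connectstring" value="localhost:8349"/>
<!-- Assumed session timeout in milliseconds -->
<property name="org.apache.manifoldcf.zookeeper.sessiontimeout" value="300000"/>
{code}

If instead the file contains an org.apache.manifoldcf.synchdirectory property
and no lock manager class, that would suggest file-based synchronization is
in use.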

> Job stuck during crawl documents on folder
> ------------------------------------------
>
>                 Key: CONNECTORS-1554
>                 URL: https://issues.apache.org/jira/browse/CONNECTORS-1554
>             Project: ManifoldCF
>          Issue Type: Bug
>          Components: Active Directory authority, File system connector, Tika extractor
>    Affects Versions: ManifoldCF 2.11
>         Environment: Ubuntu Server 18.04
> ManifoldCF 2.11
> Solr 7.5.0
> Tika Server 1.19.1
>            Reporter: Mario Bisonti
>            Assignee: Karl Wright
>            Priority: Major
>             Fix For: ManifoldCF 2.11
>
>         Attachments: SimpleHistory.png, manifoldcf.log
>
>
> Hello.
> When I start a job that indexes a Windows share, it gets stuck after about 15 minutes.
>  
> I see an error in manifoldcf.log, as you can see in the attachment.
>  
> I attached "Simple History" showing the last documents crawled.
> Thanks a lot.
> Mario
> [^manifoldcf.log]!SimpleHistory.png!
>  



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
