manifoldcf-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Karl Wright (JIRA)" <>
Subject [jira] [Commented] (CONNECTORS-1554) Job stuck during crawl documents on folder
Date Tue, 06 Nov 2018 14:27:01 GMT


Karl Wright commented on CONNECTORS-1554:

Hi [~bisontim], I note the following in your log:

ERROR 2018-11-06T14:31:47,730 (Agents thread) - Exception tossed: Service 'A' of type 'AGENT_org.apache.manifoldcf.crawler.system.CrawlerAgent
is not active
org.apache.manifoldcf.core.interfaces.ManifoldCFException: Service 'A' of type 'AGENT_org.apache.manifoldcf.crawler.system.CrawlerAgent
is not active
	at org.apache.manifoldcf.core.lockmanager.BaseLockManager.endServiceActivity(
	at org.apache.manifoldcf.core.lockmanager.LockManager.endServiceActivity(
	at org.apache.manifoldcf.agents.system.AgentsDaemon.checkAgents( ~[mcf-agents.jar:?]
	at org.apache.manifoldcf.agents.system.AgentsDaemon$

This makes me concerned that you might not be shutting down the agents process cleanly.  If
you are using file-based synchronization, this could lead to stuck locks, which would explain
the behavior you are seeing quite well.  Can you confirm that you are using zookeeper?  Thanks
in advance.

> Job stuck during crawl documents on folder
> ------------------------------------------
>                 Key: CONNECTORS-1554
>                 URL:
>             Project: ManifoldCF
>          Issue Type: Bug
>          Components: Active Directory authority, File system connector, Tika extractor
>    Affects Versions: ManifoldCF 2.11
>         Environment: Ubuntu Server 18.04
> ManifoldCF 2.11
> Solr 7.5.0
> Tika Server 1.19.1
>            Reporter: Mario Bisonti
>            Assignee: Karl Wright
>            Priority: Major
>             Fix For: ManifoldCF 2.11
>         Attachments: SimpleHistory.png, manifoldcf.log
> Hallo.
> When I start a job that index a Windows Share, it stucks after a 15 minutes near.
> I see error in ManifoldCF.log as you can see in the attachment
> I attached "Simple History" with the last documents crawled.
> Thanks a lot.
> Mario
> [^manifoldcf.log]!SimpleHistory.png!

This message was sent by Atlassian JIRA

View raw message