Date: Thu, 26 Jul 2018 18:44:00 +0000 (UTC)
From: "Steph van Schalkwyk (JIRA)"
To: dev@manifoldcf.apache.org
Reply-To: dev@manifoldcf.apache.org
Subject: [jira] [Updated] (CONNECTORS-1518) MCF shutting down when Tika is used

[ https://issues.apache.org/jira/browse/CONNECTORS-1518?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Steph van Schalkwyk updated CONNECTORS-1518:
--------------------------------------------
Description:
```
Jul 26, 2018 1:21:51 PM org.apache.tika.config.InitializableProblemHandler$3 handleInitializableProblem
WARNING: org.xerial's sqlite-jdbc is not loaded.
Please provide the jar on your classpath to parse sqlite files.
See tika-parsers/pom.xml for the correct version.
agents process ran out of memory - shutting down
java.lang.OutOfMemoryError: Java heap space
    at java.base/java.util.Arrays.copyOf(Arrays.java:3816)
    at java.base/java.util.BitSet.ensureCapacity(BitSet.java:338)
    at java.base/java.util.BitSet.expandTo(BitSet.java:353)
    at java.base/java.util.BitSet.set(BitSet.java:448)
    at de.l3s.boilerpipe.sax.BoilerpipeHTMLContentHandler.characters(BoilerpipeHTMLContentHandler.java:267)
    at org.apache.tika.parser.html.BoilerpipeContentHandler.characters(BoilerpipeContentHandler.java:155)
    at org.apache.tika.sax.ContentHandlerDecorator.characters(ContentHandlerDecorator.java:146)
    at org.apache.tika.sax.SecureContentHandler.characters(SecureContentHandler.java:270)
    at org.apache.tika.sax.ContentHandlerDecorator.characters(ContentHandlerDecorator.java:146)
    at org.apache.tika.sax.ContentHandlerDecorator.characters(ContentHandlerDecorator.java:146)
    at org.apache.tika.sax.ContentHandlerDecorator.characters(ContentHandlerDecorator.java:146)
    at org.apache.tika.sax.SafeContentHandler.access$001(SafeContentHandler.java:46)
    at org.apache.tika.sax.SafeContentHandler$1.write(SafeContentHandler.java:82)
    at org.apache.tika.sax.SafeContentHandler.filter(SafeContentHandler.java:140)
    at org.apache.tika.sax.SafeContentHandler.characters(SafeContentHandler.java:287)
    at org.apache.tika.sax.XHTMLContentHandler.characters(XHTMLContentHandler.java:279)
    at org.apache.tika.sax.XHTMLContentHandler.characters(XHTMLContentHandler.java:306)
    at org.apache.tika.parser.microsoft.TextCell.render(TextCell.java:34)
    at org.apache.tika.parser.microsoft.ExcelExtractor$TikaHSSFListener.processSheet(ExcelExtractor.java:609)
    at org.apache.tika.parser.microsoft.ExcelExtractor$TikaHSSFListener.internalProcessRecord(ExcelExtractor.java:392)
    at org.apache.tika.parser.microsoft.ExcelExtractor$TikaHSSFListener.processRecord(ExcelExtractor.java:343)
    at org.apache.poi.hssf.eventusermodel.FormatTrackingHSSFListener.processRecord(FormatTrackingHSSFListener.java:92)
    at org.apache.poi.hssf.eventusermodel.HSSFRequest.processRecord(HSSFRequest.java:109)
    at org.apache.poi.hssf.eventusermodel.HSSFEventFactory.genericProcessEvents(HSSFEventFactory.java:179)
    at org.apache.poi.hssf.eventusermodel.HSSFEventFactory.processEvents(HSSFEventFactory.java:136)
    at org.apache.tika.parser.microsoft.ExcelExtractor$TikaHSSFListener.processFile(ExcelExtractor.java:319)
    at org.apache.tika.parser.microsoft.ExcelExtractor.parse(ExcelExtractor.java:170)
    at org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java:184)
    at org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java:132)
    at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280)
    at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280)
    at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:143)
[Thread-475] INFO org.eclipse.jetty.server.ServerConnector - Stopped ServerConnector@37095ded{HTTP/1.1}{0.0.0.0:8345}
[Thread-475] INFO org.eclipse.jetty.server.handler.ContextHandler - Stopped o.e.j.w.WebAppContext@5a6d5a8f{/mcf-api-service,[file:/tmp/jetty-0.0.0.0-8345-mcf-api-service.war-_mcf-api-service-any-14189461872304124764.dir/webapp/,UNAVAILABLE|file:///tmp/jetty-0.0.0.0-8345-mcf-api-service.war-_mcf-api-service-any-14189461872304124764.dir/webapp/,UNAVAILABLE]}{/opt/manifoldcf/manifoldcf_single/././web/war/mcf-api-service.war}
[Thread-475] INFO org.eclipse.jetty.server.handler.ContextHandler - Stopped o.e.j.w.WebAppContext@6979efad{/mcf-authority-service,[file:/tmp/jetty-0.0.0.0-8345-mcf-authority-service.war-_mcf-authority-service-any-11619445383548662284.dir/webapp/,UNAVAILABLE|file:///tmp/jetty-0.0.0.0-8345-mcf-authority-service.war-_mcf-authority-service-any-11619445383548662284.dir/webapp/,UNAVAILABLE]}{/opt/manifoldcf/manifoldcf_single/././web/war/mcf-authority-service.war}
2018-07-26 13:22:47,170 qtp2061226112-492 FATAL Unable to register shutdown hook because JVM is shutting down. java.lang.IllegalStateException: Cannot add new shutdown hook as this is not started. Current state: STOPPED
    at org.apache.logging.log4j.core.util.DefaultShutdownCallbackRegistry.addShutdownCallback(DefaultShutdownCallbackRegistry.java:113)
    at org.apache.logging.log4j.core.impl.Log4jContextFactory.addShutdownCallback(Log4jContextFactory.java:271)
    at org.apache.logging.log4j.core.LoggerContext.setUpShutdownHook(LoggerContext.java:256)
    at org.apache.logging.log4j.core.LoggerContext.start(LoggerContext.java:216)
    at org.apache.logging.log4j.core.impl.Log4jContextFactory.getContext(Log4jContextFactory.java:146)
    at org.apache.logging.log4j.core.impl.Log4jContextFactory.getContext(Log4jContextFactory.java:41)
    at org.apache.logging.log4j.LogManager.getContext(LogManager.java:270)
    at org.apache.log4j.Logger$PrivateManager.getContext(Logger.java:59)
    at org.apache.log4j.Logger.getLogger(Logger.java:37)
    at org.apache.velocity.runtime.log.Log4JLogChute.init(Log4JLogChute.java:72)
    at org.apache.velocity.runtime.log.LogManager.createLogChute(LogManager.java:157)
    at org.apache.velocity.runtime.log.LogManager.updateLog(LogManager.java:269)
    at org.apache.velocity.runtime.RuntimeInstance.initializeLog(RuntimeInstance.java:871)
    at org.apache.velocity.runtime.RuntimeInstance.init(RuntimeInstance.java:262)
    at org.apache.velocity.runtime.RuntimeInstance.requireInitialization(RuntimeInstance.java:302)
    at org.apache.velocity.runtime.RuntimeInstance.getTemplate(RuntimeInstance.java:1531)
    at org.apache.velocity.app.VelocityEngine.mergeTemplate(VelocityEngine.java:343)
    at org.apache.manifoldcf.ui.i18n.Messages.outputResourceWithVelocity(Messages.java:159)
    at org.apache.manifoldcf.agents.transformation.tika.Messages.outputResourceWithVelocity(Messages.java:136)
    at org.apache.manifoldcf.agents.transformation.tika.TikaExtractor.outputSpecificationBody(TikaExtractor.java:544)
    at org.apache.jsp.editjob_jsp._jspService(editjob_jsp.java:3002)
    at org.apache.jasper.runtime.HttpJspBase.service(HttpJspBase.java:70)
    at javax.servlet.http.HttpServlet.service(HttpServlet.java:790)
    at org.apache.jasper.servlet.JspServletWrapper.service(JspServletWrapper.java:388)
    at org.apache.jasper.servlet.JspServlet.serviceJspFile(JspServlet.java:313)
    at org.apache.jasper.servlet.JspServlet.service(JspServlet.java:260)
    at javax.servlet.http.HttpServlet.service(HttpServlet.java:790)
    at org.eclipse.jetty.servlet.ServletHolder.handle(ServletHolder.java:769)
    at org.eclipse.jetty.servlet.ServletHandler.doHandle(ServletHandler.java:585)
    at org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.java:143)
    at org.eclipse.jetty.security.SecurityHandler.handle(SecurityHandler.java:577)
    at org.eclipse.jetty.server.session.SessionHandler.doHandle(SessionHandler.java:223)
    at org.eclipse.jetty.server.handler.ContextHandler.doHandle(ContextHandler.java:1125)
    at org.eclipse.jetty.servlet.ServletHandler.doScope(ServletHandler.java:515)
    at org.eclipse.jetty.server.session.SessionHandler.doScope(SessionHandler.java:185)
    at org.eclipse.jetty.server.handler.ContextHandler.doScope(ContextHandler.java:1059)
    at org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.java:141)
    at org.eclipse.jetty.server.handler.ContextHandlerCollection.handle(ContextHandlerCollection.java:215)
    at org.eclipse.jetty.server.handler.HandlerList.handle(HandlerList.java:52)
    at org.eclipse.jetty.server.handler.HandlerWrapper.handle(HandlerWrapper.java:97)
    at org.eclipse.jetty.server.Server.handle(Server.java:497)
    at org.eclipse.jetty.server.HttpChannel.handle(HttpChannel.java:311)
    at org.eclipse.jetty.server.HttpConnection.onFillable(HttpConnection.java:248)
    at org.eclipse.jetty.io.AbstractConnection$2.run(AbstractConnection.java:540)
    at org.eclipse.jetty.util.thread.QueuedThreadPool.runJob(QueuedThreadPool.java:610)
    at org.eclipse.jetty.util.thread.QueuedThreadPool$3.run(QueuedThreadPool.java:539)
    at java.base/java.lang.Thread.run(Thread.java:844)
[Worker thread '35'] WARN org.apache.tika.parser.microsoft.AbstractPOIFSExtractor - Ignoring unexpected exception while parsing summary entry SummaryInformation
java.lang.RuntimeException: java.nio.channels.ClosedByInterruptException
    at org.apache.poi.poifs.filesystem.NPOIFSStream$StreamBlockByteBufferIterator.<init>(NPOIFSStream.java:151)
    at org.apache.poi.poifs.filesystem.NPOIFSStream.getBlockIterator(NPOIFSStream.java:95)
    at org.apache.poi.poifs.filesystem.NPOIFSDocument.getBlockIterator(NPOIFSDocument.java:179)
    at org.apache.poi.poifs.filesystem.NDocumentInputStream.<init>(NDocumentInputStream.java:82)
    at org.apache.poi.poifs.filesystem.DocumentInputStream.<init>(DocumentInputStream.java:65)
    at org.apache.tika.parser.microsoft.SummaryExtractor.parseSummaryEntryIfExists(SummaryExtractor.java:83)
    at org.apache.tika.parser.microsoft.SummaryExtractor.parseSummaries(SummaryExtractor.java:73)
    at org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java:156)
    at org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java:132)
    at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280)
    at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280)
    at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:143)
    at org.apache.manifoldcf.agents.transformation.tika.TikaParser.parse(TikaParser.java:74)
    at org.apache.manifoldcf.agents.transformation.tika.TikaExtractor.addOrReplaceDocumentWithException(TikaExtractor.java:235)
    at org.apache.manifoldcf.agents.incrementalingest.IncrementalIngester$PipelineAddEntryPoint.addOrReplaceDocumentWithException(IncrementalIngester.java:3226)
    at org.apache.manifoldcf.agents.incrementalingest.IncrementalIngester$PipelineAddFanout.sendDocument(IncrementalIngester.java:3077)
    at org.apache.manifoldcf.agents.incrementalingest.IncrementalIngester$PipelineObjectWithVersions.addOrReplaceDocumentWithException(IncrementalIngester.java:2708)
    at org.apache.manifoldcf.agents.incrementalingest.IncrementalIngester.documentIngest(IncrementalIngester.java:756)
    at org.apache.manifoldcf.crawler.system.WorkerThread$ProcessActivity.ingestDocumentWithException(WorkerThread.java:1583)
    at org.apache.manifoldcf.crawler.system.WorkerThread$ProcessActivity.ingestDocumentWithException(WorkerThread.java:1548)
    at org.apache.manifoldcf.crawler.connectors.filesystem.FileConnector.processDocuments(FileConnector.java:448)
    at org.apache.manifoldcf.crawler.system.WorkerThread.run(WorkerThread.java:399)
Caused by: java.nio.channels.ClosedByInterruptException
    at java.base/java.nio.channels.spi.AbstractInterruptibleChannel.end(AbstractInterruptibleChannel.java:199)
    at java.base/sun.nio.ch.FileChannelImpl.size(FileChannelImpl.java:388)
    at org.apache.poi.poifs.nio.FileBackedDataSource.size(FileBackedDataSource.java:137)
    at org.apache.poi.poifs.filesystem.NPOIFSFileSystem.getChainLoopDetector(NPOIFSFileSystem.java:627)
    at org.apache.poi.poifs.filesystem.NPOIFSStream$StreamBlockByteBufferIterator.<init>(NPOIFSStream.java:149)
    ... 21 more
[Worker thread '35'] WARN org.apache.tika.parser.microsoft.AbstractPOIFSExtractor - Ignoring unexpected exception while parsing summary entry DocumentSummaryInformation
java.lang.RuntimeException: java.nio.channels.ClosedChannelException
    at org.apache.poi.poifs.filesystem.NPOIFSStream$StreamBlockByteBufferIterator.<init>(NPOIFSStream.java:151)
    at org.apache.poi.poifs.filesystem.NPOIFSStream.getBlockIterator(NPOIFSStream.java:95)
    at org.apache.poi.poifs.filesystem.NPOIFSMiniStore.getBlockAt(NPOIFSMiniStore.java:67)
    at org.apache.poi.poifs.filesystem.NPOIFSStream$StreamBlockByteBufferIterator.next(NPOIFSStream.java:169)
    at org.apache.poi.poifs.filesystem.NPOIFSStream$StreamBlockByteBufferIterator.next(NPOIFSStream.java:142)
    at org.apache.poi.poifs.filesystem.NDocumentInputStream.readFully(NDocumentInputStream.java:264)
    at org.apache.poi.poifs.filesystem.NDocumentInputStream.read(NDocumentInputStream.java:162)
    at org.apache.poi.poifs.filesystem.DocumentInputStream.read(DocumentInputStream.java:127)
    at org.apache.poi.util.BoundedInputStream.read(BoundedInputStream.java:121)
    at org.apache.poi.util.BoundedInputStream.read(BoundedInputStream.java:103)
    at org.apache.poi.util.IOUtils.copy(IOUtils.java:312)
    at org.apache.poi.util.IOUtils.peekFirstNBytes(IOUtils.java:70)
    at org.apache.poi.hpsf.PropertySet.isPropertySetStream(PropertySet.java:393)
    at org.apache.poi.hpsf.PropertySet.<init>(PropertySet.java:191)
    at org.apache.tika.parser.microsoft.SummaryExtractor.parseSummaryEntryIfExists(SummaryExtractor.java:83)
    at org.apache.tika.parser.microsoft.SummaryExtractor.parseSummaries(SummaryExtractor.java:74)
    at org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java:156)
    at org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java:132)
    at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280)
    at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280)
    at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:143)
    at org.apache.manifoldcf.agents.transformation.tika.TikaParser.parse(TikaParser.java:74)
    at org.apache.manifoldcf.agents.transformation.tika.TikaExtractor.addOrReplaceDocumentWithException(TikaExtractor.java:235)
    at org.apache.manifoldcf.agents.incrementalingest.IncrementalIngester$PipelineAddEntryPoint.addOrReplaceDocumentWithException(IncrementalIngester.java:3226)
    at org.apache.manifoldcf.agents.incrementalingest.IncrementalIngester$PipelineAddFanout.sendDocument(IncrementalIngester.java:3077)
    at org.apache.manifoldcf.agents.incrementalingest.IncrementalIngester$PipelineObjectWithVersions.addOrReplaceDocumentWithException(IncrementalIngester.java:2708)
    at org.apache.manifoldcf.agents.incrementalingest.IncrementalIngester.documentIngest(IncrementalIngester.java:756)
    at org.apache.manifoldcf.crawler.system.WorkerThread$ProcessActivity.ingestDocumentWithException(WorkerThread.java:1583)
    at org.apache.manifoldcf.crawler.system.WorkerThread$ProcessActivity.ingestDocumentWithException(WorkerThread.java:1548)
    at org.apache.manifoldcf.crawler.connectors.filesystem.FileConnector.processDocuments(FileConnector.java:448)
    at org.apache.manifoldcf.crawler.system.WorkerThread.run(WorkerThread.java:399)
Caused by: java.nio.channels.ClosedChannelException
    at java.base/sun.nio.ch.FileChannelImpl.ensureOpen(FileChannelImpl.java:158)
    at java.base/sun.nio.ch.FileChannelImpl.size(FileChannelImpl.java:373)
    at org.apache.poi.poifs.nio.FileBackedDataSource.size(FileBackedDataSource.java:137)
    at org.apache.poi.poifs.filesystem.NPOIFSFileSystem.getChainLoopDetector(NPOIFSFileSystem.java:627)
    at org.apache.poi.poifs.filesystem.NPOIFSStream$StreamBlockByteBufferIterator.<init>(NPOIFSStream.java:149)
    ... 30 more
```
Following up: When these exceptions occur, the heap runs out:
13:39:39.856 [Worker thread '49'] WARN org.apache.manifoldcf.jobs - Service interruption reported for job 1532551209410 connection 'file': IO exception: null
13:39:39.970 [Worker thread '43'] WARN org.apache.manifoldcf.jobs - Service interruption reported for job 1532551209410 connection 'file': IO exception: null
13:39:40.415 [Worker thread '34'] WARN org.apache.manifoldcf.jobs - Service interruption reported for job 1532551209410 connection 'file': IO exception: null
13:39:40.469 [Worker thread '1'] WARN org.apache.manifoldcf.jobs - Service interruption reported for job 1532551209410 connection 'file': IO exception: null
13:39:43.739 [Worker thread '32'] WARN org.apache.manifoldcf.jobs - Service interruption reported for job 1532551209410 connection 'file': IO exception: null
13:39:44.697 [Worker thread '43'] WARN org.apache.manifoldcf.jobs - Service interruption reported for job 1532551209410 connection 'file': IO exception: null
13:39:45.756 [Worker thread '33'] WARN org.apache.manifoldcf.jobs - Service interruption reported for job 1532551209410 connection 'file': IO exception: null
13:39:45.775 [Worker thread '36'] WARN org.apache.manifoldcf.jobs - Service interruption reported for job 1532551209410 connection 'file': IO exception: null
13:39:46.751 [Worker thread '35'] WARN org.apache.manifoldcf.jobs - Service interruption reported for job 1532551209410 connection 'file': IO exception: null
13:39:46.753 [Worker thread '40'] WARN org.apache.manifoldcf.jobs - Service interruption reported for job 1532551209410 connection 'file': IO exception: null
13:39:47.536 [Worker thread '45'] WARN org.apache.manifoldcf.jobs - Service interruption reported for job 1532551209410 connection 'file': IO exception: null
13:39:48.734 [Worker thread '44'] WARN org.apache.manifoldcf.jobs - Service interruption reported for job 1532551209410 connection 'file': IO exception: null
13:39:50.922 [Worker thread '30'] WARN org.apache.manifoldcf.jobs - Service interruption reported for job 1532551209410 connection 'file': IO exception: null
13:39:54.930 [Worker thread '28'] WARN org.apache.manifoldcf.jobs - Service interruption reported for job 1532551209410 connection 'file': IO exception: null
13:40:33.660 [Worker thread '29'] WARN org.apache.manifoldcf.jobs - Service interruption reported for job 1532551209410 connection 'file': IO exception: null
agents process ran out of memory - shutting down
java.lang.OutOfMemoryError: Java heap space
    at java.base/java.lang.StringLatin1.newString(StringLatin1.java:549)
    at java.base/java.lang.StringBuilder.toString(StringBuilder.java:415)
    at de.l3s.boilerpipe.sax.BoilerpipeHTMLContentHandler.flushBlock(BoilerpipeHTMLContentHandler.java:341)
    at de.l3s.boilerpipe.sax.BoilerpipeHTMLContentHandler.characters(BoilerpipeHTMLContentHandler.java:198)
    at org.apache.tika.parser.html.BoilerpipeContentHandler.characters(BoilerpipeContentHandler.java:155)
    at org.apache.tika.sax.ContentHandlerDecorator.characters(ContentHandlerDecorator.java:146)
    at org.apache.tika.sax.SecureContentHandler.characters(SecureContentHandler.java:270)
    at org.apache.tika.sax.ContentHandlerDecorator.characters(ContentHandlerDecorator.java:146)
    at org.apache.tika.sax.ContentHandlerDecorator.characters(ContentHandlerDecorator.java:146)
    at org.apache.tika.sax.ContentHandlerDecorator.characters(ContentHandlerDecorator.java:146)
    at org.apache.tika.sax.SafeContentHandler.access$001(SafeContentHandler.java:46)
    at org.apache.tika.sax.SafeContentHandler$1.write(SafeContentHandler.java:82)
    at org.apache.tika.sax.SafeContentHandler.filter(SafeContentHandler.java:140)
    at org.apache.tika.sax.SafeContentHandler.characters(SafeContentHandler.java:287)
    at org.apache.tika.sax.XHTMLContentHandler.characters(XHTMLContentHandler.java:279)
    at org.apache.tika.sax.XHTMLContentHandler.characters(XHTMLContentHandler.java:306)
    at org.apache.tika.parser.microsoft.TextCell.render(TextCell.java:34)
    at org.apache.tika.parser.microsoft.ExcelExtractor$TikaHSSFListener.processSheet(ExcelExtractor.java:609)
    at org.apache.tika.parser.microsoft.ExcelExtractor$TikaHSSFListener.internalProcessRecord(ExcelExtractor.java:392)
    at org.apache.tika.parser.microsoft.ExcelExtractor$TikaHSSFListener.processRecord(ExcelExtractor.java:343)
    at org.apache.poi.hssf.eventusermodel.FormatTrackingHSSFListener.processRecord(FormatTrackingHSSFListener.java:92)
    at org.apache.poi.hssf.eventusermodel.HSSFRequest.processRecord(HSSFRequest.java:109)
    at org.apache.poi.hssf.eventusermodel.HSSFEventFactory.genericProcessEvents(HSSFEventFactory.java:179)
    at org.apache.poi.hssf.eventusermodel.HSSFEventFactory.processEvents(HSSFEventFactory.java:136)
    at org.apache.tika.parser.microsoft.ExcelExtractor$TikaHSSFListener.processFile(ExcelExtractor.java:319)
    at org.apache.tika.parser.microsoft.ExcelExtractor.parse(ExcelExtractor.java:170)
    at org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java:184)
    at org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java:132)
    at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280)
    at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280)
    at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:143)
    at org.apache.manifoldcf.agents.transformation.tika.TikaParser.parse(TikaParser.java:74)
agents process ran out of memory - shutting down
java.lang.OutOfMemoryError: Java heap space
    at java.base/java.util.Arrays.copyOf(Arrays.java:3744)
    at java.base/java.lang.AbstractStringBuilder.ensureCapacityInternal(AbstractStringBuilder.java:146)
    at java.base/java.lang.AbstractStringBuilder.append(AbstractStringBuilder.java:531)
    at java.base/java.lang.AbstractStringBuilder.append(AbstractStringBuilder.java:550)
    at java.base/java.lang.StringBuilder.append(StringBuilder.java:171)
    at java.base/java.util.regex.Matcher.appendReplacement(Matcher.java:1002)
    at java.base/java.util.regex.Matcher.replaceAll(Matcher.java:1181)
    at de.l3s.boilerpipe.util.UnicodeTokenizer.tokenize(UnicodeTokenizer.java:40)
    at de.l3s.boilerpipe.sax.BoilerpipeHTMLContentHandler.flushBlock(BoilerpipeHTMLContentHandler.java:296)
    at de.l3s.boilerpipe.sax.CommonTagActions$3.end(CommonTagActions.java:143)
    at de.l3s.boilerpipe.sax.BoilerpipeHTMLContentHandler.endElement(BoilerpipeHTMLContentHandler.java:183)
    at org.apache.tika.parser.html.BoilerpipeContentHandler.endElement(BoilerpipeContentHandler.java:175)
    at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandlerDecorator.java:136)
    at org.apache.tika.sax.SecureContentHandler.endElement(SecureContentHandler.java:256)
    at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandlerDecorator.java:136)
    at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandlerDecorator.java:136)
    at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandlerDecorator.java:136)
    at org.apache.tika.sax.SafeContentHandler.endElement(SafeContentHandler.java:273)
    at org.apache.tika.sax.XHTMLContentHandler.endDocument(XHTMLContentHandler.java:224)
    at org.apache.tika.parser.txt.TXTParser.parse(TXTParser.java:109)
    at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280)
    at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280)
    at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:143)
    at org.apache.manifoldcf.agents.transformation.tika.TikaParser.parse(TikaParser.java:74)
    at org.apache.manifoldcf.agents.transformation.tika.TikaExtractor.addOrReplaceDocumentWithException(TikaExtractor.java:235)
    at org.apache.manifoldcf.agents.incrementalingest.IncrementalIngester$PipelineAddEntryPoint.addOrReplaceDocumentWithException(IncrementalIngester.java:3226)
    at org.apache.manifoldcf.agents.incrementalingest.IncrementalIngester$PipelineAddFanout.sendDocument(IncrementalIngester.java:3077)
    at org.apache.manifoldcf.agents.incrementalingest.IncrementalIngester$PipelineObjectWithVersions.addOrReplaceDocumentWithException(IncrementalIngester.java:2708)
    at org.apache.manifoldcf.agents.incrementalingest.IncrementalIngester.documentIngest(IncrementalIngester.java:756)
    at org.apache.manifoldcf.crawler.system.WorkerThread$ProcessActivity.ingestDocumentWithException(WorkerThread.java:1583)
    at org.apache.manifoldcf.crawler.system.WorkerThread$ProcessActivity.ingestDocumentWithException(WorkerThread.java:1548)
    at org.apache.manifoldcf.crawler.connectors.filesystem.FileConnector.processDocuments(FileConnector.java:448)
13:40:33.995 [Worker thread '42'] WARN org.apache.manifoldcf.jobs - Service interruption reported for job 1532551209410 connection 'file': IO exception: null
[Thread-475] INFO org.eclipse.jetty.server.ServerConnector - Stopped ServerConnector@5d235104{HTTP/1.1}{0.0.0.0:8345}
[Thread-475] INFO org.eclipse.jetty.server.handler.ContextHandler - Stopped o.e.j.w.WebAppContext@6105f8a3{/mcf-api-service,[file:/tmp/jetty-0.0.0.0-8345-mcf-api-service.war-_mcf-api-service-any-9896962439762567079.dir/webapp/,UNAVAILABLE|file:///tmp/jetty-0.0.0.0-8345-mcf-api-service.war-_mcf-api-service-any-9896962439762567079.dir/webapp/,UNAVAILABLE]}{/opt/manifoldcf/manifoldcf_single/././web/war/mcf-api-service.war}
[Thread-475] INFO org.eclipse.jetty.server.handler.ContextHandler - Stopped o.e.j.w.WebAppContext@12365c88{/mcf-authority-service,[file:/tmp/jetty-0.0.0.0-8345-mcf-authority-service.war-_mcf-authority-service-any-3954308360064638561.dir/webapp/,UNAVAILABLE|file:///tmp/jetty-0.0.0.0-8345-mcf-authority-service.war-_mcf-authority-service-any-3954308360064638561.dir/webapp/,UNAVAILABLE]}{/opt/manifoldcf/manifoldcf_single/././web/war/mcf-authority-service.war}

Follow-up: When these issues occur, the jvm runs out of space:
13:39:39.856 [Worker thread '49'] WARN org.apache.manifoldcf.jobs - Service interruption reported for job 1532551209410 connection 'file': IO exception: null
13:39:39.970 [Worker thread '43'] WARN org.apache.manifoldcf.jobs - Service interruption reported for job 1532551209410 connection 'file': IO exception: null
13:39:40.415 [Worker thread '34'] WARN org.apache.manifoldcf.jobs - Service interruption reported for job 1532551209410 connection 'file': IO exception: null
13:39:40.469 [Worker thread '1'] WARN org.apache.manifoldcf.jobs - Service interruption reported for job 1532551209410 connection 'file': IO exception: null
13:39:43.739 [Worker thread '32'] WARN org.apache.manifoldcf.jobs - Service interruption reported for job 1532551209410 connection 'file': IO exception: null
13:39:44.697 [Worker thread '43'] WARN org.apache.manifoldcf.jobs - Service interruption reported for job 1532551209410 connection 'file': IO exception: null
13:39:45.756 [Worker thread '33'] WARN org.apache.manifoldcf.jobs - Service interruption reported for job 1532551209410 connection 'file': IO exception: null
13:39:45.775 [Worker thread '36'] WARN org.apache.manifoldcf.jobs - Service interruption reported for job 1532551209410 connection 'file': IO exception: null
13:39:46.751 [Worker thread '35'] WARN org.apache.manifoldcf.jobs - Service interruption reported for job 1532551209410 connection 'file': IO exception: null
13:39:46.753 [Worker thread '40'] WARN org.apache.manifoldcf.jobs - Service interruption reported for job 1532551209410 connection 'file': IO exception: null
13:39:47.536 [Worker thread '45'] WARN org.apache.manifoldcf.jobs - Service interruption reported for job 1532551209410 connection 'file': IO exception: null
13:39:48.734 [Worker thread '44'] WARN org.apache.manifoldcf.jobs - Service interruption reported for job 1532551209410 connection 'file': IO exception: null
13:39:50.922 [Worker thread '30'] WARN org.apache.manifoldcf.jobs - Service interruption reported for job 1532551209410 connection 'file': IO exception: null
13:39:54.930 [Worker thread '28'] WARN org.apache.manifoldcf.jobs - Service interruption reported for job 1532551209410 connection 'file': IO exception: null
13:40:33.660 [Worker thread '29'] WARN org.apache.manifoldcf.jobs - Service interruption reported for job 1532551209410 connection 'file': IO exception: null
agents process ran out of memory - shutting down
java.lang.OutOfMemoryError: Java heap space
    at java.base/java.lang.StringLatin1.newString(StringLatin1.java:549)
    at java.base/java.lang.StringBuilder.toString(StringBuilder.java:415)
    at de.l3s.boilerpipe.sax.BoilerpipeHTMLContentHandler.flushBlock(BoilerpipeHTMLContentHandler.java:341)
    at de.l3s.boilerpipe.sax.BoilerpipeHTMLContentHandler.characters(BoilerpipeHTMLContentHandler.java:198)
    at org.apache.tika.parser.html.BoilerpipeContentHandler.characters(BoilerpipeContentHandler.java:155)
    at org.apache.tika.sax.ContentHandlerDecorator.characters(ContentHandlerDecorator.java:146)
    at org.apache.tika.sax.SecureContentHandler.characters(SecureContentHandler.java:270)
    at org.apache.tika.sax.ContentHandlerDecorator.characters(ContentHandlerDecorator.java:146)
    at org.apache.tika.sax.ContentHandlerDecorator.characters(ContentHandlerDecorator.java:146)
    at org.apache.tika.sax.ContentHandlerDecorator.characters(ContentHandlerDecorator.java:146)
    at org.apache.tika.sax.SafeContentHandler.access$001(SafeContentHandler.java:46)
    at org.apache.tika.sax.SafeContentHandler$1.write(SafeContentHandler.java:82)
    at org.apache.tika.sax.SafeContentHandler.filter(SafeContentHandler.java:140)
    at org.apache.tika.sax.SafeContentHandler.characters(SafeContentHandler.java:287)
    at org.apache.tika.sax.XHTMLContentHandler.characters(XHTMLContentHandler.java:279)
    at org.apache.tika.sax.XHTMLContentHandler.characters(XHTMLContentHandler.java:306)
    at org.apache.tika.parser.microsoft.TextCell.render(TextCell.java:34)
    at org.apache.tika.parser.microsoft.ExcelExtractor$TikaHSSFListener.processSheet(ExcelExtractor.java:609)
    at org.apache.tika.parser.microsoft.ExcelExtractor$TikaHSSFListener.internalProcessRecord(ExcelExtractor.java:392)
    at org.apache.tika.parser.microsoft.ExcelExtractor$TikaHSSFListener.processRecord(ExcelExtractor.java:343)
    at org.apache.poi.hssf.eventusermodel.FormatTrackingHSSFListener.processRecord(FormatTrackingHSSFListener.java:92)
    at org.apache.poi.hssf.eventusermodel.HSSFRequest.processRecord(HSSFRequest.java:109)
    at org.apache.poi.hssf.eventusermodel.HSSFEventFactory.genericProcessEvents(HSSFEventFactory.java:179)
    at org.apache.poi.hssf.eventusermodel.HSSFEventFactory.processEvents(HSSFEventFactory.java:136)
    at org.apache.tika.parser.microsoft.ExcelExtractor$TikaHSSFListener.processFile(ExcelExtractor.java:319)
    at org.apache.tika.parser.microsoft.ExcelExtractor.parse(ExcelExtractor.java:170)
    at org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java:184)
    at org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java:132)
    at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280)
    at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280)
    at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:143)
    at org.apache.manifoldcf.agents.transformation.tika.TikaParser.parse(TikaParser.java:74)
agents process ran out of memory - shutting down
java.lang.OutOfMemoryError: Java heap space
    at java.base/java.util.Arrays.copyOf(Arrays.java:3744)
    at java.base/java.lang.AbstractStringBuilder.ensureCapacityInternal(AbstractStringBuilder.java:146)
    at java.base/java.lang.AbstractStringBuilder.append(AbstractStringBuilder.java:531)
    at java.base/java.lang.AbstractStringBuilder.append(AbstractStringBuilder.java:550)
    at java.base/java.lang.StringBuilder.append(StringBuilder.java:171)
    at java.base/java.util.regex.Matcher.appendReplacement(Matcher.java:1002)
    at java.base/java.util.regex.Matcher.replaceAll(Matcher.java:1181)
    at de.l3s.boilerpipe.util.UnicodeTokenizer.tokenize(UnicodeTokenizer.java:40)
    at de.l3s.boilerpipe.sax.BoilerpipeHTMLContentHandler.flushBlock(BoilerpipeHTMLContentHandler.java:296)
    at de.l3s.boilerpipe.sax.CommonTagActions$3.end(CommonTagActions.java:143)
    at de.l3s.boilerpipe.sax.BoilerpipeHTMLContentHandler.endElement(BoilerpipeHTMLContentHandler.java:183)
    at org.apache.tika.parser.html.BoilerpipeContentHandler.endElement(BoilerpipeContentHandler.java:175)
    at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandlerDecorator.java:136)
    at org.apache.tika.sax.SecureContentHandler.endElement(SecureContentHandler.java:256)
    at 
org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandlerDe= corator.java:136) at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandlerDe= corator.java:136) at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandlerDe= corator.java:136) at org.apache.tika.sax.SafeContentHandler.endElement(SafeContentHandler.ja= va:273) at org.apache.tika.sax.XHTMLContentHandler.endDocument(XHTMLContentHandler= .java:224) at org.apache.tika.parser.txt.TXTParser.parse(TXTParser.java:109) at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280) at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280) at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:143= ) at org.apache.manifoldcf.agents.transformation.tika.TikaParser.parse(TikaP= arser.java:74) at org.apache.manifoldcf.agents.transformation.tika.TikaExtractor.addOrRep= laceDocumentWithException(TikaExtractor.java:235) at org.apache.manifoldcf.agents.incrementalingest.IncrementalIngester$Pipe= lineAddEntryPoint.addOrReplaceDocumentWithException(IncrementalIngester.jav= a:3226) at org.apache.manifoldcf.agents.incrementalingest.IncrementalIngester$Pipe= lineAddFanout.sendDocument(IncrementalIngester.java:3077) at org.apache.manifoldcf.agents.incrementalingest.IncrementalIngester$Pipe= lineObjectWithVersions.addOrReplaceDocumentWithException(IncrementalIngeste= r.java:2708) at org.apache.manifoldcf.agents.incrementalingest.IncrementalIngester.docu= mentIngest(IncrementalIngester.java:756) at org.apache.manifoldcf.crawler.system.WorkerThread$ProcessActivity.inges= tDocumentWithException(WorkerThread.java:1583) at org.apache.manifoldcf.crawler.system.WorkerThread$ProcessActivity.inges= tDocumentWithException(WorkerThread.java:1548) at org.apache.manifoldcf.crawler.connectors.filesystem.FileConnector.proce= ssDocuments(FileConnector.java:448) 13:40:33.995 [Worker thread '42'] WARN org.apache.manifoldcf.jobs - Service= interruption reported for 
job 1532551209410 connection 'file': IO exceptio= n: null [Thread-475] INFO org.eclipse.jetty.server.ServerConnector - Stopped Server= Connector@5d235104\{HTTP/1.1}{0.0.0.0:8345} [Thread-475] INFO org.eclipse.jetty.server.handler.ContextHandler - Stopped= o.e.j.w.WebAppContext@6105f8a3\{/mcf-api-service,file:/tmp/jetty-0.0.0.0-8= 345-mcf-api-service.war-_mcf-api-service-any-9896962439762567079.dir/webapp= /,UNAVAILABLE}{/opt/manifoldcf/manifoldcf_single/././web/war/mcf-api-servic= e.war} [Thread-475] INFO org.eclipse.jetty.server.handler.ContextHandler - Stopped= o.e.j.w.WebAppContext@12365c88\{/mcf-authority-service,file:/tmp/jetty-0.0= .0.0-8345-mcf-authority-service.war-_mcf-authority-service-any-395430836006= 4638561.dir/webapp/,UNAVAILABLE}{/opt/manifoldcf/manifoldcf_single/././web/= war/mcf-authority-service.war}=C2=A0 was: {{=C2=A0}}{{=C2=A0}}{{```}}{{Jul 26, 2018 1:21:51 PM org.apache.tika.config= .InitializableProblemHandler$3 handleInitializableProblem}} {{WARNING: org.xerial's sqlite-jdbc is not loaded.}} {{Please provide the jar on your classpath to parse sqlite files.}} {{See tika-parsers/pom.xml for the correct version.}} {{agents process ran out of memory - shutting down}} {{java.lang.OutOfMemoryError: Java heap space}} {{ \{{ at java.base/java.util.Arrays.copyOf(Arrays.java:3816)}}}} {{ \{{ at java.base/java.util.BitSet.ensureCapacity(BitSet.java:338)}}}} {{ \{{ at java.base/java.util.BitSet.expandTo(BitSet.java:353)}}}} {{ \{{ at java.base/java.util.BitSet.set(BitSet.java:448)}}}} {{ \{{ at de.l3s.boilerpipe.sax.BoilerpipeHTMLContentHandler.characters(Boi= lerpipeHTMLContentHandler.java:267)}}}} {{ \{{ at org.apache.tika.parser.html.BoilerpipeContentHandler.characters(B= oilerpipeContentHandler.java:155)}}}} {{ \{{ at org.apache.tika.sax.ContentHandlerDecorator.characters(ContentHan= dlerDecorator.java:146)}}}} {{ \{{ at org.apache.tika.sax.SecureContentHandler.characters(SecureContent= Handler.java:270)}}}} {{ \{{ at 
org.apache.tika.sax.ContentHandlerDecorator.characters(ContentHandlerDecorator.java:146)}}}}
{{ \{{ at org.apache.tika.sax.ContentHandlerDecorator.characters(ContentHandlerDecorator.java:146)}}}}
{{ \{{ at org.apache.tika.sax.ContentHandlerDecorator.characters(ContentHandlerDecorator.java:146)}}}}
{{ \{{ at org.apache.tika.sax.SafeContentHandler.access$001(SafeContentHandler.java:46)}}}}
{{ \{{ at org.apache.tika.sax.SafeContentHandler$1.write(SafeContentHandler.java:82)}}}}
{{ \{{ at org.apache.tika.sax.SafeContentHandler.filter(SafeContentHandler.java:140)}}}}
{{ \{{ at org.apache.tika.sax.SafeContentHandler.characters(SafeContentHandler.java:287)}}}}
{{ \{{ at org.apache.tika.sax.XHTMLContentHandler.characters(XHTMLContentHandler.java:279)}}}}
{{ \{{ at org.apache.tika.sax.XHTMLContentHandler.characters(XHTMLContentHandler.java:306)}}}}
{{ \{{ at org.apache.tika.parser.microsoft.TextCell.render(TextCell.java:34)}}}}
{{ \{{ at org.apache.tika.parser.microsoft.ExcelExtractor$TikaHSSFListener.processSheet(ExcelExtractor.java:609)}}}}
{{ \{{ at org.apache.tika.parser.microsoft.ExcelExtractor$TikaHSSFListener.internalProcessRecord(ExcelExtractor.java:392)}}}}
{{ \{{ at org.apache.tika.parser.microsoft.ExcelExtractor$TikaHSSFListener.processRecord(ExcelExtractor.java:343)}}}}
{{ \{{ at org.apache.poi.hssf.eventusermodel.FormatTrackingHSSFListener.processRecord(FormatTrackingHSSFListener.java:92)}}}}
{{ \{{ at org.apache.poi.hssf.eventusermodel.HSSFRequest.processRecord(HSSFRequest.java:109)}}}}
{{ \{{ at org.apache.poi.hssf.eventusermodel.HSSFEventFactory.genericProcessEvents(HSSFEventFactory.java:179)}}}}
{{ \{{ at org.apache.poi.hssf.eventusermodel.HSSFEventFactory.processEvents(HSSFEventFactory.java:136)}}}}
{{ \{{ at org.apache.tika.parser.microsoft.ExcelExtractor$TikaHSSFListener.processFile(ExcelExtractor.java:319)}}}}
{{ \{{ at org.apache.tika.parser.microsoft.ExcelExtractor.parse(ExcelExtractor.java:170)}}}}
{{ \{{ at org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java:184)}}}}
{{ \{{ at org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java:132)}}}}
{{ \{{ at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280)}}}}
{{ \{{ at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280)}}}}
{{ \{{ at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:143)}}}}
{{ {{[Thread-475] INFO org.eclipse.jetty.server.ServerConnector - Stopped ServerConnector@37095ded\{HTTP/1.1}}}{{{0.0.0.0:8345}}}{{}}}}
{{ {{[Thread-475] INFO org.eclipse.jetty.server.handler.ContextHandler - Stopped o.e.j.w.WebAppContext@5a6d5a8f{/mcf-api-service,[file:/tmp/jetty-0.0.0.0-8345-mcf-api-service.war-_mcf-api-service-any-14189461872304124764.dir/webapp/,UNAVAILABLE|file:///tmp/jetty-0.0.0.0-8345-mcf-api-service.war-_mcf-api-service-any-14189461872304124764.dir/webapp/,UNAVAILABLE]}}}{{{/opt/manifoldcf/manifoldcf_single/././web/war/mcf-api-service.war}}}{{}}}}
{{ {{[Thread-475] INFO org.eclipse.jetty.server.handler.ContextHandler - Stopped o.e.j.w.WebAppContext@6979efad{/mcf-authority-service,[file:/tmp/jetty-0.0.0.0-8345-mcf-authority-service.war-_mcf-authority-service-any-11619445383548662284.dir/webapp/,UNAVAILABLE|file:///tmp/jetty-0.0.0.0-8345-mcf-authority-service.war-_mcf-authority-service-any-11619445383548662284.dir/webapp/,UNAVAILABLE]}}}{{{/opt/manifoldcf/manifoldcf_single/././web/war/mcf-authority-service.war}}}{{}}}}
{{2018-07-26 13:22:47,170 qtp2061226112-492 FATAL Unable to register shutdown hook because JVM is shutting down. java.lang.IllegalStateException: Cannot add new shutdown hook as this is not started. Current state: STOPPED}}
{{ \{{ at org.apache.logging.log4j.core.util.DefaultShutdownCallbackRegistry.addShutdownCallback(DefaultShutdownCallbackRegistry.java:113)}}}}
{{ \{{ at org.apache.logging.log4j.core.impl.Log4jContextFactory.addShutdownCallback(Log4jContextFactory.java:271)}}}}
{{ \{{ at org.apache.logging.log4j.core.LoggerContext.setUpShutdownHook(LoggerContext.java:256)}}}}
{{ \{{ at org.apache.logging.log4j.core.LoggerContext.start(LoggerContext.java:216)}}}}
{{ \{{ at org.apache.logging.log4j.core.impl.Log4jContextFactory.getContext(Log4jContextFactory.java:146)}}}}
{{ \{{ at org.apache.logging.log4j.core.impl.Log4jContextFactory.getContext(Log4jContextFactory.java:41)}}}}
{{ \{{ at org.apache.logging.log4j.LogManager.getContext(LogManager.java:270)}}}}
{{ \{{ at org.apache.log4j.Logger$PrivateManager.getContext(Logger.java:59)}}}}
{{ \{{ at org.apache.log4j.Logger.getLogger(Logger.java:37)}}}}
{{ \{{ at org.apache.velocity.runtime.log.Log4JLogChute.init(Log4JLogChute.java:72)}}}}
{{ \{{ at org.apache.velocity.runtime.log.LogManager.createLogChute(LogManager.java:157)}}}}
{{ \{{ at org.apache.velocity.runtime.log.LogManager.updateLog(LogManager.java:269)}}}}
{{ \{{ at org.apache.velocity.runtime.RuntimeInstance.initializeLog(RuntimeInstance.java:871)}}}}
{{ \{{ at org.apache.velocity.runtime.RuntimeInstance.init(RuntimeInstance.java:262)}}}}
{{ \{{ at org.apache.velocity.runtime.RuntimeInstance.requireInitialization(RuntimeInstance.java:302)}}}}
{{ \{{ at org.apache.velocity.runtime.RuntimeInstance.getTemplate(RuntimeInstance.java:1531)}}}}
{{ \{{ at org.apache.velocity.app.VelocityEngine.mergeTemplate(VelocityEngine.java:343)}}}}
{{ \{{ at org.apache.manifoldcf.ui.i18n.Messages.outputResourceWithVelocity(Messages.java:159)}}}}
{{ \{{ at org.apache.manifoldcf.agents.transformation.tika.Messages.outputResourceWithVelocity(Messages.java:136)}}}}
{{ \{{ at org.apache.manifoldcf.agents.transformation.tika.TikaExtractor.outputSpecificationBody(TikaExtractor.java:544)}}}}
{{ \{{ at org.apache.jsp.editjob_jsp._jspService(editjob_jsp.java:3002)}}}}
{{ \{{ at org.apache.jasper.runtime.HttpJspBase.service(HttpJspBase.java:70)}}}}
{{ \{{ at javax.servlet.http.HttpServlet.service(HttpServlet.java:790)}}}}
{{ \{{ at org.apache.jasper.servlet.JspServletWrapper.service(JspServletWrapper.java:388)}}}}
{{ \{{ at org.apache.jasper.servlet.JspServlet.serviceJspFile(JspServlet.java:313)}}}}
{{ \{{ at org.apache.jasper.servlet.JspServlet.service(JspServlet.java:260)}}}}
{{ \{{ at javax.servlet.http.HttpServlet.service(HttpServlet.java:790)}}}}
{{ \{{ at org.eclipse.jetty.servlet.ServletHolder.handle(ServletHolder.java:769)}}}}
{{ \{{ at org.eclipse.jetty.servlet.ServletHandler.doHandle(ServletHandler.java:585)}}}}
{{ \{{ at org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.java:143)}}}}
{{ \{{ at org.eclipse.jetty.security.SecurityHandler.handle(SecurityHandler.java:577)}}}}
{{ \{{ at org.eclipse.jetty.server.session.SessionHandler.doHandle(SessionHandler.java:223)}}}}
{{ \{{ at org.eclipse.jetty.server.handler.ContextHandler.doHandle(ContextHandler.java:1125)}}}}
{{ \{{ at org.eclipse.jetty.servlet.ServletHandler.doScope(ServletHandler.java:515)}}}}
{{ \{{ at org.eclipse.jetty.server.session.SessionHandler.doScope(SessionHandler.java:185)}}}}
{{ \{{ at org.eclipse.jetty.server.handler.ContextHandler.doScope(ContextHandler.java:1059)}}}}
{{ \{{ at org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.java:141)}}}}
{{ \{{ at org.eclipse.jetty.server.handler.ContextHandlerCollection.handle(ContextHandlerCollection.java:215)}}}}
{{ \{{ at org.eclipse.jetty.server.handler.HandlerList.handle(HandlerList.java:52)}}}}
{{ \{{ at org.eclipse.jetty.server.handler.HandlerWrapper.handle(HandlerWrapper.java:97)}}}}
{{ \{{ at org.eclipse.jetty.server.Server.handle(Server.java:497)}}}}
{{ \{{ at org.eclipse.jetty.server.HttpChannel.handle(HttpChannel.java:311)}}}}
{{ \{{ at org.eclipse.jetty.server.HttpConnection.onFillable(HttpConnection.java:248)}}}}
{{ \{{ at org.eclipse.jetty.io.AbstractConnection$2.run(AbstractConnection.java:540)}}}}
{{ \{{ at org.eclipse.jetty.util.thread.QueuedThreadPool.runJob(QueuedThreadPool.java:610)}}}}
{{ \{{ at org.eclipse.jetty.util.thread.QueuedThreadPool$3.run(QueuedThreadPool.java:539)}}}}
{{ \{{ at java.base/java.lang.Thread.run(Thread.java:844)}}}}
{{[Worker thread '35'] WARN org.apache.tika.parser.microsoft.AbstractPOIFSExtractor - Ignoring unexpected exception while parsing summary entry SummaryInformation}}
{{java.lang.RuntimeException: java.nio.channels.ClosedByInterruptException}}
{{ \{{ at org.apache.poi.poifs.filesystem.NPOIFSStream$StreamBlockByteBufferIterator.<init>(NPOIFSStream.java:151)}}}}
{{ \{{ at org.apache.poi.poifs.filesystem.NPOIFSStream.getBlockIterator(NPOIFSStream.java:95)}}}}
{{ \{{ at org.apache.poi.poifs.filesystem.NPOIFSDocument.getBlockIterator(NPOIFSDocument.java:179)}}}}
{{ \{{ at org.apache.poi.poifs.filesystem.NDocumentInputStream.<init>(NDocumentInputStream.java:82)}}}}
{{ \{{ at org.apache.poi.poifs.filesystem.DocumentInputStream.<init>(DocumentInputStream.java:65)}}}}
{{ \{{ at org.apache.tika.parser.microsoft.SummaryExtractor.parseSummaryEntryIfExists(SummaryExtractor.java:83)}}}}
{{ \{{ at org.apache.tika.parser.microsoft.SummaryExtractor.parseSummaries(SummaryExtractor.java:73)}}}}
{{ \{{ at org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java:156)}}}}
{{ \{{ at org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java:132)}}}}
{{ \{{ at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280)}}}}
{{ \{{ at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280)}}}}
{{ \{{ at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:143)}}}}
{{ \{{ at org.apache.manifoldcf.agents.transformation.tika.TikaParser.parse(TikaParser.java:74)}}}}
{{ \{{ at org.apache.manifoldcf.agents.transformation.tika.TikaExtractor.addOrReplaceDocumentWithException(TikaExtractor.java:235)}}}}
{{ \{{ at org.apache.manifoldcf.agents.incrementalingest.IncrementalIngester$PipelineAddEntryPoint.addOrReplaceDocumentWithException(IncrementalIngester.java:3226)}}}}
{{ \{{ at org.apache.manifoldcf.agents.incrementalingest.IncrementalIngester$PipelineAddFanout.sendDocument(IncrementalIngester.java:3077)}}}}
{{ \{{ at org.apache.manifoldcf.agents.incrementalingest.IncrementalIngester$PipelineObjectWithVersions.addOrReplaceDocumentWithException(IncrementalIngester.java:2708)}}}}
{{ \{{ at org.apache.manifoldcf.agents.incrementalingest.IncrementalIngester.documentIngest(IncrementalIngester.java:756)}}}}
{{ \{{ at org.apache.manifoldcf.crawler.system.WorkerThread$ProcessActivity.ingestDocumentWithException(WorkerThread.java:1583)}}}}
{{ \{{ at org.apache.manifoldcf.crawler.system.WorkerThread$ProcessActivity.ingestDocumentWithException(WorkerThread.java:1548)}}}}
{{ \{{ at org.apache.manifoldcf.crawler.connectors.filesystem.FileConnector.processDocuments(FileConnector.java:448)}}}}
{{ \{{ at org.apache.manifoldcf.crawler.system.WorkerThread.run(WorkerThread.java:399)}}}}
{{Caused by: java.nio.channels.ClosedByInterruptException}}
{{ \{{ at java.base/java.nio.channels.spi.AbstractInterruptibleChannel.end(AbstractInterruptibleChannel.java:199)}}}}
{{ \{{ at java.base/sun.nio.ch.FileChannelImpl.size(FileChannelImpl.java:388)}}}}
{{ \{{ at org.apache.poi.poifs.nio.FileBackedDataSource.size(FileBackedDataSource.java:137)}}}}
{{ \{{ at org.apache.poi.poifs.filesystem.NPOIFSFileSystem.getChainLoopDetector(NPOIFSFileSystem.java:627)}}}}
{{ \{{ at org.apache.poi.poifs.filesystem.NPOIFSStream$StreamBlockByteBufferIterator.<init>(NPOIFSStream.java:149)}}}}
{{ \{{ ... 21 more}}}}
{{[Worker thread '35'] WARN org.apache.tika.parser.microsoft.AbstractPOIFSExtractor - Ignoring unexpected exception while parsing summary entry DocumentSummaryInformation}}
{{java.lang.RuntimeException: java.nio.channels.ClosedChannelException}}
{{ \{{ at org.apache.poi.poifs.filesystem.NPOIFSStream$StreamBlockByteBufferIterator.<init>(NPOIFSStream.java:151)}}}}
{{ \{{ at org.apache.poi.poifs.filesystem.NPOIFSStream.getBlockIterator(NPOIFSStream.java:95)}}}}
{{ \{{ at org.apache.poi.poifs.filesystem.NPOIFSMiniStore.getBlockAt(NPOIFSMiniStore.java:67)}}}}
{{ \{{ at org.apache.poi.poifs.filesystem.NPOIFSStream$StreamBlockByteBufferIterator.next(NPOIFSStream.java:169)}}}}
{{ \{{ at org.apache.poi.poifs.filesystem.NPOIFSStream$StreamBlockByteBufferIterator.next(NPOIFSStream.java:142)}}}}
{{ \{{ at org.apache.poi.poifs.filesystem.NDocumentInputStream.readFully(NDocumentInputStream.java:264)}}}}
{{ \{{ at org.apache.poi.poifs.filesystem.NDocumentInputStream.read(NDocumentInputStream.java:162)}}}}
{{ \{{ at org.apache.poi.poifs.filesystem.DocumentInputStream.read(DocumentInputStream.java:127)}}}}
{{ \{{ at org.apache.poi.util.BoundedInputStream.read(BoundedInputStream.java:121)}}}}
{{ \{{ at org.apache.poi.util.BoundedInputStream.read(BoundedInputStream.java:103)}}}}
{{ \{{ at org.apache.poi.util.IOUtils.copy(IOUtils.java:312)}}}}
{{ \{{ at org.apache.poi.util.IOUtils.peekFirstNBytes(IOUtils.java:70)}}}}
{{ \{{ at org.apache.poi.hpsf.PropertySet.isPropertySetStream(PropertySet.java:393)}}}}
{{ \{{ at org.apache.poi.hpsf.PropertySet.<init>(PropertySet.java:191)}}}}
{{ \{{ at org.apache.tika.parser.microsoft.SummaryExtractor.parseSummaryEntryIfExists(SummaryExtractor.java:83)}}}}
{{ \{{ at org.apache.tika.parser.microsoft.SummaryExtractor.parseSummaries(SummaryExtractor.java:74)}}}}
{{ \{{ at org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java:156)}}}}
{{ \{{ at org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java:132)}}}}
{{ \{{ at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280)}}}}
{{ \{{ at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280)}}}}
{{ \{{ at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:143)}}}}
{{ \{{ at org.apache.manifoldcf.agents.transformation.tika.TikaParser.parse(TikaParser.java:74)}}}}
{{ \{{ at org.apache.manifoldcf.agents.transformation.tika.TikaExtractor.addOrReplaceDocumentWithException(TikaExtractor.java:235)}}}}
{{ \{{ at org.apache.manifoldcf.agents.incrementalingest.IncrementalIngester$PipelineAddEntryPoint.addOrReplaceDocumentWithException(IncrementalIngester.java:3226)}}}}
{{ \{{ at org.apache.manifoldcf.agents.incrementalingest.IncrementalIngester$PipelineAddFanout.sendDocument(IncrementalIngester.java:3077)}}}}
{{ \{{ at org.apache.manifoldcf.agents.incrementalingest.IncrementalIngester$PipelineObjectWithVersions.addOrReplaceDocumentWithException(IncrementalIngester.java:2708)}}}}
{{ \{{ at org.apache.manifoldcf.agents.incrementalingest.IncrementalIngester.documentIngest(IncrementalIngester.java:756)}}}}
{{ \{{ at org.apache.manifoldcf.crawler.system.WorkerThread$ProcessActivity.ingestDocumentWithException(WorkerThread.java:1583)}}}}
{{ \{{ at org.apache.manifoldcf.crawler.system.WorkerThread$ProcessActivity.ingestDocumentWithException(WorkerThread.java:1548)}}}}
{{ \{{ at org.apache.manifoldcf.crawler.connectors.filesystem.FileConnector.processDocuments(FileConnector.java:448)}}}}
{{ \{{ at org.apache.manifoldcf.crawler.system.WorkerThread.run(WorkerThread.java:399)}}}}
{{Caused by: java.nio.channels.ClosedChannelException}}
{{ \{{ at java.base/sun.nio.ch.FileChannelImpl.ensureOpen(FileChannelImpl.java:158)}}}}
{{ \{{ at java.base/sun.nio.ch.FileChannelImpl.size(FileChannelImpl.java:373)}}}}
{{ \{{ at org.apache.poi.poifs.nio.FileBackedDataSource.size(FileBackedDataSource.java:137)}}}}
{{ \{{ at org.apache.poi.poifs.filesystem.NPOIFSFileSystem.getChainLoopDetector(NPOIFSFileSystem.java:627)}}}}
{{ \{{ at org.apache.poi.poifs.filesystem.NPOIFSStream$StreamBlockByteBufferIterator.<init>(NPOIFSStream.java:149)}}}}
{{ \{{ ... 30 more}}}}{{ }}{{```}}{{}}
{{Following up:}}{{When these exceptions occur, the heap runs out:}}
{{13:39:39.856 [Worker thread '49'] WARN org.apache.manifoldcf.jobs - Service interruption reported for job 1532551209410 connection 'file': IO exception: null}}
{{13:39:39.970 [Worker thread '43'] WARN org.apache.manifoldcf.jobs - Service interruption reported for job 1532551209410 connection 'file': IO exception: null}}
{{13:39:40.415 [Worker thread '34'] WARN org.apache.manifoldcf.jobs - Service interruption reported for job 1532551209410 connection 'file': IO exception: null}}
{{13:39:40.469 [Worker thread '1'] WARN org.apache.manifoldcf.jobs - Service interruption reported for job 1532551209410 connection 'file': IO exception: null}}
{{13:39:43.739 [Worker thread '32'] WARN org.apache.manifoldcf.jobs - Service interruption reported for job 1532551209410 connection 'file': IO exception: null}}
{{13:39:44.697 [Worker thread '43'] WARN org.apache.manifoldcf.jobs - Service interruption reported for job 1532551209410 connection 'file': IO exception: null}}
{{13:39:45.756 [Worker thread '33'] WARN org.apache.manifoldcf.jobs - Service interruption reported for job 1532551209410 connection 'file': IO exception: null}}
{{13:39:45.775 [Worker thread '36'] WARN org.apache.manifoldcf.jobs - Service interruption reported for job 1532551209410 connection 'file': IO exception: null}}
{{13:39:46.751 [Worker thread '35'] WARN org.apache.manifoldcf.jobs - Service interruption reported for job 1532551209410 connection 'file': IO exception: null}}
{{13:39:46.753 [Worker thread '40'] WARN org.apache.manifoldcf.jobs - Service interruption reported for job 1532551209410 connection 'file':
IO except= ion: null}} {{13:39:47.536 [Worker thread '45'] WARN org.apache.manifoldcf.jobs - Servi= ce interruption reported for job 1532551209410 connection 'file': IO except= ion: null}} {{13:39:48.734 [Worker thread '44'] WARN org.apache.manifoldcf.jobs - Servi= ce interruption reported for job 1532551209410 connection 'file': IO except= ion: null}} {{13:39:50.922 [Worker thread '30'] WARN org.apache.manifoldcf.jobs - Servi= ce interruption reported for job 1532551209410 connection 'file': IO except= ion: null}} {{13:39:54.930 [Worker thread '28'] WARN org.apache.manifoldcf.jobs - Servi= ce interruption reported for job 1532551209410 connection 'file': IO except= ion: null}} {{13:40:33.660 [Worker thread '29'] WARN org.apache.manifoldcf.jobs - Servi= ce interruption reported for job 1532551209410 connection 'file': IO except= ion: null}} {{agents process ran out of memory - shutting down}} {{java.lang.OutOfMemoryError: Java heap space}} {{ at java.base/java.lang.StringLatin1.newString(StringLatin1.java:549)}} {{ at java.base/java.lang.StringBuilder.toString(StringBuilder.java:415)}} {{ at de.l3s.boilerpipe.sax.BoilerpipeHTMLContentHandler.flushBlock(Boilerp= ipeHTMLContentHandler.java:341)}} {{ at de.l3s.boilerpipe.sax.BoilerpipeHTMLContentHandler.characters(Boilerp= ipeHTMLContentHandler.java:198)}} {{ at org.apache.tika.parser.html.BoilerpipeContentHandler.characters(Boile= rpipeContentHandler.java:155)}} {{ at org.apache.tika.sax.ContentHandlerDecorator.characters(ContentHandler= Decorator.java:146)}} {{ at org.apache.tika.sax.SecureContentHandler.characters(SecureContentHand= ler.java:270)}} {{ at org.apache.tika.sax.ContentHandlerDecorator.characters(ContentHandler= Decorator.java:146)}} {{ at org.apache.tika.sax.ContentHandlerDecorator.characters(ContentHandler= Decorator.java:146)}} {{ at org.apache.tika.sax.ContentHandlerDecorator.characters(ContentHandler= Decorator.java:146)}} {{ at org.apache.tika.sax.SafeContentHandler.access$001(SafeContentHandler.= 
java:46)}} {{ at org.apache.tika.sax.SafeContentHandler$1.write(SafeContentHandler.jav= a:82)}} {{ at org.apache.tika.sax.SafeContentHandler.filter(SafeContentHandler.java= :140)}} {{ at org.apache.tika.sax.SafeContentHandler.characters(SafeContentHandler.= java:287)}} {{ at org.apache.tika.sax.XHTMLContentHandler.characters(XHTMLContentHandle= r.java:279)}} {{ at org.apache.tika.sax.XHTMLContentHandler.characters(XHTMLContentHandle= r.java:306)}} {{ at org.apache.tika.parser.microsoft.TextCell.render(TextCell.java:34)}} {{ at org.apache.tika.parser.microsoft.ExcelExtractor$TikaHSSFListener.proc= essSheet(ExcelExtractor.java:609)}} {{ at org.apache.tika.parser.microsoft.ExcelExtractor$TikaHSSFListener.inte= rnalProcessRecord(ExcelExtractor.java:392)}} {{ at org.apache.tika.parser.microsoft.ExcelExtractor$TikaHSSFListener.proc= essRecord(ExcelExtractor.java:343)}} {{ at org.apache.poi.hssf.eventusermodel.FormatTrackingHSSFListener.process= Record(FormatTrackingHSSFListener.java:92)}} {{ at org.apache.poi.hssf.eventusermodel.HSSFRequest.processRecord(HSSFRequ= est.java:109)}} {{ at org.apache.poi.hssf.eventusermodel.HSSFEventFactory.genericProcessEve= nts(HSSFEventFactory.java:179)}} {{ at org.apache.poi.hssf.eventusermodel.HSSFEventFactory.processEvents(HSS= FEventFactory.java:136)}} {{ at org.apache.tika.parser.microsoft.ExcelExtractor$TikaHSSFListener.proc= essFile(ExcelExtractor.java:319)}} {{ at org.apache.tika.parser.microsoft.ExcelExtractor.parse(ExcelExtractor.= java:170)}} {{ at org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java= :184)}} {{ at org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java= :132)}} {{ at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280= )}} {{ at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280= )}} {{ at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:1= 43)}} {{ at org.apache.manifoldcf.agents.transformation.tika.TikaParser.parse(Tik= 
aParser.java:74)}} {{agents process ran out of memory - shutting down}} {{java.lang.OutOfMemoryError: Java heap space}} {{ at java.base/java.util.Arrays.copyOf(Arrays.java:3744)}} {{ at java.base/java.lang.AbstractStringBuilder.ensureCapacityInternal(Abst= ractStringBuilder.java:146)}} {{ at java.base/java.lang.AbstractStringBuilder.append(AbstractStringBuilde= r.java:531)}} {{ at java.base/java.lang.AbstractStringBuilder.append(AbstractStringBuilde= r.java:550)}} {{ at java.base/java.lang.StringBuilder.append(StringBuilder.java:171)}} {{ at java.base/java.util.regex.Matcher.appendReplacement(Matcher.java:1002= )}} {{ at java.base/java.util.regex.Matcher.replaceAll(Matcher.java:1181)}} {{ at de.l3s.boilerpipe.util.UnicodeTokenizer.tokenize(UnicodeTokenizer.jav= a:40)}} {{ at de.l3s.boilerpipe.sax.BoilerpipeHTMLContentHandler.flushBlock(Boilerp= ipeHTMLContentHandler.java:296)}} {{ at de.l3s.boilerpipe.sax.CommonTagActions$3.end(CommonTagActions.java:14= 3)}} {{ at de.l3s.boilerpipe.sax.BoilerpipeHTMLContentHandler.endElement(Boilerp= ipeHTMLContentHandler.java:183)}} {{ at org.apache.tika.parser.html.BoilerpipeContentHandler.endElement(Boile= rpipeContentHandler.java:175)}} {{ at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandler= Decorator.java:136)}} {{ at org.apache.tika.sax.SecureContentHandler.endElement(SecureContentHand= ler.java:256)}} {{ at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandler= Decorator.java:136)}} {{ at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandler= Decorator.java:136)}} {{ at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandler= Decorator.java:136)}} {{ at org.apache.tika.sax.SafeContentHandler.endElement(SafeContentHandler.= java:273)}} {{ at org.apache.tika.sax.XHTMLContentHandler.endDocument(XHTMLContentHandl= er.java:224)}} {{ at org.apache.tika.parser.txt.TXTParser.parse(TXTParser.java:109)}} {{ at 
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280)}}
{{ at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280)}}
{{ at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:143)}}
{{ at org.apache.manifoldcf.agents.transformation.tika.TikaParser.parse(TikaParser.java:74)}}
{{ at org.apache.manifoldcf.agents.transformation.tika.TikaExtractor.addOrReplaceDocumentWithException(TikaExtractor.java:235)}}
{{ at org.apache.manifoldcf.agents.incrementalingest.IncrementalIngester$PipelineAddEntryPoint.addOrReplaceDocumentWithException(IncrementalIngester.java:3226)}}
{{ at org.apache.manifoldcf.agents.incrementalingest.IncrementalIngester$PipelineAddFanout.sendDocument(IncrementalIngester.java:3077)}}
{{ at org.apache.manifoldcf.agents.incrementalingest.IncrementalIngester$PipelineObjectWithVersions.addOrReplaceDocumentWithException(IncrementalIngester.java:2708)}}
{{ at org.apache.manifoldcf.agents.incrementalingest.IncrementalIngester.documentIngest(IncrementalIngester.java:756)}}
{{ at org.apache.manifoldcf.crawler.system.WorkerThread$ProcessActivity.ingestDocumentWithException(WorkerThread.java:1583)}}
{{ at org.apache.manifoldcf.crawler.system.WorkerThread$ProcessActivity.ingestDocumentWithException(WorkerThread.java:1548)}}
{{ at org.apache.manifoldcf.crawler.connectors.filesystem.FileConnector.processDocuments(FileConnector.java:448)}}
{{13:40:33.995 [Worker thread '42'] WARN org.apache.manifoldcf.jobs - Service interruption reported for job 1532551209410 connection 'file': IO exception: null}}
{{[Thread-475] INFO org.eclipse.jetty.server.ServerConnector - Stopped ServerConnector@5d235104\{HTTP/1.1}{0.0.0.0:8345}}}
{{[Thread-475] INFO org.eclipse.jetty.server.handler.ContextHandler - Stopped o.e.j.w.WebAppContext@6105f8a3\{/mcf-api-service,file:/tmp/jetty-0.0.0.0-8345-mcf-api-service.war-_mcf-api-service-any-9896962439762567079.dir/webapp/,UNAVAILABLE}{/opt/manifoldcf/manifoldcf_single/././web/war/mcf-api-service.war}}}
{{[Thread-475] INFO org.eclipse.jetty.server.handler.ContextHandler - Stopped o.e.j.w.WebAppContext@12365c88\{/mcf-authority-service,file:/tmp/jetty-0.0.0.0-8345-mcf-authority-service.war-_mcf-authority-service-any-3954308360064638561.dir/webapp/,UNAVAILABLE}{/opt/manifoldcf/manifoldcf_single/././web/war/mcf-authority-service.war}}}

> MCF shutting down when Tika is used
> -----------------------------------
>
>                 Key: CONNECTORS-1518
>                 URL: https://issues.apache.org/jira/browse/CONNECTORS-1518
>             Project: ManifoldCF
>          Issue Type: Bug
>          Components: Tika extractor
>    Affects Versions: ManifoldCF 2.10
>         Environment: Centos 7
> Prior to crash:
> $free -h
>               total        used        free      shared  buff/cache   available
> Mem:            15G        1.8G         12G         98M        1.1G         13G
> Swap:          2.0G          0B        2.0G
> After crash:
> $free -h
>               total        used        free      shared  buff/cache   available
> Mem:            15G         10G        4.0G         98M        1.1G        4.4G
> Swap:          2.0G          0B        2.0G
>
> {{start-options.env.unix :}}
> {{-Xss500m}}
> {{-Xms1g}}
> {{-Xmx8g}}
> {{-Dorg.apache.manifoldcf.configfile=./properties.xml}}
> {{-Dorg.apache.manifoldcf.jettyshutdowntoken=secret_token}}
> {{-cp}}
> {{.:./lib/mcf-core.jar:./lib/mcf-agents.jar:./lib/mcf-pull-agent.jar:./lib/mcf-ui-core.jar:./lib/mcf-jetty-runner.jar:./lib/jetty-continuation-9.2.3.v20140905.jar:./lib/jetty-http-9.2.3.v20140905.jar:./lib/jetty-io-9.2.3.v20140905.jar:./lib/jetty-jndi-9.2.3.v20140905.jar:./lib/jetty-jsp-jdt-2.3.3.jar:./lib/jetty-plus-9.2.3.v20140905.jar:./lib/jetty-schemas-3.1.M0.jar:./lib/jetty-security-9.2.3.v20140905.jar:./lib/jetty-server-9.2.3.v20140905.jar:./lib/jetty-servlet-9.2.3.v20140905.jar:./lib/jetty-util-9.2.3.v20140905.jar:./lib/jetty-webapp-9.2.3.v20140905.jar:./lib/jetty-xml-9.2.3.v20140905.jar:./lib/hsqldb-2.3.2.jar:./lib/postgresql-42.1.3.jar:./lib/commons-codec-1.10.jar:./lib/commons-collections-3.2.1.jar:./lib/commons-collections4-4.1.jar:./lib/commons-discovery-0.5.jar:./lib/commons-el-1.0.jar:./lib/commons-exec-1.3.jar:./lib/commons-fileupload-1.2.2.jar:./lib/commons-io-2.5.jar:./lib/commons-lang-2.6.jar:./lib/commons-lang3-3.6.jar:./lib/commons-logging-1.2.jar:./lib/ecj-4.3.1.jar:./lib/gson-2.8.0.jar:./lib/guava-21.0.jar:./lib/httpclient-4.5.3.jar:./lib/httpcore-4.4.6.jar:./lib/jasper-6.0.35.jar:./lib/jasper-el-6.0.35.jar:./lib/javax.servlet-api-3.1.0.jar:./lib/jna-4.1.0.jar:./lib/jna-platform-4.1.0.jar:./lib/json-simple-1.1.1.jar:./lib/jsp-api-2.1-glassfish-2.1.v20091210.jar:./lib/juli-6.0.35.jar:./lib/log4j-1.2-api-2.4.1.jar:./lib/log4j-api-2.4.1.jar:./lib/log4j-core-2.4.1.jar:./lib/mail-1.4.5.jar:./lib/serializer-2.7.1.jar:./lib/slf4j-api-1.7.24.jar:./lib/slf4j-simple-1.7.24.jar:./lib/velocity-1.7.jar:./lib/xalan-2.7.1.jar:./lib/xercesImpl-2.10.0.jar:./lib/xml-apis-1.4.01.jar:./lib/zookeeper-3.4.10.jar:}}
>            Reporter: Steph van Schalkwyk
>            Priority: Major
>
> ```Jul 26, 2018 1:21:51 PM org.apache.tika.config.InitializableProblemHandler$3 handleInitializableProblem
> WARNING: org.xerial's sqlite-jdbc is not loaded.
> Please provide the jar on your classpath to parse sqlite files.
> See tika-parsers/pom.xml for the correct version.
> agents process ran out of memory - shutting down > java.lang.OutOfMemoryError: Java heap space > \{{ {{ at java.base/java.util.Arrays.copyOf(Arrays.java:3816)}}}} > \{{ {{ at java.base/java.util.BitSet.ensureCapacity(BitSet.java:338)}}}} > \{{ {{ at java.base/java.util.BitSet.expandTo(BitSet.java:353)}}}} > \{{ {{ at java.base/java.util.BitSet.set(BitSet.java:448)}}}} > \{{ {{ at de.l3s.boilerpipe.sax.BoilerpipeHTMLContentHandler.characters(= BoilerpipeHTMLContentHandler.java:267)}}}} > \{{ {{ at org.apache.tika.parser.html.BoilerpipeContentHandler.character= s(BoilerpipeContentHandler.java:155)}}}} > \{{ {{ at org.apache.tika.sax.ContentHandlerDecorator.characters(Content= HandlerDecorator.java:146)}}}} > \{{ {{ at org.apache.tika.sax.SecureContentHandler.characters(SecureCont= entHandler.java:270)}}}} > \{{ {{ at org.apache.tika.sax.ContentHandlerDecorator.characters(Content= HandlerDecorator.java:146)}}}} > \{{ {{ at org.apache.tika.sax.ContentHandlerDecorator.characters(Content= HandlerDecorator.java:146)}}}} > \{{ {{ at org.apache.tika.sax.ContentHandlerDecorator.characters(Content= HandlerDecorator.java:146)}}}} > \{{ {{ at org.apache.tika.sax.SafeContentHandler.access$001(SafeContentH= andler.java:46)}}}} > \{{ {{ at org.apache.tika.sax.SafeContentHandler$1.write(SafeContentHand= ler.java:82)}}}} > \{{ {{ at org.apache.tika.sax.SafeContentHandler.filter(SafeContentHandl= er.java:140)}}}} > \{{ {{ at org.apache.tika.sax.SafeContentHandler.characters(SafeContentH= andler.java:287)}}}} > \{{ {{ at org.apache.tika.sax.XHTMLContentHandler.characters(XHTMLConten= tHandler.java:279)}}}} > \{{ {{ at org.apache.tika.sax.XHTMLContentHandler.characters(XHTMLConten= tHandler.java:306)}}}} > \{{ {{ at org.apache.tika.parser.microsoft.TextCell.render(TextCell.java= :34)}}}} > \{{ {{ at org.apache.tika.parser.microsoft.ExcelExtractor$TikaHSSFListen= er.processSheet(ExcelExtractor.java:609)}}}} > \{{ {{ at org.apache.tika.parser.microsoft.ExcelExtractor$TikaHSSFListen= 
er.internalProcessRecord(ExcelExtractor.java:392)}}}} > \{{ {{ at org.apache.tika.parser.microsoft.ExcelExtractor$TikaHSSFListen= er.processRecord(ExcelExtractor.java:343)}}}} > \{{ {{ at org.apache.poi.hssf.eventusermodel.FormatTrackingHSSFListener.= processRecord(FormatTrackingHSSFListener.java:92)}}}} > \{{ {{ at org.apache.poi.hssf.eventusermodel.HSSFRequest.processRecord(H= SSFRequest.java:109)}}}} > \{{ {{ at org.apache.poi.hssf.eventusermodel.HSSFEventFactory.genericPro= cessEvents(HSSFEventFactory.java:179)}}}} > \{{ {{ at org.apache.poi.hssf.eventusermodel.HSSFEventFactory.processEve= nts(HSSFEventFactory.java:136)}}}} > \{{ {{ at org.apache.tika.parser.microsoft.ExcelExtractor$TikaHSSFListen= er.processFile(ExcelExtractor.java:319)}}}} > \{{ {{ at org.apache.tika.parser.microsoft.ExcelExtractor.parse(ExcelExt= ractor.java:170)}}}} > \{{ {{ at org.apache.tika.parser.microsoft.OfficeParser.parse(OfficePars= er.java:184)}}}} > \{{ {{ at org.apache.tika.parser.microsoft.OfficeParser.parse(OfficePars= er.java:132)}}}} > \{{ {{ at org.apache.tika.parser.CompositeParser.parse(CompositeParser.j= ava:280)}}}} > \{{ {{ at org.apache.tika.parser.CompositeParser.parse(CompositeParser.j= ava:280)}}}} > \{{ {{ at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser= .java:143)}}}} > {{ [Thread-475] INFO org.eclipse.jetty.server.ServerConnector - Stopped = ServerConnector@37095ded\{HTTP/1.1}{{ > {0.0.0.0:8345} > }}}} > {{ {{[Thread-475] INFO org.eclipse.jetty.server.handler.ContextHandler -= Stopped o.e.j.w.WebAppContext@5a6d5a8f > {/mcf-api-service,[file:/tmp/jetty-0.0.0.0-8345-mcf-api-service.war-_mcf-= api-service-any-14189461872304124764.dir/webapp/,UNAVAILABLE|file:///tmp/je= tty-0.0.0.0-8345-mcf-api-service.war-_mcf-api-service-any-14189461872304124= 764.dir/webapp/,UNAVAILABLE]} > }}{{ > {/opt/manifoldcf/manifoldcf_single/././web/war/mcf-api-service.war}}}}} > {{ [Thread-475] INFO org.eclipse.jetty.server.handler.ContextHandler - S= topped 
o.e.j.w.WebAppContext@6979efad\{/mcf-authority-service,[file:/tmp/je= tty-0.0.0.0-8345-mcf-authority-service.war-_mcf-authority-service-any-11619= 445383548662284.dir/webapp/,UNAVAILABLE|file:///tmp/jetty-0.0.0.0-8345-mcf-= authority-service.war-_mcf-authority-service-any-11619445383548662284.dir/w= ebapp/,UNAVAILABLE]}{/opt/manifoldcf/manifoldcf_single/././web/war/mcf-auth= ority-service.war}}} > 2018-07-26 13:22:47,170 qtp2061226112-492 FATAL Unable to register shutd= own hook because JVM is shutting down. java.lang.IllegalStateException: Can= not add new shutdown hook as this is not started. Current state: STOPPED > \{{ {{ at org.apache.logging.log4j.core.util.DefaultShutdownCallbackRegi= stry.addShutdownCallback(DefaultShutdownCallbackRegistry.java:113)}}}} > \{{ {{ at org.apache.logging.log4j.core.impl.Log4jContextFactory.addShut= downCallback(Log4jContextFactory.java:271)}}}} > \{{ {{ at org.apache.logging.log4j.core.LoggerContext.setUpShutdownHook(= LoggerContext.java:256)}}}} > \{{ {{ at org.apache.logging.log4j.core.LoggerContext.start(LoggerContex= t.java:216)}}}} > \{{ {{ at org.apache.logging.log4j.core.impl.Log4jContextFactory.getCont= ext(Log4jContextFactory.java:146)}}}} > \{{ {{ at org.apache.logging.log4j.core.impl.Log4jContextFactory.getCont= ext(Log4jContextFactory.java:41)}}}} > \{{ {{ at org.apache.logging.log4j.LogManager.getContext(LogManager.java= :270)}}}} > \{{ {{ at org.apache.log4j.Logger$PrivateManager.getContext(Logger.java:= 59)}}}} > \{{ {{ at org.apache.log4j.Logger.getLogger(Logger.java:37)}}}} > \{{ {{ at org.apache.velocity.runtime.log.Log4JLogChute.init(Log4JLogChu= te.java:72)}}}} > \{{ {{ at org.apache.velocity.runtime.log.LogManager.createLogChute(LogM= anager.java:157)}}}} > \{{ {{ at org.apache.velocity.runtime.log.LogManager.updateLog(LogManage= r.java:269)}}}} > \{{ {{ at org.apache.velocity.runtime.RuntimeInstance.initializeLog(Runt= imeInstance.java:871)}}}} > \{{ {{ at 
org.apache.velocity.runtime.RuntimeInstance.init(RuntimeInstan= ce.java:262)}}}} > \{{ {{ at org.apache.velocity.runtime.RuntimeInstance.requireInitializat= ion(RuntimeInstance.java:302)}}}} > \{{ {{ at org.apache.velocity.runtime.RuntimeInstance.getTemplate(Runtim= eInstance.java:1531)}}}} > \{{ {{ at org.apache.velocity.app.VelocityEngine.mergeTemplate(VelocityE= ngine.java:343)}}}} > \{{ {{ at org.apache.manifoldcf.ui.i18n.Messages.outputResourceWithVeloc= ity(Messages.java:159)}}}} > \{{ {{ at org.apache.manifoldcf.agents.transformation.tika.Messages.outp= utResourceWithVelocity(Messages.java:136)}}}} > \{{ {{ at org.apache.manifoldcf.agents.transformation.tika.TikaExtractor= .outputSpecificationBody(TikaExtractor.java:544)}}}} > \{{ {{ at org.apache.jsp.editjob_jsp._jspService(editjob_jsp.java:3002)}= }}} > \{{ {{ at org.apache.jasper.runtime.HttpJspBase.service(HttpJspBase.java= :70)}}}} > \{{ {{ at javax.servlet.http.HttpServlet.service(HttpServlet.java:790)}}= }} > \{{ {{ at org.apache.jasper.servlet.JspServletWrapper.service(JspServlet= Wrapper.java:388)}}}} > \{{ {{ at org.apache.jasper.servlet.JspServlet.serviceJspFile(JspServlet= .java:313)}}}} > \{{ {{ at org.apache.jasper.servlet.JspServlet.service(JspServlet.java:2= 60)}}}} > \{{ {{ at javax.servlet.http.HttpServlet.service(HttpServlet.java:790)}}= }} > \{{ {{ at org.eclipse.jetty.servlet.ServletHolder.handle(ServletHolder.j= ava:769)}}}} > \{{ {{ at org.eclipse.jetty.servlet.ServletHandler.doHandle(ServletHandl= er.java:585)}}}} > \{{ {{ at org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHa= ndler.java:143)}}}} > \{{ {{ at org.eclipse.jetty.security.SecurityHandler.handle(SecurityHand= ler.java:577)}}}} > \{{ {{ at org.eclipse.jetty.server.session.SessionHandler.doHandle(Sessi= onHandler.java:223)}}}} > \{{ {{ at org.eclipse.jetty.server.handler.ContextHandler.doHandle(Conte= xtHandler.java:1125)}}}} > \{{ {{ at org.eclipse.jetty.servlet.ServletHandler.doScope(ServletHandle= 
r.java:515)}}}} > \{{ {{ at org.eclipse.jetty.server.session.SessionHandler.doScope(Sessio= nHandler.java:185)}}}} > \{{ {{ at org.eclipse.jetty.server.handler.ContextHandler.doScope(Contex= tHandler.java:1059)}}}} > \{{ {{ at org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHa= ndler.java:141)}}}} > \{{ {{ at org.eclipse.jetty.server.handler.ContextHandlerCollection.hand= le(ContextHandlerCollection.java:215)}}}} > \{{ {{ at org.eclipse.jetty.server.handler.HandlerList.handle(HandlerLis= t.java:52)}}}} > \{{ {{ at org.eclipse.jetty.server.handler.HandlerWrapper.handle(Handler= Wrapper.java:97)}}}} > \{{ {{ at org.eclipse.jetty.server.Server.handle(Server.java:497)}}}} > \{{ {{ at org.eclipse.jetty.server.HttpChannel.handle(HttpChannel.java:3= 11)}}}} > \{{ {{ at org.eclipse.jetty.server.HttpConnection.onFillable(HttpConnect= ion.java:248)}}}} > \{{ {{ at org.eclipse.jetty.io.AbstractConnection$2.run(AbstractConnecti= on.java:540)}}}} > \{{ {{ at org.eclipse.jetty.util.thread.QueuedThreadPool.runJob(QueuedTh= readPool.java:610)}}}} > \{{ {{ at org.eclipse.jetty.util.thread.QueuedThreadPool$3.run(QueuedThr= eadPool.java:539)}}}} > \{{ {{ at java.base/java.lang.Thread.run(Thread.java:844)}}}}[Worker thr= ead '35'] WARN org.apache.tika.parser.microsoft.AbstractPOIFSExtractor - Ig= noring unexpected exception while parsing summary entry SummaryInformation > java.lang.RuntimeException: java.nio.channels.ClosedByInterruptException > \{{ {{ at org.apache.poi.poifs.filesystem.NPOIFSStream$StreamBlockByteBu= fferIterator.(NPOIFSStream.java:151)}}}} > \{{ {{ at org.apache.poi.poifs.filesystem.NPOIFSStream.getBlockIterator(= NPOIFSStream.java:95)}}}} > \{{ {{ at org.apache.poi.poifs.filesystem.NPOIFSDocument.getBlockIterato= r(NPOIFSDocument.java:179)}}}} > \{{ {{ at org.apache.poi.poifs.filesystem.NDocumentInputStream.(ND= ocumentInputStream.java:82)}}}} > \{{ {{ at org.apache.poi.poifs.filesystem.DocumentInputStream.(Doc= umentInputStream.java:65)}}}} > \{{ {{ at 
org.apache.tika.parser.microsoft.SummaryExtractor.parseSummary= EntryIfExists(SummaryExtractor.java:83)}}}} > \{{ {{ at org.apache.tika.parser.microsoft.SummaryExtractor.parseSummari= es(SummaryExtractor.java:73)}}}} > \{{ {{ at org.apache.tika.parser.microsoft.OfficeParser.parse(OfficePars= er.java:156)}}}} > \{{ {{ at org.apache.tika.parser.microsoft.OfficeParser.parse(OfficePars= er.java:132)}}}} > \{{ {{ at org.apache.tika.parser.CompositeParser.parse(CompositeParser.j= ava:280)}}}} > \{{ {{ at org.apache.tika.parser.CompositeParser.parse(CompositeParser.j= ava:280)}}}} > \{{ {{ at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser= .java:143)}}}} > \{{ {{ at org.apache.manifoldcf.agents.transformation.tika.TikaParser.pa= rse(TikaParser.java:74)}}}} > \{{ {{ at org.apache.manifoldcf.agents.transformation.tika.TikaExtractor= .addOrReplaceDocumentWithException(TikaExtractor.java:235)}}}} > \{{ {{ at org.apache.manifoldcf.agents.incrementalingest.IncrementalInge= ster$PipelineAddEntryPoint.addOrReplaceDocumentWithException(IncrementalIng= ester.java:3226)}}}} > \{{ {{ at org.apache.manifoldcf.agents.incrementalingest.IncrementalInge= ster$PipelineAddFanout.sendDocument(IncrementalIngester.java:3077)}}}} > \{{ {{ at org.apache.manifoldcf.agents.incrementalingest.IncrementalInge= ster$PipelineObjectWithVersions.addOrReplaceDocumentWithException(Increment= alIngester.java:2708)}}}} > \{{ {{ at org.apache.manifoldcf.agents.incrementalingest.IncrementalInge= ster.documentIngest(IncrementalIngester.java:756)}}}} > \{{ {{ at org.apache.manifoldcf.crawler.system.WorkerThread$ProcessActiv= ity.ingestDocumentWithException(WorkerThread.java:1583)}}}} > \{{ {{ at org.apache.manifoldcf.crawler.system.WorkerThread$ProcessActiv= ity.ingestDocumentWithException(WorkerThread.java:1548)}}}} > \{{ {{ at org.apache.manifoldcf.crawler.connectors.filesystem.FileConnec= tor.processDocuments(FileConnector.java:448)}}}} > \{{ {{ at 
org.apache.manifoldcf.crawler.system.WorkerThread.run(WorkerTh= read.java:399)}}}} > Caused by: java.nio.channels.ClosedByInterruptException > \{{ {{ at java.base/java.nio.channels.spi.AbstractInterruptibleChannel.e= nd(AbstractInterruptibleChannel.java:199)}}}} > \{{ {{ at java.base/sun.nio.ch.FileChannelImpl.size(FileChannelImpl.java= :388)}}}} > \{{ {{ at org.apache.poi.poifs.nio.FileBackedDataSource.size(FileBackedD= ataSource.java:137)}}}} > \{{ {{ at org.apache.poi.poifs.filesystem.NPOIFSFileSystem.getChainLoopD= etector(NPOIFSFileSystem.java:627)}}}} > \{{ {{ at org.apache.poi.poifs.filesystem.NPOIFSStream$StreamBlockByteBu= fferIterator.(NPOIFSStream.java:149)}}}} > \{{ {{ ... 21 more}}}} > [Worker thread '35'] WARN org.apache.tika.parser.microsoft.AbstractPOIFS= Extractor - Ignoring unexpected exception while parsing summary entry Docum= entSummaryInformation > java.lang.RuntimeException: java.nio.channels.ClosedChannelException > \{{ {{ at org.apache.poi.poifs.filesystem.NPOIFSStream$StreamBlockByteBu= fferIterator.(NPOIFSStream.java:151)}}}} > \{{ {{ at org.apache.poi.poifs.filesystem.NPOIFSStream.getBlockIterator(= NPOIFSStream.java:95)}}}} > \{{ {{ at org.apache.poi.poifs.filesystem.NPOIFSMiniStore.getBlockAt(NPO= IFSMiniStore.java:67)}}}} > \{{ {{ at org.apache.poi.poifs.filesystem.NPOIFSStream$StreamBlockByteBu= fferIterator.next(NPOIFSStream.java:169)}}}} > \{{ {{ at org.apache.poi.poifs.filesystem.NPOIFSStream$StreamBlockByteBu= fferIterator.next(NPOIFSStream.java:142)}}}} > \{{ {{ at org.apache.poi.poifs.filesystem.NDocumentInputStream.readFully= (NDocumentInputStream.java:264)}}}} > \{{ {{ at org.apache.poi.poifs.filesystem.NDocumentInputStream.read(NDoc= umentInputStream.java:162)}}}} > \{{ {{ at org.apache.poi.poifs.filesystem.DocumentInputStream.read(Docum= entInputStream.java:127)}}}} > \{{ {{ at org.apache.poi.util.BoundedInputStream.read(BoundedInputStream= .java:121)}}}} > \{{ {{ at 
org.apache.poi.util.BoundedInputStream.read(BoundedInputStream= .java:103)}}}} > \{{ {{ at org.apache.poi.util.IOUtils.copy(IOUtils.java:312)}}}} > \{{ {{ at org.apache.poi.util.IOUtils.peekFirstNBytes(IOUtils.java:70)}}= }} > \{{ {{ at org.apache.poi.hpsf.PropertySet.isPropertySetStream(PropertySe= t.java:393)}}}} > \{{ {{ at org.apache.poi.hpsf.PropertySet.(PropertySet.java:191)}}= }} > \{{ {{ at org.apache.tika.parser.microsoft.SummaryExtractor.parseSummary= EntryIfExists(SummaryExtractor.java:83)}}}} > \{{ {{ at org.apache.tika.parser.microsoft.SummaryExtractor.parseSummari= es(SummaryExtractor.java:74)}}}} > \{{ {{ at org.apache.tika.parser.microsoft.OfficeParser.parse(OfficePars= er.java:156)}}}} > \{{ {{ at org.apache.tika.parser.microsoft.OfficeParser.parse(OfficePars= er.java:132)}}}} > \{{ {{ at org.apache.tika.parser.CompositeParser.parse(CompositeParser.j= ava:280)}}}} > \{{ {{ at org.apache.tika.parser.CompositeParser.parse(CompositeParser.j= ava:280)}}}} > \{{ {{ at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser= .java:143)}}}} > \{{ {{ at org.apache.manifoldcf.agents.transformation.tika.TikaParser.pa= rse(TikaParser.java:74)}}}} > \{{ {{ at org.apache.manifoldcf.agents.transformation.tika.TikaExtractor= .addOrReplaceDocumentWithException(TikaExtractor.java:235)}}}} > \{{ {{ at org.apache.manifoldcf.agents.incrementalingest.IncrementalInge= ster$PipelineAddEntryPoint.addOrReplaceDocumentWithException(IncrementalIng= ester.java:3226)}}}} > \{{ {{ at org.apache.manifoldcf.agents.incrementalingest.IncrementalInge= ster$PipelineAddFanout.sendDocument(IncrementalIngester.java:3077)}}}} > \{{ {{ at org.apache.manifoldcf.agents.incrementalingest.IncrementalInge= ster$PipelineObjectWithVersions.addOrReplaceDocumentWithException(Increment= alIngester.java:2708)}}}} > \{{ {{ at org.apache.manifoldcf.agents.incrementalingest.IncrementalInge= ster.documentIngest(IncrementalIngester.java:756)}}}} > \{{ {{ at 
org.apache.manifoldcf.crawler.system.WorkerThread$ProcessActiv= ity.ingestDocumentWithException(WorkerThread.java:1583)}}}} > \{{ {{ at org.apache.manifoldcf.crawler.system.WorkerThread$ProcessActiv= ity.ingestDocumentWithException(WorkerThread.java:1548)}}}} > \{{ {{ at org.apache.manifoldcf.crawler.connectors.filesystem.FileConnec= tor.processDocuments(FileConnector.java:448)}}}} > \{{ {{ at org.apache.manifoldcf.crawler.system.WorkerThread.run(WorkerTh= read.java:399)}}}} > Caused by: java.nio.channels.ClosedChannelException > \{{ {{ at java.base/sun.nio.ch.FileChannelImpl.ensureOpen(FileChannelImp= l.java:158)}}}} > \{{ {{ at java.base/sun.nio.ch.FileChannelImpl.size(FileChannelImpl.java= :373)}}}} > \{{ {{ at org.apache.poi.poifs.nio.FileBackedDataSource.size(FileBackedD= ataSource.java:137)}}}} > \{{ {{ at org.apache.poi.poifs.filesystem.NPOIFSFileSystem.getChainLoopD= etector(NPOIFSFileSystem.java:627)}}}} > \{{ {{ at org.apache.poi.poifs.filesystem.NPOIFSStream$StreamBlockByteBu= fferIterator.(NPOIFSStream.java:149)}}}} > \{{ {{ ... 
30 more}}}} ```}}
> {{Following up: When these exceptions occur, the heap runs out:
> 13:39:39.856 [Worker thread '49'] WARN org.apache.manifoldcf.jobs - Service interruption reported for job 1532551209410 connection 'file': IO exception: null
> 13:39:39.970 [Worker thread '43'] WARN org.apache.manifoldcf.jobs - Service interruption reported for job 1532551209410 connection 'file': IO exception: null
> 13:39:40.415 [Worker thread '34'] WARN org.apache.manifoldcf.jobs - Service interruption reported for job 1532551209410 connection 'file': IO exception: null
> 13:39:40.469 [Worker thread '1'] WARN org.apache.manifoldcf.jobs - Service interruption reported for job 1532551209410 connection 'file': IO exception: null
> 13:39:43.739 [Worker thread '32'] WARN org.apache.manifoldcf.jobs - Service interruption reported for job 1532551209410 connection 'file': IO exception: null
> 13:39:44.697 [Worker thread '43'] WARN org.apache.manifoldcf.jobs - Service interruption reported for job 1532551209410 connection 'file': IO exception: null
> 13:39:45.756 [Worker thread '33'] WARN org.apache.manifoldcf.jobs - Service interruption reported for job 1532551209410 connection 'file': IO exception: null
> 13:39:45.775 [Worker thread '36'] WARN org.apache.manifoldcf.jobs - Service interruption reported for job 1532551209410 connection 'file': IO exception: null
> 13:39:46.751 [Worker thread '35'] WARN org.apache.manifoldcf.jobs - Service interruption reported for job 1532551209410 connection 'file': IO exception: null
> 13:39:46.753 [Worker thread '40'] WARN org.apache.manifoldcf.jobs - Service interruption reported for job 1532551209410 connection 'file': IO exception: null
> 13:39:47.536 [Worker thread '45'] WARN org.apache.manifoldcf.jobs - Service interruption reported for job 1532551209410 connection 'file': IO exception: null
> 13:39:48.734 [Worker thread '44'] WARN org.apache.manifoldcf.jobs - Service interruption reported for job
1532551209410 connection 'file': IO excep= tion: null > 13:39:50.922 [Worker thread '30'] WARN org.apache.manifoldcf.jobs - Serv= ice interruption reported for job 1532551209410 connection 'file': IO excep= tion: null > 13:39:54.930 [Worker thread '28'] WARN org.apache.manifoldcf.jobs - Serv= ice interruption reported for job 1532551209410 connection 'file': IO excep= tion: null > 13:40:33.660 [Worker thread '29'] WARN org.apache.manifoldcf.jobs - Serv= ice interruption reported for job 1532551209410 connection 'file': IO excep= tion: null > agents process ran out of memory - shutting down > java.lang.OutOfMemoryError: Java heap space > \{{ at java.base/java.lang.StringLatin1.newString(StringLatin1.java:549)= }} > \{{ at java.base/java.lang.StringBuilder.toString(StringBuilder.java:415= )}} > \{{ at de.l3s.boilerpipe.sax.BoilerpipeHTMLContentHandler.flushBlock(Boi= lerpipeHTMLContentHandler.java:341)}} > \{{ at de.l3s.boilerpipe.sax.BoilerpipeHTMLContentHandler.characters(Boi= lerpipeHTMLContentHandler.java:198)}} > \{{ at org.apache.tika.parser.html.BoilerpipeContentHandler.characters(B= oilerpipeContentHandler.java:155)}} > \{{ at org.apache.tika.sax.ContentHandlerDecorator.characters(ContentHan= dlerDecorator.java:146)}} > \{{ at org.apache.tika.sax.SecureContentHandler.characters(SecureContent= Handler.java:270)}} > \{{ at org.apache.tika.sax.ContentHandlerDecorator.characters(ContentHan= dlerDecorator.java:146)}} > \{{ at org.apache.tika.sax.ContentHandlerDecorator.characters(ContentHan= dlerDecorator.java:146)}} > \{{ at org.apache.tika.sax.ContentHandlerDecorator.characters(ContentHan= dlerDecorator.java:146)}} > \{{ at org.apache.tika.sax.SafeContentHandler.access$001(SafeContentHand= ler.java:46)}} > \{{ at org.apache.tika.sax.SafeContentHandler$1.write(SafeContentHandler= .java:82)}} > \{{ at org.apache.tika.sax.SafeContentHandler.filter(SafeContentHandler.= java:140)}} > \{{ at org.apache.tika.sax.SafeContentHandler.characters(SafeContentHand= 
ler.java:287)}} > \{{ at org.apache.tika.sax.XHTMLContentHandler.characters(XHTMLContentHa= ndler.java:279)}} > \{{ at org.apache.tika.sax.XHTMLContentHandler.characters(XHTMLContentHa= ndler.java:306)}} > \{{ at org.apache.tika.parser.microsoft.TextCell.render(TextCell.java:34= )}} > \{{ at org.apache.tika.parser.microsoft.ExcelExtractor$TikaHSSFListener.= processSheet(ExcelExtractor.java:609)}} > \{{ at org.apache.tika.parser.microsoft.ExcelExtractor$TikaHSSFListener.= internalProcessRecord(ExcelExtractor.java:392)}} > \{{ at org.apache.tika.parser.microsoft.ExcelExtractor$TikaHSSFListener.= processRecord(ExcelExtractor.java:343)}} > \{{ at org.apache.poi.hssf.eventusermodel.FormatTrackingHSSFListener.pro= cessRecord(FormatTrackingHSSFListener.java:92)}} > \{{ at org.apache.poi.hssf.eventusermodel.HSSFRequest.processRecord(HSSF= Request.java:109)}} > \{{ at org.apache.poi.hssf.eventusermodel.HSSFEventFactory.genericProces= sEvents(HSSFEventFactory.java:179)}} > \{{ at org.apache.poi.hssf.eventusermodel.HSSFEventFactory.processEvents= (HSSFEventFactory.java:136)}} > \{{ at org.apache.tika.parser.microsoft.ExcelExtractor$TikaHSSFListener.= processFile(ExcelExtractor.java:319)}} > \{{ at org.apache.tika.parser.microsoft.ExcelExtractor.parse(ExcelExtrac= tor.java:170)}} > \{{ at org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.= java:184)}} > \{{ at org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.= java:132)}} > \{{ at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java= :280)}} > \{{ at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java= :280)}} > \{{ at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.ja= va:143)}} > \{{ at org.apache.manifoldcf.agents.transformation.tika.TikaParser.parse= (TikaParser.java:74)}} > agents process ran out of memory - shutting down > java.lang.OutOfMemoryError: Java heap space > \{{ at java.base/java.util.Arrays.copyOf(Arrays.java:3744)}} > \{{ at 
java.base/java.lang.AbstractStringBuilder.ensureCapacityInternal(= AbstractStringBuilder.java:146)}} > \{{ at java.base/java.lang.AbstractStringBuilder.append(AbstractStringBu= ilder.java:531)}} > \{{ at java.base/java.lang.AbstractStringBuilder.append(AbstractStringBu= ilder.java:550)}} > \{{ at java.base/java.lang.StringBuilder.append(StringBuilder.java:171)}= } > \{{ at java.base/java.util.regex.Matcher.appendReplacement(Matcher.java:= 1002)}} > \{{ at java.base/java.util.regex.Matcher.replaceAll(Matcher.java:1181)}} > \{{ at de.l3s.boilerpipe.util.UnicodeTokenizer.tokenize(UnicodeTokenizer= .java:40)}} > \{{ at de.l3s.boilerpipe.sax.BoilerpipeHTMLContentHandler.flushBlock(Boi= lerpipeHTMLContentHandler.java:296)}} > \{{ at de.l3s.boilerpipe.sax.CommonTagActions$3.end(CommonTagActions.jav= a:143)}} > \{{ at de.l3s.boilerpipe.sax.BoilerpipeHTMLContentHandler.endElement(Boi= lerpipeHTMLContentHandler.java:183)}} > \{{ at org.apache.tika.parser.html.BoilerpipeContentHandler.endElement(B= oilerpipeContentHandler.java:175)}} > \{{ at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHan= dlerDecorator.java:136)}} > \{{ at org.apache.tika.sax.SecureContentHandler.endElement(SecureContent= Handler.java:256)}} > \{{ at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHan= dlerDecorator.java:136)}} > \{{ at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHan= dlerDecorator.java:136)}} > \{{ at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHan= dlerDecorator.java:136)}} > \{{ at org.apache.tika.sax.SafeContentHandler.endElement(SafeContentHand= ler.java:273)}} > \{{ at org.apache.tika.sax.XHTMLContentHandler.endDocument(XHTMLContentH= andler.java:224)}} > \{{ at org.apache.tika.parser.txt.TXTParser.parse(TXTParser.java:109)}} > \{{ at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java= :280)}} > \{{ at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java= :280)}} > \{{ at 
org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:143)}}
> \{{ at org.apache.manifoldcf.agents.transformation.tika.TikaParser.parse(TikaParser.java:74)}}
> \{{ at org.apache.manifoldcf.agents.transformation.tika.TikaExtractor.addOrReplaceDocumentWithException(TikaExtractor.java:235)}}
> \{{ at org.apache.manifoldcf.agents.incrementalingest.IncrementalIngester$PipelineAddEntryPoint.addOrReplaceDocumentWithException(IncrementalIngester.java:3226)}}
> \{{ at org.apache.manifoldcf.agents.incrementalingest.IncrementalIngester$PipelineAddFanout.sendDocument(IncrementalIngester.java:3077)}}
> \{{ at org.apache.manifoldcf.agents.incrementalingest.IncrementalIngester$PipelineObjectWithVersions.addOrReplaceDocumentWithException(IncrementalIngester.java:2708)}}
> \{{ at org.apache.manifoldcf.agents.incrementalingest.IncrementalIngester.documentIngest(IncrementalIngester.java:756)}}
> \{{ at org.apache.manifoldcf.crawler.system.WorkerThread$ProcessActivity.ingestDocumentWithException(WorkerThread.java:1583)}}
> \{{ at org.apache.manifoldcf.crawler.system.WorkerThread$ProcessActivity.ingestDocumentWithException(WorkerThread.java:1548)}}
> \{{ at org.apache.manifoldcf.crawler.connectors.filesystem.FileConnector.processDocuments(FileConnector.java:448)}}
> 13:40:33.995 [Worker thread '42'] WARN org.apache.manifoldcf.jobs - Service interruption reported for job 1532551209410 connection 'file': IO exception: null
> [Thread-475] INFO org.eclipse.jetty.server.ServerConnector - Stopped ServerConnector@5d235104\{HTTP/1.1}{0.0.0.0:8345}
> {{[Thread-475] INFO org.eclipse.jetty.server.handler.ContextHandler - Stopped o.e.j.w.WebAppContext@6105f8a3{/mcf-api-service,[file:/tmp/jetty-0.0.0.0-8345-mcf-api-service.war-_mcf-api-service-any-9896962439762567079.dir/webapp/,UNAVAILABLE|file:///tmp/jetty-0.0.0.0-8345-mcf-api-service.war-_mcf-api-service-any-9896962439762567079.dir/webapp/,UNAVAILABLE]}\{/opt/manifoldcf/manifoldcf_single/././web/war/mcf-api-service.war}
> }}
> {{[Thread-475] INFO org.eclipse.jetty.server.handler.ContextHandler - Stopped o.e.j.w.WebAppContext@12365c88{/mcf-authority-service,[file:/tmp/jetty-0.0.0.0-8345-mcf-authority-service.war-_mcf-authority-service-any-3954308360064638561.dir/webapp/,UNAVAILABLE|file:///tmp/jetty-0.0.0.0-8345-mcf-authority-service.war-_mcf-authority-service-any-3954308360064638561.dir/webapp/,UNAVAILABLE]}
> {/opt/manifoldcf/manifoldcf_single/././web/war/mcf-authority-service.war}
> }}
>
> Follow-up: When these issues occur, the JVM runs out of heap space:
> 13:39:39.856 [Worker thread '49'] WARN org.apache.manifoldcf.jobs - Service interruption reported for job 1532551209410 connection 'file': IO exception: null
> 13:39:39.970 [Worker thread '43'] WARN org.apache.manifoldcf.jobs - Service interruption reported for job 1532551209410 connection 'file': IO exception: null
> 13:39:40.415 [Worker thread '34'] WARN org.apache.manifoldcf.jobs - Service interruption reported for job 1532551209410 connection 'file': IO exception: null
> 13:39:40.469 [Worker thread '1'] WARN org.apache.manifoldcf.jobs - Service interruption reported for job 1532551209410 connection 'file': IO exception: null
> 13:39:43.739 [Worker thread '32'] WARN org.apache.manifoldcf.jobs - Service interruption reported for job 1532551209410 connection 'file': IO exception: null
> 13:39:44.697 [Worker thread '43'] WARN org.apache.manifoldcf.jobs - Service interruption reported for job 1532551209410 connection 'file': IO exception: null
> 13:39:45.756 [Worker thread '33'] WARN org.apache.manifoldcf.jobs - Service interruption reported for job 1532551209410 connection 'file': IO exception: null
> 13:39:45.775 [Worker thread '36'] WARN org.apache.manifoldcf.jobs - Service interruption reported for job 1532551209410 connection 'file': IO exception: null
> 13:39:46.751 [Worker thread '35'] WARN org.apache.manifoldcf.jobs -
Servi= ce interruption reported for job 1532551209410 connection 'file': IO except= ion: null > 13:39:46.753 [Worker thread '40'] WARN org.apache.manifoldcf.jobs - Servi= ce interruption reported for job 1532551209410 connection 'file': IO except= ion: null > 13:39:47.536 [Worker thread '45'] WARN org.apache.manifoldcf.jobs - Servi= ce interruption reported for job 1532551209410 connection 'file': IO except= ion: null > 13:39:48.734 [Worker thread '44'] WARN org.apache.manifoldcf.jobs - Servi= ce interruption reported for job 1532551209410 connection 'file': IO except= ion: null > 13:39:50.922 [Worker thread '30'] WARN org.apache.manifoldcf.jobs - Servi= ce interruption reported for job 1532551209410 connection 'file': IO except= ion: null > 13:39:54.930 [Worker thread '28'] WARN org.apache.manifoldcf.jobs - Servi= ce interruption reported for job 1532551209410 connection 'file': IO except= ion: null > 13:40:33.660 [Worker thread '29'] WARN org.apache.manifoldcf.jobs - Servi= ce interruption reported for job 1532551209410 connection 'file': IO except= ion: null > agents process ran out of memory - shutting down > java.lang.OutOfMemoryError: Java heap space > at java.base/java.lang.StringLatin1.newString(StringLatin1.java:549) > at java.base/java.lang.StringBuilder.toString(StringBuilder.java:415) > at de.l3s.boilerpipe.sax.BoilerpipeHTMLContentHandler.flushBlock(Boilerp= ipeHTMLContentHandler.java:341) > at de.l3s.boilerpipe.sax.BoilerpipeHTMLContentHandler.characters(Boilerp= ipeHTMLContentHandler.java:198) > at org.apache.tika.parser.html.BoilerpipeContentHandler.characters(Boile= rpipeContentHandler.java:155) > at org.apache.tika.sax.ContentHandlerDecorator.characters(ContentHandler= Decorator.java:146) > at org.apache.tika.sax.SecureContentHandler.characters(SecureContentHand= ler.java:270) > at org.apache.tika.sax.ContentHandlerDecorator.characters(ContentHandler= Decorator.java:146) > at org.apache.tika.sax.ContentHandlerDecorator.characters(ContentHandler= 
Decorator.java:146) > at org.apache.tika.sax.ContentHandlerDecorator.characters(ContentHandler= Decorator.java:146) > at org.apache.tika.sax.SafeContentHandler.access$001(SafeContentHandler.= java:46) > at org.apache.tika.sax.SafeContentHandler$1.write(SafeContentHandler.jav= a:82) > at org.apache.tika.sax.SafeContentHandler.filter(SafeContentHandler.java= :140) > at org.apache.tika.sax.SafeContentHandler.characters(SafeContentHandler.= java:287) > at org.apache.tika.sax.XHTMLContentHandler.characters(XHTMLContentHandle= r.java:279) > at org.apache.tika.sax.XHTMLContentHandler.characters(XHTMLContentHandle= r.java:306) > at org.apache.tika.parser.microsoft.TextCell.render(TextCell.java:34) > at org.apache.tika.parser.microsoft.ExcelExtractor$TikaHSSFListener.proc= essSheet(ExcelExtractor.java:609) > at org.apache.tika.parser.microsoft.ExcelExtractor$TikaHSSFListener.inte= rnalProcessRecord(ExcelExtractor.java:392) > at org.apache.tika.parser.microsoft.ExcelExtractor$TikaHSSFListener.proc= essRecord(ExcelExtractor.java:343) > at org.apache.poi.hssf.eventusermodel.FormatTrackingHSSFListener.process= Record(FormatTrackingHSSFListener.java:92) > at org.apache.poi.hssf.eventusermodel.HSSFRequest.processRecord(HSSFRequ= est.java:109) > at org.apache.poi.hssf.eventusermodel.HSSFEventFactory.genericProcessEve= nts(HSSFEventFactory.java:179) > at org.apache.poi.hssf.eventusermodel.HSSFEventFactory.processEvents(HSS= FEventFactory.java:136) > at org.apache.tika.parser.microsoft.ExcelExtractor$TikaHSSFListener.proc= essFile(ExcelExtractor.java:319) > at org.apache.tika.parser.microsoft.ExcelExtractor.parse(ExcelExtractor.= java:170) > at org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java= :184) > at org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java= :132) > at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280= ) > at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280= ) > at 
org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:1= 43) > at org.apache.manifoldcf.agents.transformation.tika.TikaParser.parse(Tik= aParser.java:74) > agents process ran out of memory - shutting down > java.lang.OutOfMemoryError: Java heap space > at java.base/java.util.Arrays.copyOf(Arrays.java:3744) > at java.base/java.lang.AbstractStringBuilder.ensureCapacityInternal(Abst= ractStringBuilder.java:146) > at java.base/java.lang.AbstractStringBuilder.append(AbstractStringBuilde= r.java:531) > at java.base/java.lang.AbstractStringBuilder.append(AbstractStringBuilde= r.java:550) > at java.base/java.lang.StringBuilder.append(StringBuilder.java:171) > at java.base/java.util.regex.Matcher.appendReplacement(Matcher.java:1002= ) > at java.base/java.util.regex.Matcher.replaceAll(Matcher.java:1181) > at de.l3s.boilerpipe.util.UnicodeTokenizer.tokenize(UnicodeTokenizer.jav= a:40) > at de.l3s.boilerpipe.sax.BoilerpipeHTMLContentHandler.flushBlock(Boilerp= ipeHTMLContentHandler.java:296) > at de.l3s.boilerpipe.sax.CommonTagActions$3.end(CommonTagActions.java:14= 3) > at de.l3s.boilerpipe.sax.BoilerpipeHTMLContentHandler.endElement(Boilerp= ipeHTMLContentHandler.java:183) > at org.apache.tika.parser.html.BoilerpipeContentHandler.endElement(Boile= rpipeContentHandler.java:175) > at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandler= Decorator.java:136) > at org.apache.tika.sax.SecureContentHandler.endElement(SecureContentHand= ler.java:256) > at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandler= Decorator.java:136) > at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandler= Decorator.java:136) > at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandler= Decorator.java:136) > at org.apache.tika.sax.SafeContentHandler.endElement(SafeContentHandler.= java:273) > at org.apache.tika.sax.XHTMLContentHandler.endDocument(XHTMLContentHandl= er.java:224) > at 
org.apache.tika.parser.txt.TXTParser.parse(TXTParser.java:109) > at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280= ) > at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280= ) > at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:1= 43) > at org.apache.manifoldcf.agents.transformation.tika.TikaParser.parse(Tik= aParser.java:74) > at org.apache.manifoldcf.agents.transformation.tika.TikaExtractor.addOrR= eplaceDocumentWithException(TikaExtractor.java:235) > at org.apache.manifoldcf.agents.incrementalingest.IncrementalIngester$Pi= pelineAddEntryPoint.addOrReplaceDocumentWithException(IncrementalIngester.j= ava:3226) > at org.apache.manifoldcf.agents.incrementalingest.IncrementalIngester$Pi= pelineAddFanout.sendDocument(IncrementalIngester.java:3077) > at org.apache.manifoldcf.agents.incrementalingest.IncrementalIngester$Pi= pelineObjectWithVersions.addOrReplaceDocumentWithException(IncrementalInges= ter.java:2708) > at org.apache.manifoldcf.agents.incrementalingest.IncrementalIngester.do= cumentIngest(IncrementalIngester.java:756) > at org.apache.manifoldcf.crawler.system.WorkerThread$ProcessActivity.ing= estDocumentWithException(WorkerThread.java:1583) > at org.apache.manifoldcf.crawler.system.WorkerThread$ProcessActivity.ing= estDocumentWithException(WorkerThread.java:1548) > at org.apache.manifoldcf.crawler.connectors.filesystem.FileConnector.pro= cessDocuments(FileConnector.java:448) > 13:40:33.995 [Worker thread '42'] WARN org.apache.manifoldcf.jobs - Servi= ce interruption reported for job 1532551209410 connection 'file': IO except= ion: null > [Thread-475] INFO org.eclipse.jetty.server.ServerConnector - Stopped Serv= erConnector@5d235104\{HTTP/1.1}{0.0.0.0:8345} > [Thread-475] INFO org.eclipse.jetty.server.handler.ContextHandler - Stopp= ed o.e.j.w.WebAppContext@6105f8a3\{/mcf-api-service,file:/tmp/jetty-0.0.0.0= -8345-mcf-api-service.war-_mcf-api-service-any-9896962439762567079.dir/weba= 
pp/,UNAVAILABLE}{/opt/manifoldcf/manifoldcf_single/././web/war/mcf-api-serv= ice.war} > [Thread-475] INFO org.eclipse.jetty.server.handler.ContextHandler - Stopp= ed o.e.j.w.WebAppContext@12365c88\{/mcf-authority-service,file:/tmp/jetty-0= .0.0.0-8345-mcf-authority-service.war-_mcf-authority-service-any-3954308360= 064638561.dir/webapp/,UNAVAILABLE}{/opt/manifoldcf/manifoldcf_single/././we= b/war/mcf-authority-service.war}=C2=A0 -- This message was sent by Atlassian JIRA (v7.6.3#76005)