manifoldcf-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Karl Wright (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (CONNECTORS-1169) Missing Class in Tika parser
Date Thu, 26 Feb 2015 11:04:05 GMT

    [ https://issues.apache.org/jira/browse/CONNECTORS-1169?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14338236#comment-14338236
] 

Karl Wright commented on CONNECTORS-1169:
-----------------------------------------

This should be resolved by rome.jar, which is indeed present in connector-common-lib:

{code}
 Directory of C:\wip\mcf\dev_1x\dist\connector-common-lib

02/22/2015  07:18 PM           219,683 rome-1.0.jar
               1 File(s)        219,683 bytes
               0 Dir(s)  939,984,343,040 bytes free
{code}

The way in which this might fail would be as follows:

(1) You have placed tika.jar and tikaparsers.jar somewhere IN ADDITION TO connector-common-lib;
(2) You have removed connector-common-lib from the properties.xml file, or used an older properties.xml
file which does not contain a reference to connector-common-lib.




> Missing Class in Tika parser
> ----------------------------
>
>                 Key: CONNECTORS-1169
>                 URL: https://issues.apache.org/jira/browse/CONNECTORS-1169
>             Project: ManifoldCF
>          Issue Type: Bug
>          Components: Tika extractor
>    Affects Versions: ManifoldCF 1.8
>            Reporter: Kamil ┼╗yta
>            Assignee: Karl Wright
>
> Hi,
> I have a lot of:
> {code}
> FATAL 2015-02-26 10:38:04,744 (Worker thread '13') - Error tossed: Could not initialize
class com.sun.syndication.feed.synd.SyndFeedImpl
> java.lang.NoClassDefFoundError: Could not initialize class com.sun.syndication.feed.synd.SyndFeedImpl
>         at com.sun.syndication.io.SyndFeedInput.build(SyndFeedInput.java:136)
>         at org.apache.tika.parser.feed.FeedParser.parse(FeedParser.java:70)
>         at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:244)
>         at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:244)
>         at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:121)
>         at org.apache.manifoldcf.agents.transformation.tika.TikaExtractor.addOrReplaceDocumentWithException(TikaExtractor.java:230)
>         at org.apache.manifoldcf.agents.incrementalingest.IncrementalIngester$PipelineAddEntryPoint.addOrReplaceDocumentWithException(IncrementalIngester.java:3274)
>         at org.apache.manifoldcf.agents.incrementalingest.IncrementalIngester$PipelineAddFanout.sendDocument(IncrementalIngester.java:3125)
>         at org.apache.manifoldcf.agents.incrementalingest.IncrementalIngester$PipelineObjectWithVersions.addOrReplaceDocumentWithException(IncrementalIngester.java:2752)
>         at org.apache.manifoldcf.agents.incrementalingest.IncrementalIngester.documentIngest(IncrementalIngester.java:796)
>         at org.apache.manifoldcf.crawler.system.WorkerThread$ProcessActivity.ingestDocumentWithException(WorkerThread.java:1610)
>         at org.apache.manifoldcf.crawler.system.WorkerThread$ProcessActivity.ingestDocumentWithException(WorkerThread.java:1558)
>         at org.apache.manifoldcf.crawler.connectors.sharedrive.SharedDriveConnector.processDocuments(SharedDriveConnector.java:911)
>         at org.apache.manifoldcf.crawler.system.WorkerThread.run(WorkerThread.java:383)
> FATAL 2015-02-26 10:38:04,857 (Worker thread '0') - Error tossed: ucar/nc2/NetcdfFile
> java.lang.NoClassDefFoundError: ucar/nc2/NetcdfFile
>         at org.apache.tika.parser.hdf.HDFParser.parse(HDFParser.java:88)
>         at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:244)
>         at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:244)
>         at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:121)
>         at org.apache.manifoldcf.agents.transformation.tika.TikaExtractor.addOrReplaceDocumentWithException(TikaExtractor.java:230)
>         at org.apache.manifoldcf.agents.incrementalingest.IncrementalIngester$PipelineAddEntryPoint.addOrReplaceDocumentWithException(IncrementalIngester.java:3274)
>         at org.apache.manifoldcf.agents.incrementalingest.IncrementalIngester$PipelineAddFanout.sendDocument(IncrementalIngester.java:3125)
>         at org.apache.manifoldcf.agents.incrementalingest.IncrementalIngester$PipelineObjectWithVersions.addOrReplaceDocumentWithException(IncrementalIngester.java:2752)
>         at org.apache.manifoldcf.agents.incrementalingest.IncrementalIngester.documentIngest(IncrementalIngester.java:796)
>         at org.apache.manifoldcf.crawler.system.WorkerThread$ProcessActivity.ingestDocumentWithException(WorkerThread.java:1610)
>         at org.apache.manifoldcf.crawler.system.WorkerThread$ProcessActivity.ingestDocumentWithException(WorkerThread.java:1558)
>         at org.apache.manifoldcf.crawler.connectors.sharedrive.SharedDriveConnector.processDocuments(SharedDriveConnector.java:911)
>         at org.apache.manifoldcf.crawler.system.WorkerThread.run(WorkerThread.java:383)
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message