manifoldcf-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Konstantin Avdeev (JIRA)" <j...@apache.org>
Subject [jira] [Created] (CONNECTORS-1311) Dependencies issues
Date Wed, 04 May 2016 20:39:13 GMT
Konstantin Avdeev created CONNECTORS-1311:
---------------------------------------------

             Summary: Dependencies issues
                 Key: CONNECTORS-1311
                 URL: https://issues.apache.org/jira/browse/CONNECTORS-1311
             Project: ManifoldCF
          Issue Type: Bug
          Components: Build
    Affects Versions: ManifoldCF 2.5
         Environment: any
            Reporter: Konstantin Avdeev


There are several issues with the dependencies:

1) POI should be 3.13, since tika 1.12 uses that version. With POI 3.14 tika cannot parse
presentation files (ppt):
{code}
FATAL 2016-05-03 10:39:16,821 (Worker thread '0') - Error tossed: org.apache.poi.xslf.usermodel.XSLFTextShape.getTextType()Lorg/apache/poi/xslf/usermodel/Placeholder;
java.lang.NoSuchMethodError: org.apache.poi.xslf.usermodel.XSLFTextShape.getTextType()Lorg/apache/poi/xslf/usermodel/Placeholder;
	at org.apache.tika.parser.microsoft.ooxml.XSLFPowerPointExtractorDecorator.extractContent(XSLFPowerPointExtractorDecorator.java:154)
	at org.apache.tika.parser.microsoft.ooxml.XSLFPowerPointExtractorDecorator.buildXHTML(XSLFPowerPointExtractorDecorator.java:88)
	at org.apache.tika.parser.microsoft.ooxml.AbstractOOXMLExtractor.getXHTML(AbstractOOXMLExtractor.java:110)
	at org.apache.tika.parser.microsoft.ooxml.OOXMLExtractorFactory.parse(OOXMLExtractorFactory.java:112)
	at org.apache.tika.parser.microsoft.ooxml.OOXMLParser.parse(OOXMLParser.java:87)
	at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280)
	at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280)
	at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:120)
	at org.apache.manifoldcf.agents.transformation.tika.TikaParser.parse(TikaParser.java:48)
{code}

2) jcifs "1.3.17" is used currently. Available is "1.3.18".

3) Java Advanced Imaging (JAI), jbig2 format libs are not included, but required for parsing
embedded images.

Thank you!



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message