manifoldcf-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Konstantin Avdeev (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (CONNECTORS-1311) Dependencies issues
Date Wed, 04 May 2016 21:19:12 GMT

    [ https://issues.apache.org/jira/browse/CONNECTORS-1311?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15271450#comment-15271450
] 

Konstantin Avdeev commented on CONNECTORS-1311:
-----------------------------------------------

Thank you for the quick fix for the first issue!
Would you mind commenting on the others?..

> Dependencies issues
> -------------------
>
>                 Key: CONNECTORS-1311
>                 URL: https://issues.apache.org/jira/browse/CONNECTORS-1311
>             Project: ManifoldCF
>          Issue Type: Bug
>          Components: Build
>    Affects Versions: ManifoldCF 2.5
>         Environment: any
>            Reporter: Konstantin Avdeev
>            Assignee: Karl Wright
>             Fix For: ManifoldCF 2.5
>
>
> There are several issues with the dependencies:
> 1) POI should be 3.13, since tika 1.12 uses that version. With POI 3.14 tika cannot parse
presentation files (ppt):
> {code}
> FATAL 2016-05-03 10:39:16,821 (Worker thread '0') - Error tossed: org.apache.poi.xslf.usermodel.XSLFTextShape.getTextType()Lorg/apache/poi/xslf/usermodel/Placeholder;
> java.lang.NoSuchMethodError: org.apache.poi.xslf.usermodel.XSLFTextShape.getTextType()Lorg/apache/poi/xslf/usermodel/Placeholder;
> 	at org.apache.tika.parser.microsoft.ooxml.XSLFPowerPointExtractorDecorator.extractContent(XSLFPowerPointExtractorDecorator.java:154)
> 	at org.apache.tika.parser.microsoft.ooxml.XSLFPowerPointExtractorDecorator.buildXHTML(XSLFPowerPointExtractorDecorator.java:88)
> 	at org.apache.tika.parser.microsoft.ooxml.AbstractOOXMLExtractor.getXHTML(AbstractOOXMLExtractor.java:110)
> 	at org.apache.tika.parser.microsoft.ooxml.OOXMLExtractorFactory.parse(OOXMLExtractorFactory.java:112)
> 	at org.apache.tika.parser.microsoft.ooxml.OOXMLParser.parse(OOXMLParser.java:87)
> 	at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280)
> 	at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280)
> 	at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:120)
> 	at org.apache.manifoldcf.agents.transformation.tika.TikaParser.parse(TikaParser.java:48)
> {code}
> 2) jcifs "1.3.17" is used currently. Available is "1.3.18".
> 3) Java Advanced Imaging (JAI), jbig2 format libs are not included, but required for
parsing embedded images.
> Thank you!



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message