tika-dev mailing list archives

From "Richard Eccles (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (TIKA-969) Exception "org.apache.tika.exception.TikaException: Can't read JPEG metadata" / "com.drew.metadata.MetadataException: Tag '34855' cannot be cast to int. It is of type 'class [I'" when indexing some items
Date Fri, 03 Aug 2012 12:09:02 GMT

     [ https://issues.apache.org/jira/browse/TIKA-969?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Richard Eccles updated TIKA-969:
--------------------------------

    Attachment: withoutISOSpeedStuffProperties.jpg
                withISOSpeedProperties.jpg
                withoutISOspeed.jpg
                withISOSpeed.jpg

Of the files above, parsing "withoutISOspeed.jpg" causes the exception below.
If you open the file's Properties, enter an 'ISO Speed' value, and then parse the
file again, the exception is not thrown.


<meta http-equiv="Content-Type" content="text/html; charset=ISO-8859-1"/>
<title>Error 500 org.apache.tika.exception.TikaException: Can't read JPEG metadata

org.apache.solr.common.SolrException: org.apache.tika.exception.TikaException: Can't read JPEG metadata
     at org.apache.solr.handler.extraction.ExtractingDocumentLoader.load(ExtractingDocumentLoader.java:220)
     at org.apache.solr.handler.ContentStreamHandlerBase.handleRequestBody(ContentStreamHandlerBase.java:54)
     at org.apache.solr.handler.RequestHandlerBase.handleRequest(RequestHandlerBase.java:131)
     at org.apache.solr.core.SolrCore.execute(SolrCore.java:1358)
     at org.apache.solr.servlet.SolrDispatchFilter.execute(SolrDispatchFilter.java:341)
     at org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:244)
     at org.mortbay.jetty.servlet.ServletHandler$CachedChain.doFilter(ServletHandler.java:1212)
     at org.mortbay.jetty.servlet.ServletHandler.handle(ServletHandler.java:399)
     at org.mortbay.jetty.security.SecurityHandler.handle(SecurityHandler.java:216)
     at org.mortbay.jetty.servlet.SessionHandler.handle(SessionHandler.java:182)
     at org.mortbay.jetty.handler.ContextHandler.handle(ContextHandler.java:766)
     at org.mortbay.jetty.webapp.WebAppContext.handle(WebAppContext.java:450)
     at org.mortbay.jetty.handler.ContextHandlerCollection.handle(ContextHandlerCollection.java:230)
     at org.mortbay.jetty.handler.HandlerCollection.handle(HandlerCollection.java:114)
     at org.mortbay.jetty.handler.HandlerWrapper.handle(HandlerWrapper.java:152)
     at org.mortbay.jetty.Server.handle(Server.java:326)
     at org.mortbay.jetty.HttpConnection.handleRequest(HttpConnection.java:542)
     at org.mortbay.jetty.HttpConnection$RequestHandler.content(HttpConnection.java:945)
     at org.mortbay.jetty.HttpParser.parseNext(HttpParser.java:756)
     at org.mortbay.jetty.HttpParser.parseAvailable(HttpParser.java:212)
     at org.mortbay.jetty.HttpConnection.handle(HttpConnection.java:404)
     at org.mortbay.io.nio.SelectChannelEndPoint.run(SelectChannelEndPoint.java:410)
     at org.mortbay.thread.QueuedThreadPool$PoolThread.run(QueuedThreadPool.java:582)
Caused by: org.apache.tika.exception.TikaException: Can't read JPEG metadata
     at org.apache.tika.parser.image.ImageMetadataExtractor.parseJpeg(ImageMetadataExtractor.java:94)
     at org.apache.tika.parser.jpeg.JpegParser.parse(JpegParser.java:66)
     at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:197)
     at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:197)
     at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:138)
     at org.apache.solr.handler.extraction.ExtractingDocumentLoader.load(ExtractingDocumentLoader.java:199)
     ... 22 more
Caused by: com.drew.metadata.MetadataException: Tag '34855' cannot be cast to int.  It is of type 'class [I'.
     at com.drew.metadata.Directory.getInt(Unknown Source)
     at com.drew.metadata.exif.ExifDescriptor.getIsoEquivalentDescription(Unknown Source)
     at com.drew.metadata.exif.ExifDescriptor.getDescription(Unknown Source)
     at com.drew.metadata.Directory.getDescription(Unknown Source)
     at com.drew.metadata.Tag.getDescription(Unknown Source)
     at org.apache.tika.parser.image.ImageMetadataExtractor$CopyUnknownFieldsHandler.handle(ImageMetadataExtractor.java:191)
     at org.apache.tika.parser.image.ImageMetadataExtractor.handle(ImageMetadataExtractor.java:133)
     at org.apache.tika.parser.image.ImageMetadataExtractor.handle(ImageMetadataExtractor.java:120)
     at org.apache.tika.parser.image.ImageMetadataExtractor.parseJpeg(ImageMetadataExtractor.java:90)
     ... 27 more
</title>
</head>
<body><h2>HTTP ERROR 500</h2>
<p>Problem accessing /tridion/update/extract. Reason:
<pre>    org.apache.tika.exception.TikaException: Can't read JPEG metadata

org.apache.solr.common.SolrException: org.apache.tika.exception.TikaException: Can't read JPEG metadata
     at org.apache.solr.handler.extraction.ExtractingDocumentLoader.load(ExtractingDocumentLoader.java:220)
     at org.apache.solr.handler.ContentStreamHandlerBase.handleRequestBody(ContentStreamHandlerBase.java:54)
     at org.apache.solr.handler.RequestHandlerBase.handleRequest(RequestHandlerBase.java:131)
     at org.apache.solr.core.SolrCore.execute(SolrCore.java:1358)
     at org.apache.solr.servlet.SolrDispatchFilter.execute(SolrDispatchFilter.java:341)
     at org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:244)
     at org.mortbay.jetty.servlet.ServletHandler$CachedChain.doFilter(ServletHandler.java:1212)
     at org.mortbay.jetty.servlet.ServletHandler.handle(ServletHandler.java:399)
     at org.mortbay.jetty.security.SecurityHandler.handle(SecurityHandler.java:216)
     at org.mortbay.jetty.servlet.SessionHandler.handle(SessionHandler.java:182)
     at org.mortbay.jetty.handler.ContextHandler.handle(ContextHandler.java:766)
     at org.mortbay.jetty.webapp.WebAppContext.handle(WebAppContext.java:450)
     at org.mortbay.jetty.handler.ContextHandlerCollection.handle(ContextHandlerCollection.java:230)
     at org.mortbay.jetty.handler.HandlerCollection.handle(HandlerCollection.java:114)
     at org.mortbay.jetty.handler.HandlerWrapper.handle(HandlerWrapper.java:152)
     at org.mortbay.jetty.Server.handle(Server.java:326)
     at org.mortbay.jetty.HttpConnection.handleRequest(HttpConnection.java:542)
     at org.mortbay.jetty.HttpConnection$RequestHandler.content(HttpConnection.java:945)
     at org.mortbay.jetty.HttpParser.parseNext(HttpParser.java:756)
     at org.mortbay.jetty.HttpParser.parseAvailable(HttpParser.java:212)
     at org.mortbay.jetty.HttpConnection.handle(HttpConnection.java:404)
     at org.mortbay.io.nio.SelectChannelEndPoint.run(SelectChannelEndPoint.java:410)
     at org.mortbay.thread.QueuedThreadPool$PoolThread.run(QueuedThreadPool.java:582)
Caused by: org.apache.tika.exception.TikaException: Can't read JPEG metadata
     at org.apache.tika.parser.image.ImageMetadataExtractor.parseJpeg(ImageMetadataExtractor.java:94)
     at org.apache.tika.parser.jpeg.JpegParser.parse(JpegParser.java:66)
     at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:197)
     at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:197)
     at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:138)
     at org.apache.solr.handler.extraction.ExtractingDocumentLoader.load(ExtractingDocumentLoader.java:199)
     ... 22 more
Caused by: com.drew.metadata.MetadataException: Tag '34855' cannot be cast to int.  It is of type 'class [I'.
     at com.drew.metadata.Directory.getInt(Unknown Source)
     at com.drew.metadata.exif.ExifDescriptor.getIsoEquivalentDescription(Unknown Source)
     at com.drew.metadata.exif.ExifDescriptor.getDescription(Unknown Source)
     at com.drew.metadata.Directory.getDescription(Unknown Source)
     at com.drew.metadata.Tag.getDescription(Unknown Source)
     at org.apache.tika.parser.image.ImageMetadataExtractor$CopyUnknownFieldsHandler.handle(ImageMetadataExtractor.java:191)
     at org.apache.tika.parser.image.ImageMetadataExtractor.handle(ImageMetadataExtractor.java:133)
     at org.apache.tika.parser.image.ImageMetadataExtractor.handle(ImageMetadataExtractor.java:120)
     at org.apache.tika.parser.image.ImageMetadataExtractor.parseJpeg(ImageMetadataExtractor.java:90)
     ... 27 more
</pre></p><hr /><i><small>Powered by Jetty://</small></i><br/>
                                               

</body>
</html>
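
For anyone reading the trace: `[I` is the JVM's name for `int[]`, so the message says the value held for tag 34855 is an integer array while the descriptor asked for a single int. Below is a minimal sketch of that mismatch, together with one defensive way a caller might describe such a value instead of failing the whole document. It uses a plain `Map` as a hypothetical stand-in for the metadata directory; `IsoTagSketch`, `strictInt` and `describe` are illustrative names only, not metadata-extractor or Tika APIs, and this is not presented as the fix for TIKA-969.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical stand-in for a metadata directory: tag id -> decoded value.
// None of these names are metadata-extractor or Tika APIs.
final class IsoTagSketch {
    static final int TAG_ISO = 34855; // tag id taken from the exception message

    // Strict read: accepts only a single Integer, like the failing getInt call.
    static int strictInt(Map<Integer, Object> tags, int tag) {
        Object value = tags.get(tag);
        if (value instanceof Integer) {
            return (Integer) value;
        }
        Object type = (value == null) ? "null" : value.getClass();
        throw new IllegalStateException(
                "Tag '" + tag + "' cannot be cast to int.  It is of type '" + type + "'.");
    }

    // Tolerant read: renders a single int or an int[] as text, and returns
    // null for anything it cannot describe so the caller can skip that tag.
    static String describe(Map<Integer, Object> tags, int tag) {
        Object value = tags.get(tag);
        if (value instanceof Integer) {
            return value.toString();
        }
        if (value instanceof int[]) {
            StringBuilder sb = new StringBuilder();
            for (int v : (int[]) value) {
                if (sb.length() > 0) sb.append(' ');
                sb.append(v);
            }
            return sb.toString();
        }
        return null;
    }

    public static void main(String[] args) {
        Map<Integer, Object> tags = new HashMap<>();
        tags.put(TAG_ISO, new int[] {100, 200}); // value stored as an array

        try {
            strictInt(tags, TAG_ISO);
        } catch (IllegalStateException e) {
            System.out.println("strict read failed: " + e.getMessage());
        }
        System.out.println("tolerant read: " + describe(tags, TAG_ISO));
    }
}
```

The design point is only that a single undescribable tag can be skipped or rendered loosely, so that it does not have to surface as "Can't read JPEG metadata" for the entire file.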

                
> Exception "org.apache.tika.exception.TikaException: Can't read JPEG metadata" / "com.drew.metadata.MetadataException: Tag '34855' cannot be cast to int.  It is of type 'class [I'" when indexing some items
> ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
>
>                 Key: TIKA-969
>                 URL: https://issues.apache.org/jira/browse/TIKA-969
>             Project: Tika
>          Issue Type: Bug
>          Components: parser
>    Affects Versions: 0.9
>            Reporter: Richard Eccles
>         Attachments: withISOSpeed.jpg, withISOSpeedProperties.jpg, withoutISOSpeedStuffProperties.jpg, withoutISOspeed.jpg
>
>


--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        
