any23-dev mailing list archives

From "Lewis John McGibbney (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (ANY23-108) Broken schema.org microdata extraction
Date Mon, 14 Jan 2013 01:38:12 GMT

    [ https://issues.apache.org/jira/browse/ANY23-108?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13552378#comment-13552378 ]

Lewis John McGibbney commented on ANY23-108:
--------------------------------------------

For reference, this is the stack trace:

Internal error.
================================================================
java.lang.IllegalArgumentException: Invalid content ''
	at org.apache.any23.extractor.microdata.ItemPropValue.<init>(ItemPropValue.java:89)
	at org.apache.any23.extractor.microdata.MicrodataParser.getPropertyValue(MicrodataParser.java:310)
	at org.apache.any23.extractor.microdata.MicrodataParser.getItemProps(MicrodataParser.java:394)
	at org.apache.any23.extractor.microdata.MicrodataParser.getItemScope(MicrodataParser.java:471)
	at org.apache.any23.extractor.microdata.MicrodataParser.getMicrodata(MicrodataParser.java:186)
	at org.apache.any23.extractor.microdata.MicrodataParser.getMicrodata(MicrodataParser.java:203)
	at org.apache.any23.extractor.microdata.MicrodataExtractor.run(MicrodataExtractor.java:100)
	at org.apache.any23.extractor.microdata.MicrodataExtractor.run(MicrodataExtractor.java:62)
	at org.apache.any23.extractor.SingleDocumentExtraction.runExtractor(SingleDocumentExtraction.java:477)
	at org.apache.any23.extractor.SingleDocumentExtraction.run(SingleDocumentExtraction.java:260)
	at org.apache.any23.Any23.extract(Any23.java:294)
	at org.apache.any23.Any23.extract(Any23.java:446)
	at org.apache.any23.servlet.WebResponder.runExtraction(WebResponder.java:113)
	at org.apache.any23.servlet.Servlet.doGet(Servlet.java:74)
	at javax.servlet.http.HttpServlet.service(HttpServlet.java:617)
	at javax.servlet.http.HttpServlet.service(HttpServlet.java:717)
	at org.apache.catalina.core.ApplicationFilterChain.internalDoFilter(ApplicationFilterChain.java:290)
	at org.apache.catalina.core.ApplicationFilterChain.doFilter(ApplicationFilterChain.java:206)
	at org.apache.catalina.core.StandardWrapperValve.invoke(StandardWrapperValve.java:233)
	at org.apache.catalina.core.StandardContextValve.invoke(StandardContextValve.java:191)
	at org.apache.catalina.core.StandardHostValve.invoke(StandardHostValve.java:127)
	at com.googlecode.psiprobe.Tomcat60AgentValve.invoke(Tomcat60AgentValve.java:30)
	at org.apache.catalina.valves.ErrorReportValve.invoke(ErrorReportValve.java:102)
	at org.apache.catalina.core.StandardEngineValve.invoke(StandardEngineValve.java:109)
	at org.apache.catalina.connector.CoyoteAdapter.service(CoyoteAdapter.java:293)
	at org.apache.coyote.http11.Http11Processor.process(Http11Processor.java:859)
	at org.apache.coyote.http11.Http11Protocol$Http11ConnectionHandler.process(Http11Protocol.java:602)
	at org.apache.tomcat.util.net.JIoEndpoint$Worker.run(JIoEndpoint.java:489)
	at java.lang.Thread.run(Thread.java:662)
================================================================

                
> Broken schema.org microdata extraction
> --------------------------------------
>
>                 Key: ANY23-108
>                 URL: https://issues.apache.org/jira/browse/ANY23-108
>             Project: Apache Any23
>          Issue Type: Bug
>          Components: core
>    Affects Versions: 0.7.0
>            Reporter: Michele Barbera
>              Labels: microdata
>
> Extraction from http://any23.org with default settings crashes with an error on
> http://www.sunbrellaweb.it/spiaggia/39/porto-selvaggio/litos.html
> Google's webmaster tools rich snippet test tool works on the same input, see: 
> http://www.google.com/webmasters/tools/richsnippets?url=http%3A%2F%2Fwww.sunbrellaweb.it%2Fspiaggia%2F39%2Fporto-selvaggio%2Flitos.html

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
