incubator-any23-dev mailing list archives

From "Lewis John McGibbney (Updated) (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (ANY23-44) error when parsing a document from http://www.afdsi.org/docs/test/html/RDFa/_food-stream_.htm
Date Fri, 17 Feb 2012 16:14:03 GMT

     [ https://issues.apache.org/jira/browse/ANY23-44?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Lewis John McGibbney updated ANY23-44:
--------------------------------------

    Affects Version/s: 0.7.0
        Fix Version/s: 0.8.0
    
> error when parsing a document from http://www.afdsi.org/docs/test/html/RDFa/_food-stream_.htm
> -----------------------------------------------------------------------------------------------
>
>                 Key: ANY23-44
>                 URL: https://issues.apache.org/jira/browse/ANY23-44
>             Project: Apache Any23
>          Issue Type: Bug
>    Affects Versions: 0.7.0
>         Environment: http://any23.org website
>            Reporter: Szymon Danielczyk
>             Fix For: 0.8.0
>
>         Attachments: doc1.htm
>
>
> This document causes the following exception:
> http://any23.org/any23/?format=best&uri=http%3A%2F%2Fwww.afdsi.org%2Fdocs%2Ftest%2Fhtml%2FRDFa%2F_food-stream_.htm
> {code}
> Could not parse input.
> ================================================================
> org.deri.any23.extractor.ExtractionException: An error occurred during the XSLT application.
> 	at org.deri.any23.extractor.rdfa.RDFaExtractor.run(RDFaExtractor.java:138)
> 	at org.deri.any23.extractor.rdfa.RDFaExtractor.run(RDFaExtractor.java:49)
> 	at org.deri.any23.extractor.SingleDocumentExtraction.runExtractor(SingleDocumentExtraction.java:422)
> 	at org.deri.any23.extractor.SingleDocumentExtraction.run(SingleDocumentExtraction.java:212)
> 	at org.deri.any23.Any23.extract(Any23.java:279)
> 	at org.deri.any23.Any23.extract(Any23.java:441)
> 	at org.deri.any23.servlet.WebResponder.runExtraction(WebResponder.java:98)
> 	at org.deri.any23.servlet.Servlet.doGet(Servlet.java:71)
> 	at javax.servlet.http.HttpServlet.service(HttpServlet.java:617)
> 	at javax.servlet.http.HttpServlet.service(HttpServlet.java:717)
> 	at org.apache.catalina.core.ApplicationFilterChain.internalDoFilter(ApplicationFilterChain.java:290)
> 	at org.apache.catalina.core.ApplicationFilterChain.doFilter(ApplicationFilterChain.java:206)
> 	at org.apache.catalina.core.StandardWrapperValve.invoke(StandardWrapperValve.java:233)
> 	at org.apache.catalina.core.StandardContextValve.invoke(StandardContextValve.java:191)
> 	at org.apache.catalina.core.StandardHostValve.invoke(StandardHostValve.java:127)
> 	at com.googlecode.psiprobe.Tomcat60AgentValve.invoke(Tomcat60AgentValve.java:30)
> 	at org.apache.catalina.valves.ErrorReportValve.invoke(ErrorReportValve.java:102)
> 	at org.apache.catalina.core.StandardEngineValve.invoke(StandardEngineValve.java:109)
> 	at org.apache.catalina.connector.CoyoteAdapter.service(CoyoteAdapter.java:298)
> 	at org.apache.coyote.http11.Http11Processor.process(Http11Processor.java:857)
> 	at org.apache.coyote.http11.Http11Protocol$Http11ConnectionHandler.process(Http11Protocol.java:588)
> 	at org.apache.tomcat.util.net.JIoEndpoint$Worker.run(JIoEndpoint.java:489)
> 	at java.lang.Thread.run(Thread.java:662)
> Caused by: org.deri.any23.extractor.rdfa.XSLTStylesheetException:  An error occurred during the XSLT transformation
> 	at org.deri.any23.extractor.rdfa.XSLTStylesheet.applyTo(XSLTStylesheet.java:93)
> 	at org.deri.any23.extractor.rdfa.XSLTStylesheet.applyTo(XSLTStylesheet.java:63)
> 	at org.deri.any23.extractor.rdfa.RDFaExtractor.run(RDFaExtractor.java:136)
> 	... 22 more
> Caused by: javax.xml.transform.TransformerException: java.lang.RuntimeException: An attribute whose value must be a QName had the value 'dct:relation:hasPart'
> 	at com.sun.org.apache.xalan.internal.xsltc.trax.TransformerImpl.transform(TransformerImpl.java:716)
> 	at com.sun.org.apache.xalan.internal.xsltc.trax.TransformerImpl.transform(TransformerImpl.java:313)
> 	at org.deri.any23.extractor.rdfa.XSLTStylesheet.applyTo(XSLTStylesheet.java:79)
> 	... 24 more
> Caused by: java.lang.RuntimeException: An attribute whose value must be a QName had the value 'dct:relation:hasPart'
> 	at com.sun.org.apache.xalan.internal.xsltc.runtime.BasisLibrary.runTimeError(BasisLibrary.java:1523)
> 	at com.sun.org.apache.xalan.internal.xsltc.runtime.BasisLibrary.runTimeError(BasisLibrary.java:1527)
> 	at com.sun.org.apache.xalan.internal.xsltc.runtime.BasisLibrary.checkQName(BasisLibrary.java:1381)
> 	at GregorSamsa.property()
> 	at GregorSamsa.http$colon$$slash$$slash$www$dot$w3$dot$org$slash$1999$slash$XSL$slash$Transform$colon$template$dot$1()
> 	at GregorSamsa.applyTemplates1()
> 	at GregorSamsa.http$colon$$slash$$slash$www$dot$w3$dot$org$slash$1999$slash$XSL$slash$Transform$colon$template$dot$23()
> 	at GregorSamsa.applyTemplates1()
> 	at GregorSamsa.http$colon$$slash$$slash$www$dot$w3$dot$org$slash$1999$slash$XSL$slash$Transform$colon$template$dot$23()
> 	at GregorSamsa.applyTemplates1()
> 	at GregorSamsa.http$colon$$slash$$slash$www$dot$w3$dot$org$slash$1999$slash$XSL$slash$Transform$colon$template$dot$23()
> 	at GregorSamsa.applyTemplates1()
> 	at GregorSamsa.http$colon$$slash$$slash$www$dot$w3$dot$org$slash$1999$slash$XSL$slash$Transform$colon$template$dot$23()
> 	at GregorSamsa.applyTemplates1()
> 	at GregorSamsa.http$colon$$slash$$slash$www$dot$w3$dot$org$slash$1999$slash$XSL$slash$Transform$colon$template$dot$0()
> 	at GregorSamsa.applyTemplates()
> 	at GregorSamsa.transform()
> 	at com.sun.org.apache.xalan.internal.xsltc.runtime.AbstractTranslet.transform(AbstractTranslet.java:603)
> 	at com.sun.org.apache.xalan.internal.xsltc.trax.TransformerImpl.transform(TransformerImpl.java:709)
> 	... 26 more
> ================================================================
> {code}
> As I do not know how to attach a file, I will paste the content of the file below (in case
> it is modified or removed from the web):
> {code}
> <?xml version="1.0" encoding="utf-8"?>
> <!DOCTYPE html PUBLIC "-//W3C//DTD XHTML+RDFa 1.1//EN" "http://www.w3.org/MarkUp/DTD/xhtml-rdfa-2.dtd">
> <html 
>  xmlns:x="http://www.w3.org/1999/xhtml"
>  xmlns:v="http://rdf.data-vocabulary.org/rdf.xml"
>  xmlns:dc="http://purl.org/dc/elements/1.1/"
>  xmlns:dct="http://purl.org/dc/terms/"
>  xmlns:dctype="http://purl.org/dc/dcmitype/"
>  xmlns:agmes="http://purl.org/agmes/1.1/"
>  version="XHTML+RDFa 1.1"
>  xml:lang="en"
> >
> <head>
> <meta http-equiv="Content-Script-Type" content="text/javascript" />
> <meta http-equiv="Content-Type" content="application/xhtml+xml; charset=utf-8" />
> <link rel="stylesheet" type="text/css" href="http://208.109.87.107/public/3a/b4/1c/_html-elements_12-01-25v1_.css"
/>
> <link rel="shortcut icon" href="http://208.109.87.107/public/3a/b2/1c/favicon.ico"
/>
> <title>Food Stream</title>
> <link rel="profile" href="http://microformats.org/profile/hcard" />
> </head>
> <body>
> <h1>Documenting a graphic</h1>
> <dl class="dublincore">
> <dt>Title</dt>
> <dd><span vocab="http://purl.org/dc/elements/1.1/" property="dc:title">Food
Stream</span></dd>
> <dt>Creator:</dt>
> <dd><span vocab="http://purl.org/dc/elements/1.1/" property="dc:creator">Gig
Graham</span></dd>
> <dt>Contributor</dt>
> <dd><span vocab="http://purl.org/dc/elements/1.1/" property="dc:contributor">Garrett
Long</span></dd>
> <dt>Product:</dt>
> <dd><span vocab="http://rdf.data-vocabulary.org/#" typeof="v:Product">Product
text here</span></dd>
> <dd><span property="v:brand">Product brand text here</span></dd>
> <dd><span property="v:description">Product description text here</span></dd>
> <dt>Offer:</dt>
> <dd><span vocab="http://rdf.data-vocabulary.org/#" typeof="v:Offer">Offer
text here</span></dd>
> <dd><span property="v:offerdetails">Offer detail text here</span></dd>
> <dd><span property="v:price" datatype="x:integer">4000</span>
> <span property="v:currency">USD</span></dd>
> <dt>Description:</dt>
> <dd><span vocab="http://purl.org/dc/elements/1.1/" property="dc:description">Description
text here</span></dd>
> <dt>Publisher:</dt>
> <dd><span vocab="http://purl.org/dc/elements/1.1/" property="dc:publisher">American
Food Data Systems Institute</span></dd>
> <dt>Date Created:</dt>
> <dd><span vocab="http://purl.org/dc/terms/" property="dct:issued">2012-01-15</span></dd>
> <dt>Date Modified:</dt>
> <dd><span vocab="http://purl.org/dc/terms/" property="dct:modified">Not modified</span></dd>
> <dt>Type</dt>
> <dd><span vocab="http://purl.org/dc/elements/1.1/" property="dc:type">Set
of images</span></dd>
> <dt>Format</dt>
> <dd><span vocab="http://purl.org/dc/elements/1.1/" property="dc:format">image/png</span></dd>
> <dt>Identifiier</dt>
> <dd><span vocab="http://purl.org/dc/terms/" property="dct:identifier"><a
class="image" href="http://www.afdsi.org/media/images/drupal/front_page/_food_stream_11-12-05_.png"
onclick="window.open(this.href); return false">Name of graphic</a></span></dd>
> <dt>Includes Content From:</dt>
> <dd><span vocab="http://purl.org/dc/terms/" property="dct:relation:hasPart"><a
class="image" href="http://thenounproject.com/" onclick="window.open(this.href); return false">The
Noun Project</a></span></dd>
> <dt>Is Part Of:</dt>
> <dd><span vocab="http://purl.org/dc/terms/" property="dct:relation:isPartOf"><a
class="image" href="http://www.afdsi.org/media/images/drupal/front_page/_food_stream_11-12-05_.png"
onclick="window.open(this.href); return false">AFDSI front page</a></span></dd>
> <dt>Rights</dt>
> <dd><span vocab="http://purl.org/dc/elements/1.1/" property="dc:rights"><a
class="url" href="http://creativecommons.org/licenses/by-nc-nd/3.0/" onclick="window.open(this.href);
return false">Creative Commons Attribution-NonCommercial-NoDerivs 3.0 Unported (CC BY-NC-ND
3.0)</a></span></dd>
> <dt>Audience</dt>
> <dd><span vocab="http://purl.org/dc/elements/1.1/" property="dc:audience">News
Media</span></dd>
> <dt>Audience</dt>
> <dd><span vocab="http://purl.org/dc/elements/1.1/" property="dc:audience">Community
Groups</span></dd>
> <dt>Audience</dt>
> <dd><span vocab="http://purl.org/dc/elements/1.1/" property="dc:audience">Parents
and Families</span></dd>
> <dt>Audience</dt>
> <dd><span vocab="http://purl.org/dc/elements/1.1/" property="dc:audience">Policymakers</span></dd>
> <dt>Audience</dt>
> <dd><span vocab="http://purl.org/dc/elements/1.1/" property="dc:audience">Researchers</span></dd>
> <dt>Subject using terms from UN Food and Agriculture Organization Subject Categories
(AGRIS)</dt>
> <dd><span vocab="http://purl.org/agmes/1.1/" property="agmes:ASC"><a class="url"
href="http://www.iana.org/domains/example/" onclick="window.open(this.href); return false"
rel="tag">AGRIS-1</a></span></dd>
> <dd><span vocab="http://purl.org/agmes/1.1/" property="agmes:ASC"><a class="url"
href="http://www.iana.org/domains/example/" onclick="window.open(this.href); return false"
rel="tag">AGRIS-2</a></span></dd>
> <dd><span vocab="http://purl.org/agmes/1.1/" property="agmes:ASC"><a class="url"
href="http://www.iana.org/domains/example/" onclick="window.open(this.href); return false"
rel="tag">AGRIS-3</a></span></dd>
> <dt>Subject using terms from US Department of Agriculture National Agricultural
Library Thesaurus (NALT)</dt>
> <dd><span vocab="http://purl.org/agmes/1.1/" property="agmes:NALT"><a
class="url" href="http://www.iana.org/domains/example/" onclick="window.open(this.href); return
false" rel="tag">NALT-1</a></span></dd>
> <dd><span vocab="http://purl.org/agmes/1.1/" property="agmes:NALT"><a
class="url" href="http://www.iana.org/domains/example/" onclick="window.open(this.href); return
false" rel="tag">NALT-2</a></span></dd>
> <dd><span vocab="http://purl.org/agmes/1.1/" property="agmes:NALT"><a
class="url" href="http://www.iana.org/domains/example/" onclick="window.open(this.href); return
false" rel="tag">NALT-3</a></span></dd>
> <dt>Subject using terms from US Library of Congress Subject Headings (LCSH)</dt>
> <dd><span vocab="http://purl.org/agmes/1.1/" property="agmes:LCSH"><a
class="url" href="http://www.iana.org/domains/example/" onclick="window.open(this.href); return
false" rel="tag">LCSH-1</a></span></dd>
> <dd><span vocab="http://purl.org/agmes/1.1/" property="agmes:LCSH"><a
class="url" href="http://www.iana.org/domains/example/" onclick="window.open(this.href); return
false" rel="tag">LCSH-2</a></span></dd>
> <dd><span vocab="http://purl.org/agmes/1.1/" property="agmes:LCSH"><a
class="url" href="http://www.iana.org/domains/example/" onclick="window.open(this.href); return
false" rel="tag">LCSH-3</a></span></dd>
> <dt>Subject using terms from US National Library of Medicine Medical Subject Headings
(MeSH)</dt>
> <dd><span vocab="http://purl.org/agmes/1.1/" property="agmes:MESH"><a
class="url" href="http://www.iana.org/domains/example/" onclick="window.open(this.href); return
false" rel="tag">MeSH-1</a></span></dd>
> <dd><span vocab="http://purl.org/agmes/1.1/" property="agmes:MESH"><a
class="url" href="http://www.iana.org/domains/example/" onclick="window.open(this.href); return
false" rel="tag">MeSH-2</a></span></dd>
> <dd><span vocab="http://purl.org/agmes/1.1/" property="agmes:MESH"><a
class="url" href="http://www.iana.org/domains/example/" onclick="window.open(this.href); return
false" rel="tag">MeSH-3</a></span></dd>
> </dl>
> </body>
> </html>
> {code}
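
The innermost cause in the trace above is a QName check failing on the value 'dct:relation:hasPart', which contains two colons; the same pattern appears in property="dct:relation:isPartOf". A QName allows at most one colon (prefix:localName). The following is only a minimal sketch of such a check, not the code Xalan or Any23 actually runs: the class name QNameCheck and the method isQName are hypothetical, and the NCName pattern is a simplified ASCII-only approximation of the XML rule.

```java
import java.util.regex.Pattern;

public class QNameCheck {
    // Simplified, ASCII-only approximation of an XML NCName (a name with no colon).
    private static final String NCNAME = "[A-Za-z_][A-Za-z0-9_.\\-]*";

    // A QName is either a bare NCName or prefix:localName, so at most one colon.
    private static final Pattern QNAME = Pattern.compile("(" + NCNAME + ":)?" + NCNAME);

    // Hypothetical helper: true if the whole value has QName shape.
    public static boolean isQName(String value) {
        return value != null && QNAME.matcher(value).matches();
    }

    public static void main(String[] args) {
        // The first two values come from the pasted document; the third is a one-colon variant.
        String[] samples = {"dct:issued", "dct:relation:hasPart", "dct:hasPart"};
        for (String s : samples) {
            System.out.println(s + " -> " + (isQName(s) ? "QName" : "not a QName"));
        }
    }
}
```

Under this sketch only the two-colon value is rejected. Dublin Core Terms does define hasPart and isPartOf as properties in their own right, so dct:hasPart and dct:isPartOf may be what the page author intended, but that is a guess about the document rather than something stated in the report.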

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        
