any23-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Lewis John McGibbney (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (ANY23-169) Incorrect interpretation of relative and absolute paths with Microdata
Date Wed, 10 Jan 2018 02:23:03 GMT

    [ https://issues.apache.org/jira/browse/ANY23-169?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16319587#comment-16319587
] 

Lewis John McGibbney commented on ANY23-169:
--------------------------------------------

Using the current any23.org service, I can confirm this bug. The following output is created

{code}
<?xml version="1.0" encoding="UTF-8" ?>
<response>
<extractors>
<extractor>html-microdata</extractor>
<extractor>html-head-title</extractor>
</extractors>
<report>
<message/>
<error/>
<issueReport>
<extractorIssues extractor="html-rdfa11">
<issue level="ERROR" row="-1" col="-1">Attribute name "itemscope" associated with an
element type "span" must be followed by the ' = ' character.</issue>
</extractorIssues>
</issueReport>
<validationReport>
<issues>
</issues>
<ruleActivations>
</ruleActivations>
<errors>
</errors>
</validationReport>
</report>
<data>
<![CDATA[
# OUTPUT FORMAT: Turtle (mimeTypes=text/turtle, application/x-turtle; ext=ttl)
# BEGIN: ExtractionContext(urn:x-any23:html-microdata:root-extraction-result-id:https://ruben.verborgh.org/tmp/slash-test.html)
@prefix doac: <http://ramonantonio.net/doac/0.1/#> .
@prefix rdf: <https://www.w3.org/TR/REC-rdf-syntax#> .
@prefix foaf: <http://xmlns.com/foaf/spec/> .
# BEGIN: ExtractionContext(urn:x-any23:html-head-title:root-extraction-result-id:https://ruben.verborgh.org/tmp/slash-test.html)
@prefix dcterms: <http://purl.org/dc/terms/> .

<https://ruben.verborgh.org/tmp/slash-test.html> dcterms:title "Title" .

_:nodefea56f8ce507e9a90bac1fc5872b158 a <http://schema.org/Person> ;
	<http://schema.org/Person/url> <https://ruben.verborgh.org/tmp/slash-test.html//>
, <https://ruben.verborgh.org/tmp/slash-test.html/other.html> .

<https://ruben.verborgh.org/tmp/slash-test.html> <http://www.w3.org/1999/xhtml/microdata#item>
_:nodefea56f8ce507e9a90bac1fc5872b158 ;
	dcterms:title "Title" .
# END: ExtractionContext(urn:x-any23:html-microdata:root-extraction-result-id:https://ruben.verborgh.org/tmp/slash-test.html)
# END: ExtractionContext(urn:x-any23:html-head-title:root-extraction-result-id:https://ruben.verborgh.org/tmp/slash-test.html)
]]>
</data>
</response>

{code}

> Incorrect interpretation of relative and absolute paths with Microdata
> ----------------------------------------------------------------------
>
>                 Key: ANY23-169
>                 URL: https://issues.apache.org/jira/browse/ANY23-169
>             Project: Apache Any23
>          Issue Type: Bug
>          Components: microdata
>            Reporter: Ruben Verborgh
>              Labels: microdata, url, urls
>             Fix For: 2.2
>
>
> Parsing the following fragment located at http://ruben.verborgh.org/tmp/slash-test.html
>     <a itemprop="url" href="/">Homepage</a>
>     <a itemprop="url" href="other.html">Other</a>
> results in the URIs
>     http://ruben.verborgh.org/tmp/slash-test.html//
>     http://ruben.verborgh.org/tmp/slash-test.html/other.html
> instead of the correct
>     http://ruben.verborgh.org/tmp/
>     http://ruben.verborgh.org/tmp/other.html
> Note that there is no trailing slash in the original.
> Test case:
> http://ruben.verborgh.org/tmp/slash-test.html
> http://any23.org/any23/?format=best&uri=http%3A%2F%2Fruben.verborgh.org%2Ftmp%2Fslash-test.html&validation-mode=none



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Mime
View raw message