any23-dev mailing list archives

From "Peter Ansell (JIRA)" <>
Subject [jira] [Commented] (ANY23-226) Extract JSON-LD embedded in HTML
Date Wed, 11 Mar 2015 21:39:40 GMT


Peter Ansell commented on ANY23-226:

JSONLD-Java doesn't currently parse HTML, whereas Any23 already supports several HTML-based
formats. Hence, it may be more appropriate to pick out the <script type="application/ld+json">...</script>
elements in Any23 and pass their content to JSONLD-Java for parsing.
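
A minimal sketch of that split of responsibilities, under stated assumptions: the class name
JsonLdScriptExtractor and its extract method are hypothetical (they are not part of Any23 or
JSONLD-Java), and a regular expression stands in for a real HTML parser only to keep the example
free of dependencies. Each returned string would then be handed to JSONLD-Java for parsing.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper, not an Any23 or JSONLD-Java class.
final class JsonLdScriptExtractor {

    // Matches <script ... type="application/ld+json" ...>BODY</script>.
    // Assumption: a regex is good enough for a sketch; real HTML needs a real parser.
    private static final Pattern JSON_LD_SCRIPT = Pattern.compile(
            "<script\\b[^>]*\\btype\\s*=\\s*([\"'])application/ld\\+json\\1[^>]*>(.*?)</script\\s*>",
            Pattern.CASE_INSENSITIVE | Pattern.DOTALL);

    // Returns the trimmed body of every JSON-LD script element, in document order.
    static List<String> extract(String html) {
        List<String> blocks = new ArrayList<>();
        Matcher m = JSON_LD_SCRIPT.matcher(html);
        while (m.find()) {
            blocks.add(m.group(2).trim());
        }
        return blocks;
    }
}
```

Scripts with any other type attribute are skipped, so ordinary JavaScript never reaches the
JSON-LD parser.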

If it were to be added to jsonld-java, it would probably end up in a separate integration
module, so that the otherwise unused HTML parsing libraries are pulled in without adding those
dependencies to the core module. The main reason is that embedding in HTML is covered by a
non-normative section of the spec, so it is an optional and not well-defined feature.

One immediate question I have is what would happen if there are two or more JSON-LD elements
in a single HTML document. I would think that each would need to be converted to RDF before they
can be combined, which would lose the context information but should preserve everything else.
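
To illustrate why combining at the RDF level loses the context but keeps the data, here is a toy
model, not the JSON-LD expansion algorithm and not JSONLD-Java's API. The MergeAsTriples and
Block names are hypothetical; each Block stands in for one script element, reduced to a
term-to-IRI context and a single flat node with string values.

```java
import java.util.LinkedHashSet;
import java.util.Map;
import java.util.Set;

// Hypothetical toy model of merging several JSON-LD blocks as RDF statements.
final class MergeAsTriples {

    // Stand-in for one parsed block: its own context plus one node.
    static final class Block {
        final Map<String, String> context;    // short term -> absolute IRI
        final String id;                      // subject IRI
        final Map<String, String> properties; // short term -> literal value

        Block(Map<String, String> context, String id, Map<String, String> properties) {
            this.context = context;
            this.id = id;
            this.properties = properties;
        }
    }

    // Resolves each term against the block's own context and emits N-Triples-like lines.
    // The short terms (the context information) do not survive this step.
    static Set<String> toTriples(Block block) {
        Set<String> triples = new LinkedHashSet<>();
        for (Map.Entry<String, String> e : block.properties.entrySet()) {
            String predicate = block.context.get(e.getKey());
            if (predicate == null) {
                continue; // terms with no mapping are dropped in this toy model
            }
            triples.add("<" + block.id + "> <" + predicate + "> \"" + e.getValue() + "\" .");
        }
        return triples;
    }

    // Combining blocks is then just a set union of their statements.
    static Set<String> merge(Iterable<Block> blocks) {
        Set<String> all = new LinkedHashSet<>();
        for (Block b : blocks) {
            all.addAll(toTriples(b));
        }
        return all;
    }
}
```

Two blocks that both use the term "name" but map it to different IRIs could not safely be merged
as JSON objects, yet in this model their statements combine without conflict once every term has
been resolved to its full IRI.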

> Extract JSON-LD embedded in HTML
> --------------------------------
>                 Key: ANY23-226
>                 URL:
>             Project: Apache Any23
>          Issue Type: Wish
>          Components: core
>    Affects Versions: 1.0
>            Reporter: Lewis John McGibbney
>             Fix For: 1.3
>  See
> I feel that we need to push this down at the jsonld-java level.
> I am investigating.

