any23-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From St├ęphane Corlosquet <scorlosq...@gmail.com>
Subject Re: any23 RDFAExtractor creates blank nodes
Date Fri, 30 Nov 2012 16:26:28 GMT
On Fri, Nov 30, 2012 at 10:46 AM, Konstantin Pentchev <
kpentchev@googlemail.com> wrote:

> Hi,
> I was testing the RDFA1.1 Extractor from any23 with some xtml+rdfa. While
> it was able to parse my text, it generated BlankNodes for parent html tags
> that do not contain any semantic information and thus generated incorrect
> RDFA. I tested this vs rdfa play, which generates the statements as
> expected. Is there perhaps some configuration that can ammend this?
>
> Here is my code:
>
> private Any23 runner = new Any23();
> DocumentSource source = new ByteArrayDocumentSource(stream, "
> http://temp/document", "text/xhtml");
> ByteArrayOutputStream os = new ByteArrayOutputStream();
> runner.extract(source, new TurtleWriter(os));
>
> Input:
>
> <sentence gate:gateId="4327">
>     <SpaceToken gate:gateId="4577" length="1" kind="space" string=" ">
> </SpaceToken>
>     <span gate:gateId="4305" resource="http://service/resource/id/00139880
> "
> typeof="http://service/vocab/resource/type" rel="
> http://service/resource/mentions">...</span>
> </sentence>
>

Just to be sure, can you confirm this is the output you're expecting from
the above markup (Turtle):

<http://service/resource/id/00139880>
   rdf:type <http://service/vocab/resource/type> .


Steph.

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message