incubator-any23-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Paolo Castagna (Created) (JIRA)" <>
Subject [jira] [Created] (ANY23-18) Add a new extractor for RDFa using java-rdfa
Date Sun, 06 Nov 2011 05:22:51 GMT
Add a new extractor for RDFa using java-rdfa

                 Key: ANY23-18
             Project: Apache Any23
          Issue Type: Improvement
            Reporter: Paolo Castagna
            Priority: Minor

I wonder if it is possible to add a new RDFa extractor which uses java-rdfa [1].

java-rdfa is (according to its creator, Damian Steer :-)) "the cruftiest RDFa parser in the
world" (and he is probably right!). java-rdfa is currently passing all conformance tests for
XHTML, and the HTML 4 and 5 tests with one exception [2]. An online service|demo [3] is also
available. java-rdfa, as far as I understand, is currently licensed with a BSD license. The
Maven artifacts are available in the Maven central repository [4].

>From my little understanding of Any23, in order to do this one needs to implement BlindExtractor
(which extends Extractor<URI>) and ContentExtractor (which extends Extractor<InputStream>).


This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators:!default.jspa
For more information on JIRA, see:


View raw message