any23-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Hudson (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (ANY23-226) Extract JSON-LD embedded in HTML
Date Sat, 21 Mar 2015 05:31:38 GMT

    [ https://issues.apache.org/jira/browse/ANY23-226?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14372528#comment-14372528
] 

Hudson commented on ANY23-226:
------------------------------

UNSTABLE: Integrated in Any23-trunk #1309 (See [https://builds.apache.org/job/Any23-trunk/1309/])
ANY23-226 Extract JSON-LD embedded in HTML (lewis.j.mcgibbney: rev 1e3eb9c31af2f93906eee1081179d73c30a0881b)
* core/src/main/java/org/apache/any23/extractor/html/EmbeddedJSONLDExtractorFactory.java
* core/src/main/java/org/apache/any23/extractor/html/DomUtils.java
* core/src/test/java/org/apache/any23/extractor/html/EmbeddedJSONLDExtractorTest.java
* plugins/integration-test/src/test/java/org/apache/any23/plugin/PluginIT.java
* core/src/main/resources/org/apache/any23/prefixes/prefixes.properties
* core/src/main/java/org/apache/any23/extractor/rdf/BaseRDFExtractor.java
* core/src/main/resources/org/apache/any23/extractor/html/example-embedded-jsonld.html
* test-resources/src/test/resources/html/html-embedded-jsonld-extractor.html
* core/src/main/java/org/apache/any23/extractor/html/EmbeddedJSONLDExtractor.java
ANY23-226 : Make JSONLD extraction work (p_ansell: rev fd822849190240b8cf981ecc7abd0b4f592381d5)
* core/src/main/resources/META-INF/services/org.apache.any23.extractor.ExtractorFactory
* src/site/apt/any23-plugins.apt
* core/src/test/java/org/apache/any23/extractor/rdf/JSONLDExtractorTest.java
* core/src/main/java/org/apache/any23/cli/MicrodataParser.java
* plugins/html-scraper/src/main/java/org/apache/any23/plugin/htmlscraper/HTMLScraperExtractor.java
* core/src/main/java/org/apache/any23/extractor/html/HResumeExtractorFactory.java
* core/src/main/java/org/apache/any23/writer/TriXWriterFactory.java
* core/src/test/java/org/apache/any23/extractor/html/RDFMergerTest.java
* core/src/main/resources/META-INF/services/org.apache.any23.cli.Tool
* core/src/main/java/org/apache/any23/extractor/rdfa/RDFa11ExtractorFactory.java
* core/src/test/java/org/apache/any23/extractor/html/HCalendarExtractorTest.java
* core/src/main/java/org/apache/any23/extractor/html/HCardExtractorFactory.java
* plugins/basic-crawler/src/main/java/org/apache/any23/cli/Crawler.java
* core/src/main/java/org/apache/any23/writer/URIListWriterFactory.java
* core/src/main/java/org/apache/any23/extractor/rdf/BaseRDFExtractor.java
* core/src/main/java/org/apache/any23/extractor/xpath/XPathExtractorFactory.java
* core/src/test/java/org/apache/any23/extractor/html/EmbeddedJSONLDExtractorTest.java
* core/src/main/java/org/apache/any23/extractor/html/HeadLinkExtractorFactory.java
* core/src/main/java/org/apache/any23/extractor/rdf/JSONLDExtractorFactory.java
* test-resources/src/test/resources/html/html-embedded-jsonld-extractor.html
* core/src/main/java/org/apache/any23/cli/ExtractorDocumentation.java
* plugins/html-scraper/src/main/resources/META-INF/services/org.apache.any23.extractor.ExtractorFactory
* core/src/main/java/org/apache/any23/writer/TurtleWriterFactory.java
* core/src/main/java/org/apache/any23/extractor/rdf/NTriplesExtractorFactory.java
* core/src/main/resources/META-INF/services/org.apache.any23.writer.WriterFactory
* core/src/main/java/org/apache/any23/writer/RDFXMLWriterFactory.java
* core/src/main/java/org/apache/any23/cli/MimeDetector.java
* core/src/test/java/org/apache/any23/extractor/example/ExampleExtractorFactory.java
* core/src/main/java/org/apache/any23/writer/NTriplesWriterFactory.java
* core/src/test/java/org/apache/any23/extractor/rdfa/AbstractRDFaExtractorTestCase.java
* plugins/office-scraper/src/main/resources/META-INF/services/org.apache.any23.extractor.ExtractorFactory
* core/src/main/java/org/apache/any23/extractor/html/AdrExtractorFactory.java
* core/src/main/java/org/apache/any23/cli/VocabPrinter.java
* core/src/main/java/org/apache/any23/extractor/rdf/TriXExtractorFactory.java
* plugins/html-scraper/src/main/java/org/apache/any23/plugin/htmlscraper/HTMLScraperExtractorFactory.java
* core/src/main/java/org/apache/any23/extractor/html/EmbeddedJSONLDExtractor.java
* core/src/main/java/org/apache/any23/extractor/rdfa/RDFaExtractorFactory.java
* plugins/office-scraper/src/main/java/org/apache/any23/plugin/officescraper/ExcelExtractor.java
* core/src/main/java/org/apache/any23/writer/NQuadsWriterFactory.java
* plugins/office-scraper/src/main/java/org/apache/any23/plugin/officescraper/ExcelExtractorFactory.java
* core/src/test/java/org/apache/any23/extractor/html/SpeciesExtractorTest.java
* core/src/main/java/org/apache/any23/extractor/html/TurtleHTMLExtractorFactory.java
* core/pom.xml
* core/src/main/java/org/apache/any23/extractor/html/SpeciesExtractorFactory.java
* core/src/main/java/org/apache/any23/extractor/html/HTMLMetaExtractorFactory.java
* nquads/src/main/java/org/apache/any23/io/nquads/NQuadsParserFactory.java
* core/src/main/java/org/apache/any23/extractor/html/ICBMExtractorFactory.java
* core/src/main/java/org/apache/any23/extractor/microdata/MicrodataExtractorFactory.java
* core/src/main/java/org/apache/any23/extractor/html/GeoExtractorFactory.java
* core/src/main/java/org/apache/any23/extractor/html/XFNExtractorFactory.java
* core/src/test/java/org/apache/any23/extractor/html/HTMLMetaExtractorTest.java
* core/src/test/java/org/apache/any23/extractor/html/HRecipeExtractorTest.java
* core/src/test/java/org/apache/any23/extractor/html/AbstractExtractorTestCase.java
* core/src/test/java/org/apache/any23/extractor/html/HResumeExtractorTest.java
* core/src/main/java/org/apache/any23/extractor/html/HRecipeExtractorFactory.java
* core/src/main/java/org/apache/any23/extractor/html/TitleExtractorFactory.java
* core/src/main/java/org/apache/any23/extractor/html/LicenseExtractorFactory.java
* core/src/main/java/org/apache/any23/extractor/html/HReviewAggregateExtractorFactory.java
* core/src/test/java/org/apache/any23/extractor/html/HCardExtractorTest.java
* test-resources/src/test/resources/html/html-embedded-jsonld-extractor-multiple.html
* core/src/main/java/org/apache/any23/extractor/csv/CSVExtractorFactory.java
* core/src/main/java/org/apache/any23/extractor/rdf/NQuadsExtractorFactory.java
* core/src/main/java/org/apache/any23/extractor/rdf/TurtleExtractorFactory.java
* core/src/main/java/org/apache/any23/extractor/html/HReviewExtractorFactory.java
* core/src/test/java/org/apache/any23/extractor/html/HReviewExtractorTest.java
* core/src/main/java/org/apache/any23/extractor/html/HCalendarExtractorFactory.java
* core/src/main/java/org/apache/any23/extractor/rdf/RDFXMLExtractorFactory.java
* core/src/main/java/org/apache/any23/writer/JSONWriterFactory.java
* core/src/main/java/org/apache/any23/extractor/html/HListingExtractorFactory.java
* core/src/main/java/org/apache/any23/cli/Rover.java
* core/src/main/java/org/apache/any23/extractor/html/EmbeddedJSONLDExtractorFactory.java
* plugins/basic-crawler/src/main/resources/META-INF/services/org.apache.any23.cli.Tool
* nquads/src/main/java/org/apache/any23/io/nquads/NQuadsWriterFactory.java
* core/src/test/java/org/apache/any23/extractor/html/TurtleHTMLExtractorTest.java
* core/src/test/java/org/apache/any23/extractor/html/HListingExtractorTest.java
* core/src/main/java/org/apache/any23/cli/PluginVerifier.java
* core/src/test/java/org/apache/any23/extractor/csv/CSVExtractorTest.java


> Extract JSON-LD embedded in HTML
> --------------------------------
>
>                 Key: ANY23-226
>                 URL: https://issues.apache.org/jira/browse/ANY23-226
>             Project: Apache Any23
>          Issue Type: Wish
>          Components: core
>    Affects Versions: 1.0
>            Reporter: Lewis John McGibbney
>            Assignee: Lewis John McGibbney
>             Fix For: 1.3
>
>
>  See http://www.w3.org/TR/json-ld/#embedding-json-ld-in-html-documents
> I feel that we need to push this down at the jsonld-java level.
> I am investigating.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message