any23-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From lewismc <...@git.apache.org>
Subject [GitHub] any23 issue #49: ANY23-314 Service fails to return extraction in case of ext...
Date Tue, 12 Dec 2017 22:01:42 GMT
Github user lewismc commented on the issue:

    https://github.com/apache/any23/pull/49
  
    In the case where one encounters a parse and/or extraction error hence an unsuccessful
extraction, one would see the following result. 
    As you can see, the partial extraction is now included at the bottom of the servlet response
which is much better e.g. more forgiving, than a plain stack trace and error message.
    ```
    Failed to fully parse input. The extraction result, at the bottom of this response, if
any, will contain extractions only up until the extraction error.
    ================================================================
    
    ------------ BEGIN Exception context ------------
    ExtractionContext(urn:x-any23:html-rdfa11:root-extraction-result-id:http://any23.apache.org/)
    Errors {
    	ERROR: 	'The entity "copy" was referenced, but not declared.' 	(-1,-1)
    }
    ------------ END   Exception context ------------
    
    org.apache.any23.extractor.ExtractionException: Error while parsing RDF document.
    	at org.apache.any23.extractor.rdf.BaseRDFExtractor.run(BaseRDFExtractor.java:109)
    	at org.apache.any23.extractor.rdf.BaseRDFExtractor.run(BaseRDFExtractor.java:41)
    	at org.apache.any23.extractor.SingleDocumentExtraction.runExtractor(SingleDocumentExtraction.java:467)
    	at org.apache.any23.extractor.SingleDocumentExtraction.run(SingleDocumentExtraction.java:256)
    	at org.apache.any23.Any23.extract(Any23.java:300)
    	at org.apache.any23.Any23.extract(Any23.java:452)
    	at org.apache.any23.servlet.WebResponder.runExtraction(WebResponder.java:117)
    	at org.apache.any23.servlet.Servlet.doGet(Servlet.java:82)
    	at javax.servlet.http.HttpServlet.service(HttpServlet.java:624)
    	at javax.servlet.http.HttpServlet.service(HttpServlet.java:731)
    	at org.apache.catalina.core.ApplicationFilterChain.internalDoFilter(ApplicationFilterChain.java:303)
    	at org.apache.catalina.core.ApplicationFilterChain.doFilter(ApplicationFilterChain.java:208)
    	at org.apache.tomcat.websocket.server.WsFilter.doFilter(WsFilter.java:52)
    	at org.apache.catalina.core.ApplicationFilterChain.internalDoFilter(ApplicationFilterChain.java:241)
    	at org.apache.catalina.core.ApplicationFilterChain.doFilter(ApplicationFilterChain.java:208)
    	at org.apache.catalina.core.StandardWrapperValve.invoke(StandardWrapperValve.java:218)
    	at org.apache.catalina.core.StandardContextValve.invoke(StandardContextValve.java:122)
    	at org.apache.catalina.authenticator.AuthenticatorBase.invoke(AuthenticatorBase.java:505)
    	at org.apache.catalina.core.StandardHostValve.invoke(StandardHostValve.java:169)
    	at org.apache.catalina.valves.ErrorReportValve.invoke(ErrorReportValve.java:103)
    	at org.apache.catalina.valves.AccessLogValve.invoke(AccessLogValve.java:956)
    	at org.apache.catalina.core.StandardEngineValve.invoke(StandardEngineValve.java:116)
    	at org.apache.catalina.connector.CoyoteAdapter.service(CoyoteAdapter.java:442)
    	at org.apache.coyote.http11.AbstractHttp11Processor.process(AbstractHttp11Processor.java:1083)
    	at org.apache.coyote.AbstractProtocol$AbstractConnectionHandler.process(AbstractProtocol.java:640)
    	at org.apache.tomcat.util.net.JIoEndpoint$SocketProcessor.run(JIoEndpoint.java:318)
    	at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
    	at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
    	at org.apache.tomcat.util.threads.TaskThread$WrappingRunnable.run(TaskThread.java:61)
    	at java.lang.Thread.run(Thread.java:748)
    Caused by: org.eclipse.rdf4j.rio.RDFParseException: org.xml.sax.SAXParseException; lineNumber:
306; columnNumber: 55; The entity "copy" was referenced, but not declared.
    	at org.semarglproject.rdf4j.rdf.rdfa.RDF4JRDFaParser.parse(RDF4JRDFaParser.java:111)
    	at org.semarglproject.rdf4j.rdf.rdfa.RDF4JRDFaParser.parse(RDF4JRDFaParser.java:95)
    	at org.apache.any23.extractor.rdf.BaseRDFExtractor.run(BaseRDFExtractor.java:105)
    	... 29 more
    Caused by: org.semarglproject.rdf.ParseException: org.xml.sax.SAXParseException; lineNumber:
306; columnNumber: 55; The entity "copy" was referenced, but not declared.
    	at org.semarglproject.rdf.rdfa.RdfaParser.processException(RdfaParser.java:1141)
    	at org.semarglproject.source.XmlSource.process(XmlSource.java:50)
    	at org.semarglproject.source.StreamProcessor.processInternal(StreamProcessor.java:87)
    	at org.semarglproject.source.BaseStreamProcessor.process(BaseStreamProcessor.java:167)
    	at org.semarglproject.source.BaseStreamProcessor.process(BaseStreamProcessor.java:154)
    	at org.semarglproject.rdf4j.rdf.rdfa.RDF4JRDFaParser.parse(RDF4JRDFaParser.java:109)
    	... 31 more
    Caused by: org.xml.sax.SAXParseException; lineNumber: 306; columnNumber: 55; The entity
"copy" was referenced, but not declared.
    	at org.apache.xerces.parsers.AbstractSAXParser.parse(Unknown Source)
    	at org.semarglproject.source.XmlSource.process(XmlSource.java:48)
    	... 35 more
    ================================================================
    <data>
    <![CDATA[
    @prefix sindice: <http://vocab.sindice.net/> .
    
    <http://any23.apache.org/> <http://vocab.sindice.net/any23#Date-Revision-yyyymmdd>
"20171101"@en ;
    	<http://vocab.sindice.net/any23#Content-Language> "en"@en ;
    	<http://vocab.sindice.net/any23#viewport> "width=device-width, initial-scale=1.0"@en
;
    	<http://vocab.sindice.net/any23#author> "The Apache Software Foundation"@en .
    @prefix dcterms: <http://purl.org/dc/terms/> .
    
    <http://any23.apache.org/> dcterms:title "Apache Any23 – Apache Any23 - Introduction"@en
.
    ]]>
    </data>
    
    ```


---

Mime
View raw message