From dev-return-5033-archive-asf-public=cust-asf.ponee.io@any23.apache.org Thu Jan 25 05:52:05 2018 Return-Path: X-Original-To: archive-asf-public@eu.ponee.io Delivered-To: archive-asf-public@eu.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by mx-eu-01.ponee.io (Postfix) with ESMTP id C8313180630 for ; Thu, 25 Jan 2018 05:52:05 +0100 (CET) Received: by cust-asf.ponee.io (Postfix) id B81D8160C4E; Thu, 25 Jan 2018 04:52:05 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id D8374160C3C for ; Thu, 25 Jan 2018 05:52:04 +0100 (CET) Received: (qmail 13041 invoked by uid 500); 25 Jan 2018 04:52:04 -0000 Mailing-List: contact dev-help@any23.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@any23.apache.org Delivered-To: mailing list dev@any23.apache.org Received: (qmail 13030 invoked by uid 99); 25 Jan 2018 04:52:03 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd3-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 25 Jan 2018 04:52:03 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd3-us-west.apache.org (ASF Mail Server at spamd3-us-west.apache.org) with ESMTP id 6EEB618015F for ; Thu, 25 Jan 2018 04:52:03 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd3-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: -107.911 X-Spam-Level: X-Spam-Status: No, score=-107.911 tagged_above=-999 required=6.31 tests=[ENV_AND_HDR_SPF_MATCH=-0.5, KAM_ASCII_DIVIDERS=0.8, RCVD_IN_DNSWL_LOW=-0.7, SPF_PASS=-0.001, T_RP_MATCHES_RCVD=-0.01, USER_IN_DEF_SPF_WL=-7.5, USER_IN_WHITELIST=-100] autolearn=disabled Received: from mx1-lw-us.apache.org ([10.40.0.8]) by localhost (spamd3-us-west.apache.org [10.40.0.10]) (amavisd-new, port 10024) with ESMTP id 1Wyn97BJK38w for ; Thu, 25 Jan 2018 04:52:01 +0000 (UTC) Received: from mailrelay1-us-west.apache.org (mailrelay1-us-west.apache.org [209.188.14.139]) by mx1-lw-us.apache.org (ASF Mail Server at mx1-lw-us.apache.org) with ESMTP id 6D2135FAF7 for ; Thu, 25 Jan 2018 04:52:01 +0000 (UTC) Received: from jira-lw-us.apache.org (unknown [207.244.88.139]) by mailrelay1-us-west.apache.org (ASF Mail Server at mailrelay1-us-west.apache.org) with ESMTP id 5C99CE099A for ; Thu, 25 Jan 2018 04:52:00 +0000 (UTC) Received: from jira-lw-us.apache.org (localhost [127.0.0.1]) by jira-lw-us.apache.org (ASF Mail Server at jira-lw-us.apache.org) with ESMTP id 1BDD7240EE for ; Thu, 25 Jan 2018 04:52:00 +0000 (UTC) Date: Thu, 25 Jan 2018 04:52:00 +0000 (UTC) From: "Lewis John McGibbney (JIRA)" To: dev@any23.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Resolved] (ANY23-271) Address "...The entity "raquo" was referenced, but not declared" SAXParseException MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/ANY23-271?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved ANY23-271. ---------------------------------------- Resolution: Fixed Fixed via https://github.com/apache/any23/pull/59 > Address "...The entity "raquo" was referenced, but not declared" SAXParseException > ---------------------------------------------------------------------------------- > > Key: ANY23-271 > URL: https://issues.apache.org/jira/browse/ANY23-271 > Project: Apache Any23 > Issue Type: Bug > Components: extractors > Affects Versions: 1.1 > Reporter: Lewis John McGibbney > Priority: Major > Fix For: 2.2 > > > When attempting extractions on the following URL > http://data.brandweeraa.nl/data/incident/2016/32601/deployment/201601272048400 > I get the following Exception with the Webservice at any23.org > {code} > > > Could not parse input. > > ------------ BEGIN Exception context ------------ > ExtractionContext(urn:x-any23:html-rdfa11:root-extraction-result-id:http://data.brandweeraa.nl/data/incident/2016/32601/deployment/201601272048400) > Errors { > } > ------------ END Exception context ------------ > org.apache.any23.extractor.ExtractionException: Error while parsing RDF document. > at org.apache.any23.extractor.rdf.BaseRDFExtractor.run(BaseRDFExtractor.java:109) > at org.apache.any23.extractor.rdf.BaseRDFExtractor.run(BaseRDFExtractor.java:41) > at org.apache.any23.extractor.SingleDocumentExtraction.runExtractor(SingleDocumentExtraction.java:463) > at org.apache.any23.extractor.SingleDocumentExtraction.run(SingleDocumentExtraction.java:255) > at org.apache.any23.Any23.extract(Any23.java:298) > at org.apache.any23.Any23.extract(Any23.java:450) > at org.apache.any23.servlet.WebResponder.runExtraction(WebResponder.java:114) > at org.apache.any23.servlet.Servlet.doGet(Servlet.java:79) > at javax.servlet.http.HttpServlet.service(HttpServlet.java:618) > at javax.servlet.http.HttpServlet.service(HttpServlet.java:725) > at org.apache.catalina.core.ApplicationFilterChain.internalDoFilter(ApplicationFilterChain.java:301) > at org.apache.catalina.core.ApplicationFilterChain.doFilter(ApplicationFilterChain.java:206) > at org.apache.tomcat.websocket.server.WsFilter.doFilter(WsFilter.java:52) > at org.apache.catalina.core.ApplicationFilterChain.internalDoFilter(ApplicationFilterChain.java:239) > at org.apache.catalina.core.ApplicationFilterChain.doFilter(ApplicationFilterChain.java:206) > at org.apache.catalina.core.StandardWrapperValve.invoke(StandardWrapperValve.java:219) > at org.apache.catalina.core.StandardContextValve.invoke(StandardContextValve.java:106) > at org.apache.catalina.authenticator.AuthenticatorBase.invoke(AuthenticatorBase.java:503) > at org.apache.catalina.core.StandardHostValve.invoke(StandardHostValve.java:136) > at org.apache.catalina.valves.ErrorReportValve.invoke(ErrorReportValve.java:74) > at org.apache.catalina.valves.AbstractAccessLogValve.invoke(AbstractAccessLogValve.java:610) > at org.apache.catalina.core.StandardEngineValve.invoke(StandardEngineValve.java:88) > at org.apache.catalina.connector.CoyoteAdapter.service(CoyoteAdapter.java:526) > at org.apache.coyote.ajp.AbstractAjpProcessor.process(AbstractAjpProcessor.java:794) > at org.apache.coyote.AbstractProtocol$AbstractConnectionHandler.process(AbstractProtocol.java:652) > at org.apache.tomcat.util.net.NioEndpoint$SocketProcessor.doRun(NioEndpoint.java:1575) > at org.apache.tomcat.util.net.NioEndpoint$SocketProcessor.run(NioEndpoint.java:1533) > at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145) > at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) > at java.lang.Thread.run(Thread.java:745) > Caused by: org.openrdf.rio.RDFParseException: org.xml.sax.SAXParseException; lineNumber: 14; columnNumber: 105; The entity "raquo" was referenced, but not declared. > at org.semarglproject.sesame.rdf.rdfa.SesameRDFaParser.parse(SesameRDFaParser.java:111) > at org.semarglproject.sesame.rdf.rdfa.SesameRDFaParser.parse(SesameRDFaParser.java:95) > at org.apache.any23.extractor.rdf.BaseRDFExtractor.run(BaseRDFExtractor.java:105) > ... 29 more > Caused by: org.semarglproject.rdf.ParseException: org.xml.sax.SAXParseException; lineNumber: 14; columnNumber: 105; The entity "raquo" was referenced, but not declared. > at org.semarglproject.rdf.rdfa.RdfaParser.processException(RdfaParser.java:1130) > at org.semarglproject.source.XmlSource.process(XmlSource.java:50) > at org.semarglproject.source.StreamProcessor.processInternal(StreamProcessor.java:87) > at org.semarglproject.source.BaseStreamProcessor.process(BaseStreamProcessor.java:167) > at org.semarglproject.source.BaseStreamProcessor.process(BaseStreamProcessor.java:154) > at org.semarglproject.sesame.rdf.rdfa.SesameRDFaParser.parse(SesameRDFaParser.java:109) > ... 31 more > Caused by: org.xml.sax.SAXParseException; lineNumber: 14; columnNumber: 105; The entity "raquo" was referenced, but not declared. > at org.apache.xerces.parsers.AbstractSAXParser.parse(Unknown Source) > at org.semarglproject.source.XmlSource.process(XmlSource.java:48) > ... 35 more > ]]> > > > {code} -- This message was sent by Atlassian JIRA (v7.6.3#76005)