lucene-dev mailing list archives

From Steve McKay <shubalub...@gmail.com>
Subject Re: Invalid XML Output (Arabic): <int name="0<"
Date Thu, 30 Aug 2012 15:05:32 GMT
Are you sure this isn't an Eclipse issue? The plaintext (quoted-printable) source of the <int>
elements looks like this:

<int name=3D"=D8=A7=D9=84=D9=85=D8=B3=D8=AA=D8=B4=D9=81=D9=89 =D9=88=D9=82=D8=A7=D9=84=D8=AA
=D9=8A=D8=B1=D9=88=D8=AD">0</int>

AFAIK this is valid XML. There are only a few codepoints not allowed by XML 1.0. I don't see
a right-to-left mark, so it seems like Mail.app is trying to infer a right-to-left mark for
the Arabic text and getting it wrong, causing the mangled display.
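
One way to check this outside of any mail client or IDE is to decode the quoted-printable source and inspect it directly. The following is only a minimal sketch in Python: it assumes the two wrapped source lines above were originally joined by a single space, and the names `QP_SOURCE`, `BIDI_CONTROLS` and `is_xml10_char` are made up for illustration. It parses the element, then looks for characters outside the XML 1.0 `Char` production and for explicit bidirectional control characters.

```python
import quopri
import xml.etree.ElementTree as ET

# Quoted-printable source from the message above. The archive wrapped it
# across two lines; joining them with a single space is an assumption.
QP_SOURCE = (
    b'<int name=3D"=D8=A7=D9=84=D9=85=D8=B3=D8=AA=D8=B4=D9=81=D9=89 '
    b'=D9=88=D9=82=D8=A7=D9=84=D8=AA '
    b'=D9=8A=D8=B1=D9=88=D8=AD">0</int>'
)

# Explicit bidirectional control characters: ALM, LRM, RLM,
# the embedding/override set (U+202A..U+202E) and the
# isolate set (U+2066..U+2069).
BIDI_CONTROLS = (
    {0x061C, 0x200E, 0x200F}
    | set(range(0x202A, 0x202F))
    | set(range(0x2066, 0x206A))
)


def is_xml10_char(ch):
    """Return True if ch matches the Char production of XML 1.0."""
    cp = ord(ch)
    return (
        cp in (0x9, 0xA, 0xD)
        or 0x20 <= cp <= 0xD7FF
        or 0xE000 <= cp <= 0xFFFD
        or 0x10000 <= cp <= 0x10FFFF
    )


# Decode quoted-printable to bytes, then interpret the bytes as UTF-8.
xml_text = quopri.decodestring(QP_SOURCE).decode("utf-8")

# ET.fromstring raises xml.etree.ElementTree.ParseError if the text
# is not well-formed XML.
element = ET.fromstring(xml_text)
name = element.get("name")

illegal = [hex(ord(c)) for c in xml_text if not is_xml10_char(c)]
bidi = [hex(ord(c)) for c in xml_text if ord(c) in BIDI_CONTROLS]

print("attribute value:", name)
print("characters outside XML 1.0 Char:", illegal)
print("bidi control characters:", bidi)
```

If the element parses and both lists come back empty, the bytes themselves are fine and the problem is in how the text is rendered, not in what Solr wrote.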

On Aug 29, 2012, at 2:17 PM, Fuad Efendi <fuad@efendi.ca> wrote:

> Hi all,
> 
> It looks like we have very special "command" character here… which mirrors some "visible"
> images… but it is still invalid XML when I try to validate in Eclipse…
> Solr-4.0.0-BETA
> 
> 
> 
> <?xml version="1.0" encoding="UTF-8"?>
> <response>
> 	<lst name="responseHeader">
> 		<int name="status">0</int>
> 		<int name="QTime">237</int>
> 		<lst name="params">
> 			<str name="facet">true</str>
> 			<str name="facet.offset">100000000</str>
> 			<str name="facet.sort">index</str>
> 			<str name="facet.limit">10</str>
> 			<str name="facet.field">enrich_keywords_string_mv</str>
> 		</lst>
> 	</lst>
> 	<result
> 		name="response"
> 		numFound="0"
> 		start="0"
> 	></result>
> 	<lst name="facet_counts">
> 		<lst name="facet_queries"/>
> 		<lst name="facet_fields">
> 			<lst name="enrich_keywords_string_mv">
> 				<int name="المستشفى وقالت يروح">0</int>
> 				<int name="المستشفى وقالو لي">0</int>
> 				<int name="المستشفى وقالوا خلاص">0</int>
> 				<int name="المستشفى وقالوا عندك">0</int>
> 				<int name="المستشفى وقالوا لا">0</int>
> 				<int name="المستشفى وقالوا لابو">0</int>
> 				<int name="المستشفى وقالوا لهم">0</int>
> 				<int name="المستشفى وقالوا لي">0</int>
> 				<int name="المستشفى وقالى تعالى">0</int>
> 				<int name="المستشفى وقام بعمل">0</int>
> 			</lst>
> 		</lst>
> 		<lst name="facet_dates"/>
> 		<lst name="facet_ranges"/>
> 	</lst>
> </response>

