xml-general mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Joseph Shraibman <...@selectacast.net>
Subject accented characters and xerces j
Date Tue, 06 Nov 2001 01:16:52 GMT
I'm using Xerces 1.3.1

I have a file that contains 'รถ', ascii 246


When I try to parse the file using xerces I get:
: 151, 6: An invalid XML character (Unicode: 0x1b6803) was found in the element content of

the document.

Presumably when java reads the file before it gets to xerces it converts 246 to that 
unicode value, but why?  I'm using the default (US) locale.

You can get the files involved from:
http://www.selectacast.net/~jks/xml/pr2.xml
http://www.selectacast.net/~jks/xml/pr2.txt is the original text file.


-- 
Joseph Shraibman
jks@selectacast.net
Increase signal to noise ratio.  http://www.targabot.com


---------------------------------------------------------------------
In case of troubles, e-mail:     webmaster@xml.apache.org
To unsubscribe, e-mail:          general-unsubscribe@xml.apache.org
For additional commands, e-mail: general-help@xml.apache.org


Mime
View raw message