xml-general mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Joseph Shraibman <...@selectacast.net>
Subject accented characters and xerces j
Date Tue, 06 Nov 2001 01:16:52 GMT
I'm using Xerces 1.3.1

I have a file that contains 'รถ', ascii 246

When I try to parse the file using xerces I get:
: 151, 6: An invalid XML character (Unicode: 0x1b6803) was found in the element content of

the document.

Presumably when java reads the file before it gets to xerces it converts 246 to that 
unicode value, but why?  I'm using the default (US) locale.

You can get the files involved from:
http://www.selectacast.net/~jks/xml/pr2.txt is the original text file.

Joseph Shraibman
Increase signal to noise ratio.  http://www.targabot.com

In case of troubles, e-mail:     webmaster@xml.apache.org
To unsubscribe, e-mail:          general-unsubscribe@xml.apache.org
For additional commands, e-mail: general-help@xml.apache.org

View raw message