Return-Path: Delivered-To: apmail-xml-general-archive@xml.apache.org Received: (qmail 89963 invoked by uid 500); 6 Nov 2001 02:36:25 -0000 Mailing-List: contact general-help@xml.apache.org; run by ezmlm Precedence: bulk list-help: list-unsubscribe: list-post: Reply-To: general@xml.apache.org Delivered-To: mailing list general@xml.apache.org Received: (qmail 89951 invoked from network); 6 Nov 2001 02:36:25 -0000 Subject: Re: accented characters and xerces j To: general@xml.apache.org X-Mailer: Lotus Notes Release 5.0.6a January 17, 2001 Message-ID: From: David_N_Bertoni@lotus.com Date: Mon, 5 Nov 2001 21:35:17 -0500 X-MIMETrack: Serialize by Router on CAMMAIL01a/CAM/M/Lotus(Release 5.0.8 |June 18, 2001) at 11/05/2001 09:35:57 PM MIME-Version: 1.0 Content-type: text/plain; charset=iso-8859-1 Content-transfer-encoding: quoted-printable X-Spam-Rating: daedalus.apache.org 1.6.2 0/1000/N This is not the best list for Xerces questions. There is a Xerces-J li= st that you should subscribe to. The problem is that your document is encoded incorrectly. There is no ASCII character 246, since ASCII only defines characters up to 127. However, there _is_ a character defined in ISO-8859-1with such a value.= Your document does not contain an XML declaration, so you need to add o= ne and specify the correct encoding: Dave = =20 Joseph = =20 Shraibman To: general@xml.apache.org= =20 Subject: accented characte= rs and xerces j =20 = =20 11/05/2001 = =20 08:16 PM = =20 Please = =20 respond to = =20 general = =20 = =20 = =20 I'm using Xerces 1.3.1 I have a file that contains '=F6', ascii 246 When I try to parse the file using xerces I get: : 151, 6: An invalid XML character (Unicode: 0x1b6803) was found in the= element content of the document. Presumably when java reads the file before it gets to xerces it convert= s 246 to that unicode value, but why? I'm using the default (US) locale. You can get the files involved from: http://www.selectacast.net/~jks/xml/pr2.xml http://www.selectacast.net/~jks/xml/pr2.txt is the original text file. -- Joseph Shraibman jks@selectacast.net Increase signal to noise ratio. http://www.targabot.com --------------------------------------------------------------------- In case of troubles, e-mail: webmaster@xml.apache.org To unsubscribe, e-mail: general-unsubscribe@xml.apache.org For additional commands, e-mail: general-help@xml.apache.org = --------------------------------------------------------------------- In case of troubles, e-mail: webmaster@xml.apache.org To unsubscribe, e-mail: general-unsubscribe@xml.apache.org For additional commands, e-mail: general-help@xml.apache.org