Return-Path: Delivered-To: apmail-jakarta-commons-user-archive@www.apache.org Received: (qmail 87034 invoked from network); 4 Jan 2007 12:47:05 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 4 Jan 2007 12:47:05 -0000 Received: (qmail 35180 invoked by uid 500); 4 Jan 2007 12:47:07 -0000 Delivered-To: apmail-jakarta-commons-user-archive@jakarta.apache.org Received: (qmail 35133 invoked by uid 500); 4 Jan 2007 12:47:07 -0000 Mailing-List: contact commons-user-help@jakarta.apache.org; run by ezmlm Precedence: bulk List-Unsubscribe: List-Help: List-Post: List-Id: "Jakarta Commons Users List" Reply-To: "Jakarta Commons Users List" Delivered-To: mailing list commons-user@jakarta.apache.org Received: (qmail 35122 invoked by uid 99); 4 Jan 2007 12:47:07 -0000 Received: from herse.apache.org (HELO herse.apache.org) (140.211.11.133) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 04 Jan 2007 04:47:07 -0800 X-ASF-Spam-Status: No, hits=0.0 required=10.0 tests= X-Spam-Check-By: apache.org Received-SPF: neutral (herse.apache.org: local policy) Received: from [216.222.200.2] (HELO buzz.frogspace.net) (216.222.200.2) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 04 Jan 2007 04:46:57 -0800 Received: from tbee.dyndns.org ([82.92.119.213] helo=[10.0.0.100]) by buzz.frogspace.net with esmtp (Exim 4.44) id 1H2RzQ-0007R8-82 for commons-user@jakarta.apache.org; Thu, 04 Jan 2007 04:46:36 -0800 Message-ID: <459CF72C.80201@tbee.org> Date: Thu, 04 Jan 2007 13:46:36 +0100 From: Tom User-Agent: Thunderbird 1.5.0.9 (Windows/20061207) MIME-Version: 1.0 To: Jakarta Commons Users List Subject: Re: [Urgent] UTF-8 encoding problem References: In-Reply-To: Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 8bit X-SA-Exim-Connect-IP: 82.92.119.213 X-SA-Exim-Mail-From: tbee@tbee.org X-Virus-Checked: Checked by ClamAV on apache.org You could also try and use an encoding aware editor to edit the XML file; for example XMLSpy or Eclipse with the web tools plugins. These editors will interpret the specified encoding value and save the XML in that encoding. Very convenient! Tom Jan.Van-Stalle@ec.europa.eu wrote: > Sound more like an encoding problem to me; > > Try submitting an xml-file with "simple" (<127) characters; these are encoded the same way in other encoding schemes (like windows 1252) and utf8; if this works, I would think that the submitted xml-file is not correctly utf8 encoded; as the xml-header defines that the xml is UTF8 encoded, characters like �, �, or the euro-sign will be encoded differently. > > Jan > > ------------------ > Jan Van Stalle > DIGIT.B.03 > tel +32 2 299 49 82 > Bureau MO34 2/54 > > > -----Original Message----- > From: Mark Diggory [mailto:mdiggory@gmail.com] > Sent: Thursday, January 04, 2007 1:08 PM > To: Jakarta Commons Users List > Subject: Re: [Urgent] UTF-8 encoding problem > > > This looks more like an XML / Xerces Parsing issue, I would seek help there. > Sounds like your placing non-UTF encoded chars into your XML file. > > -Mark > > On 12/28/06, DECAFFMEYER MATHIEU wrote: > >> Hi, >> >> I am using Jakarta Configuration to manipulate some XML files. >> I have the following error when I open one of the files : >> >> org.apache.commons.configuration.ConfigurationException: Octet 2 incorrect >> dans la s�quence UTF-8 � 3-octets. >> at org.apache.commons.configuration.XMLConfiguration.load( >> XMLConfiguration.java:620) >> at org.apache.commons.configuration.XMLConfiguration.load( >> XMLConfiguration.java:578) >> at >> org.apache.commons.configuration.XMLConfiguration$XMLFileConfigurationDelegate.load >> (XMLConfiguration.java:1045) >> >> at org.apache.commons.configuration.AbstractFileConfiguration.load >> (AbstractFileConfiguration.java:280) >> [...] >> >> >> >> Caused by: java.io.UTFDataFormatException: Octet 2 incorrect dans la >> s�quence UTF-8 � 3-octets. >> at org.apache.xerces.impl.io.UTF8Reader.invalidByte(Unknown >> Source) >> at org.apache.xerces.impl.io.UTF8Reader.read(Unknown Source) >> at org.apache.xerces.impl.XMLEntityScanner.load(Unknown Source) >> [...] >> >> >> The headlines of the file is : >> >> >> > >> >> >> ]> >> [...] >> >> I have an XML with exactly the same lines at the top, >> and I have no problem with this one : >> >> > >> >> >> ]> >> [...] >> >> What do u suggest me to do ? >> >> Thank u for any help ! Will be greatly appreciated ! >> >> >> ============================================ >> Internet communications are not secure and therefore Fortis Banque >> Luxembourg S.A. does not accept legal responsibility for the contents of >> this message. The information contained in this e-mail is confidential and >> may be legally privileged. It is intended solely for the addressee. If you >> are not the intended recipient, any disclosure, copying, distribution or any >> action taken or omitted to be taken in reliance on it, is prohibited and may >> be unlawful. Nothing in the message is capable or intended to create any >> legally binding obligations on either party and it is not intended to >> provide legal advice. >> ============================================ >> >> --------------------------------------------------------------------- >> To unsubscribe, e-mail: commons-user-unsubscribe@jakarta.apache.org >> For additional commands, e-mail: commons-user-help@jakarta.apache.org >> >> >> > > --------------------------------------------------------------------- > To unsubscribe, e-mail: commons-user-unsubscribe@jakarta.apache.org > For additional commands, e-mail: commons-user-help@jakarta.apache.org > > > --------------------------------------------------------------------- To unsubscribe, e-mail: commons-user-unsubscribe@jakarta.apache.org For additional commands, e-mail: commons-user-help@jakarta.apache.org