Return-Path: Delivered-To: apmail-cocoon-users-archive@www.apache.org Received: (qmail 66824 invoked from network); 6 Apr 2008 17:16:46 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 6 Apr 2008 17:16:46 -0000 Received: (qmail 93235 invoked by uid 500); 6 Apr 2008 17:16:40 -0000 Delivered-To: apmail-cocoon-users-archive@cocoon.apache.org Received: (qmail 93185 invoked by uid 500); 6 Apr 2008 17:16:39 -0000 Mailing-List: contact users-help@cocoon.apache.org; run by ezmlm Precedence: bulk list-help: list-unsubscribe: List-Post: Reply-To: users@cocoon.apache.org List-Id: Delivered-To: mailing list users@cocoon.apache.org Received: (qmail 93174 invoked by uid 99); 6 Apr 2008 17:16:39 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 06 Apr 2008 10:16:39 -0700 X-ASF-Spam-Status: No, hits=3.8 required=10.0 tests=HTML_MESSAGE,MIME_QP_LONG_LINE,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of Magnus.Haraldsen.Amundsen@computas.com designates 193.71.42.14 as permitted sender) Received: from [193.71.42.14] (HELO mail.computas.com) (193.71.42.14) by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 06 Apr 2008 17:15:55 +0000 Received: by mail.computas.com (Postfix, from userid 102) id 8E51013666; Sun, 6 Apr 2008 19:16:03 +0200 (CEST) Received: from ATLAS.computas.int (neptun.computas.com [192.168.4.113])by mail.computas.com (Postfix) with ESMTP id 5F0C21331Bfor ; Sun, 6 Apr 2008 19:16:03 +0200 (CEST) X-MimeOLE: Produced By Microsoft Exchange V6.5 Content-class: urn:content-classes:message MIME-Version: 1.0 Content-Type: multipart/alternative;boundary="----_=_NextPart_001_01C89809.E44A764B" Subject: Cocoon and UTF-8: Invalid byte 2 of 3-byte UTF-8 sequence Date: Sun, 6 Apr 2008 19:16:03 +0200 Message-ID: <495295EE69BB4B4AAE73D1B0D8AD551801BA2987@ATLAS.computas.int> X-MS-Has-Attach: X-MS-TNEF-Correlator: Thread-Topic: Cocoon and UTF-8: Invalid byte 2 of 3-byte UTF-8 sequence Thread-Index: AciYCeRDa2GEMmutTI+Xr/hcgACEEw== From: "Magnus Haraldsen Amundsen" To: X-Virus-Checked: Checked by ClamAV on apache.org ------_=_NextPart_001_01C89809.E44A764B Content-Type: text/plain;charset="iso-8859-1" Content-Transfer-Encoding: quoted-printable Hi, I'm still having problems with Cocoon and UTF-8 using Windows XP/Vista. Every time a searchresult/page content etc. contains the norwegian = characters "=E6 =F8 =E5" I get a org.xml.sax.SAXParseException: Invalid = byte 2 of 3-byte UTF-8 sequence. This problem does not occur with Linux. I've created a smallest possible code example to recreate the exception. = This code (zipped) can be found here: = https://submarine.computas.com/sublima/trunk/temp/Cocoontest.zip The basic flow of the code example is: 1. Request a URL 2. Sitemap matches the URL and calls a StatelessAppleController 3. The StatelessAppleController adds a String containg the special = characters to a Map, and forwards it using res.sendPage("xml/test", = bizData); 4. Sitemap matches xml/test and=20 The jx.xml takes the String from the Map in the StatelessAppleController = from a I've followed the How to configure consistent encoding in Cocoon-steps, = but it still doesn't work. Could anyone take a look at the code and see if the spot the = problem/solution?=20 - Magnus IMPORTANT NOTICE: This message may contain confidential information. If you have received this e-mail in error, do not use, copy or distribute it. Do not open any attachments. Delete it immediately from your system and notify the sender promptly by e-mail that you have done so. Thank you. ------_=_NextPart_001_01C89809.E44A764B Content-Type: text/html;charset="iso-8859-1" Content-Transfer-Encoding: quoted-printable Cocoon and UTF-8: Invalid byte 2 of 3-byte UTF-8 sequence

Hi,

I'm still having problems with Cocoon and UTF-8 using Windows = XP/Vista.
Every time a searchresult/page content etc. contains the norwegian = characters "=E6 =F8 =E5" I get a = org.xml.sax.SAXParseException: Invalid byte 2 of 3-byte UTF-8 sequence. = This problem does not occur with Linux.
I've created a smallest possible code example to recreate the exception. = This code (zipped) can be found here: https://submarine.computas.com/sublima/trunk/temp/Cocoontest.zip
=
The basic flow of the code example is:

1. Request a URL
2. Sitemap matches the URL and calls a StatelessAppleController
3. The StatelessAppleController adds a String containg the special = characters to a Map, and forwards it using = res.sendPage("xml/test", bizData);
4. Sitemap matches xml/test and

<map:match pattern=3D"xml/*">
  <map:generate src=3D"templates/{1}.jx.xml" = type=3D"jx"/>
  <map:transform src=3D"transforms/test.xslt"/>
  <map:serialize type=3D"xml"/>
</map:match>

The jx.xml takes the String from the Map in the StatelessAppleController = from a <jx:out value=3D"#{testresults}" = xmlize=3D"true"/>

I've followed the How to configure consistent encoding in Cocoon-steps, = but it still doesn't work.

Could anyone take a look at the code and see if the spot the = problem/solution?

- Magnus


IMPORTANT NOTICE: This message may contain confidential information. If you have received this e-mail in error, do not use, copy or distribute it. Do not open any attachments. Delete it immediately from your system and notify the sender promptly by e-mail that you have done so. Thank you.
------_=_NextPart_001_01C89809.E44A764B--