Return-Path: Mailing-List: contact cocoon-users-help@xml.apache.org; run by ezmlm Delivered-To: mailing list cocoon-users@xml.apache.org Received: (qmail 38545 invoked from network); 2 Mar 2001 19:33:30 -0000 Received: from pivsbh1.ms.com (199.89.64.103) by h31.sny.collab.net with SMTP; 2 Mar 2001 19:33:30 -0000 Received: (from uucp@localhost) by pivsbh1.ms.com (8.9.3/fw v1.30) id OAA15684; Fri, 2 Mar 2001 14:33:22 -0500 (EST) Received: from localhost(127.0.0.1) by pivsbh1 via smap (4.1) id sma.9835616011.015636; Fri, 2 Mar 01 14:33:21 -0500 Received: (from uucp@localhost) by pivsbh1.ms.com (8.9.3/8.11.2) id OAA15588; Fri, 2 Mar 2001 14:33:20 -0500 (EST) Received: from hasmh1.ms.com(138.20.197.23) by pivsbh1 via smap (4.1) id sma.9835615991.015513; Fri, 2 Mar 01 14:33:19 -0500 Received: from msdw.com (cw017617.morgan.com [138.20.218.52]) by hasmh1.morgan.com (8.8.5/imap+ldap v2.4) with ESMTP id TAA19713; Fri, 2 Mar 2001 19:33:25 GMT Message-ID: <3A9FF449.28FC1A9A@msdw.com> Date: Fri, 02 Mar 2001 19:28:09 +0000 From: Werner Guttmann Reply-To: Werner.Guttmann@msdw.com Organization: Morgan Stanley Dean Witter & Co. X-Mailer: Mozilla 4.75 [en]C-CCK-MCD MS4.75 V20001029.1 (WinNT; U) X-Accept-Language: en,ja MIME-Version: 1.0 To: Robin Green CC: cocoon-users@xml.apache.org Subject: Re: Unicode characters References: Content-Type: text/plain; charset=us-ascii Content-Transfer-Encoding: 7bit X-Spam-Rating: h31.sny.collab.net 1.6.2 0/1000/N Robin, here's an excerpt of the trace as produced by Lynx. Sending HTTP request. HTTP: WRITE delivered OK HTTP request sent; waiting for response. HTTP: Trying to read 1023 HTTP: Read 321 Read 321 bytes of data. HTTP: Rx: HTTP/1.0 200 OK HTTP: Scanned 2 fields from line_buffer --- Talking HTTP1. HTTP/1.0 200 OK HTFormat: Constructing stream stack for www/mime to www/present HTFormat: Looking up presentation for www/mime to www/present StreamStack: found weak wildcard match: www/present FindPresentation: found exact match: www/mime StreamStack: found exact match: www/mime HTMIME: Content-type: text/html; charset=UTF-8 Content-length: 15745 Servlet-engine: Tomcat Web Server/3.2.1 (JSP 1.1; Servlet 2.2; Java 1.2.2; SunOS 5.7 sparc; java.vendor=Sun Microsystems Inc.) The below line > Content-type: text/html; charset=UTF-8 seems to indicate that the response stream is using the UTF-8 character set. I guess that's what I am looking for, right ? Btw, I've added an encoding line to my cocoon.properties after reading your below comment # HTML 4.0 (strict) formatter.text/html.doctype-public = -//W3C//DTD HTML 4.0//EN formatter.text/html.doctype-system = http://www.w3.org/TR/REC-html40/strict.dtd formatter.text/html.encoding = UTF-8 to make sure that the right character set is picked. Werner Robin Green wrote: > Werner Guttmann wrote: > >Bounced Tomcat, > > Sounds painful. Poor kitty. Do you mean restarted? ;) Well, yes .. ;-). And no, I'd never bounce any kitty ... > >reloaded my application, and then I looked at the output > >of a sample page where I show the character encoding of the page via the > > element. To my surprise, it still > >shows > > > >8859_1 > > > >which afair is the Western European encoding. Any idea what's going wrong > >? > > Simple. The char encoding is not set at the XSP stage - it's set after the > Formatter is invoked, which is later. You need to check at the client side - > use lynx, or telnet, or something, maybe. > > _________________________________________________________________________ > Get Your Private, Free E-mail from MSN Hotmail at http://www.hotmail.com.