Message-ID: <40A0D311.8090404@upaya.co.uk>
Date: Tue, 11 May 2004 14:20:17 +0100
From: Upayavira
To: forrest-dev@xml.apache.org
Subject: Re: Forrest and UTF-8
References: <56409.10.0.0.1.1084138122.squirrel@ags01.agsoftware.dnsalias.com> <409FB61D.7040508@che-che.com>

Sjur Nørstebø Moshagen wrote:

> On 11 May 2004 at 13:00, Fabrice Bacchella wrote:
>
>> On 10 May 04, at 19:04, Juan Jose Pablos wrote:
>>
>>> +1. UTF-8 should be the default encoding.
>>>
>> Mmm. Does anybody know the minimum version of Internet Explorer or
>> Netscape that supports UTF-8? Using UTF-8 might break some not-so-old
>> browsers.
>
>
> The alternative is to support a (potentially large) set of other
> encodings (which we probably should do anyway). But considering that:
> - ASCII is a true subset of UTF-8, and
> - Xalan, when serializing to HTML, will render all characters defined
>   as entities in the HTML spec as entities (= ASCII) (the defined
>   entities cover most of the non-ASCII part of the 8859-series, as well
>   as other characters),
>
> UTF-8 should be no problem for most browsers, even old ones.
> AND UTF-8 solves a lot of _other_ encoding problems in a multilingual
> world, many of which are just as problematic for old browsers as UTF-8.
>
> I first perceived the Xalan behaviour as buggy, generating unnecessarily
> large files in a UTF-8 setting (entities use more space than a
> multibyte UTF-8 character), but considering backwards compatibility
> the behaviour is actually not so bad.
>
> To sum up:
> +1 - UTF-8 should be default, with alternative encodings available as
> an option.

But, if Xalan does as you say, does the encoding make much difference?

Upayavira
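
To make the question concrete, here is a rough Python sketch, not Xalan itself. `escape_to_entities` is a hypothetical helper that only imitates the behaviour described above (named HTML entities where the HTML spec defines one); the fallback to numeric character references for everything else is my assumption, not something confirmed in this thread. It checks two things: ASCII text has the same bytes in UTF-8 and ISO-8859-1, and entity-escaped output is pure ASCII, so its bytes come out the same whichever of those encodings is declared.

```python
import html.entities


def escape_to_entities(text: str) -> str:
    """Imitate an entity-emitting HTML serializer (hypothetical helper).

    Non-ASCII characters with a named HTML entity become that entity;
    any other non-ASCII character becomes a numeric character reference
    (an assumption made for this sketch).
    """
    out = []
    for ch in text:
        code = ord(ch)
        if code < 128:
            out.append(ch)  # plain ASCII passes through unchanged
        elif code in html.entities.codepoint2name:
            out.append("&" + html.entities.codepoint2name[code] + ";")
        else:
            out.append("&#%d;" % code)
    return "".join(out)


sample = "På 11. mai, à 19:04, écrit"
escaped = escape_to_entities(sample)

# ASCII is a subset of UTF-8 (and of ISO-8859-1): identical bytes.
assert "UTF-8".encode("ascii") == "UTF-8".encode("utf-8") == "UTF-8".encode("iso-8859-1")

# The raw text encodes differently depending on the charset...
assert sample.encode("utf-8") != sample.encode("iso-8859-1")

# ...but the entity-escaped text is pure ASCII, so the bytes match.
assert escaped.encode("utf-8") == escaped.encode("iso-8859-1")

# The cost: entities take more bytes than multibyte UTF-8 characters.
print(len(sample.encode("utf-8")), "bytes raw,", len(escaped.encode("ascii")), "bytes escaped")
```

So for characters that have an entity, the declared encoding would make little difference to the bytes sent. It would still matter for any character that is written out raw rather than as an entity.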