Return-Path: Delivered-To: apmail-tomcat-users-archive@www.apache.org Received: (qmail 43099 invoked from network); 25 Jun 2008 16:56:53 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 25 Jun 2008 16:56:53 -0000 Received: (qmail 97245 invoked by uid 500); 25 Jun 2008 16:56:42 -0000 Delivered-To: apmail-tomcat-users-archive@tomcat.apache.org Received: (qmail 97129 invoked by uid 500); 25 Jun 2008 16:56:41 -0000 Mailing-List: contact users-help@tomcat.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: "Tomcat Users List" Delivered-To: mailing list users@tomcat.apache.org Received: (qmail 97118 invoked by uid 99); 25 Jun 2008 16:56:41 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 25 Jun 2008 09:56:41 -0700 X-ASF-Spam-Status: No, hits=1.2 required=10.0 tests=SPF_NEUTRAL,WHOIS_NETSOLPR X-Spam-Check-By: apache.org Received-SPF: neutral (athena.apache.org: local policy) Received: from [76.96.30.80] (HELO QMTA08.emeryville.ca.mail.comcast.net) (76.96.30.80) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 25 Jun 2008 16:55:49 +0000 Received: from OMTA12.emeryville.ca.mail.comcast.net ([76.96.30.44]) by QMTA08.emeryville.ca.mail.comcast.net with comcast id iFy61Z00B0x6nqcA806M00; Wed, 25 Jun 2008 16:55:59 +0000 Received: from [192.168.1.125] ([68.55.225.178]) by OMTA12.emeryville.ca.mail.comcast.net with comcast id iGvw1Z0083ra03G8YGvwVV; Wed, 25 Jun 2008 16:55:57 +0000 X-Authority-Analysis: v=1.0 c=1 a=1WbabGOjOGIA:10 a=Q-3M8mMfjh4A:10 a=mV9VRH-2AAAA:8 a=74_OhrOYAAAA:8 a=SSmOFEACAAAA:8 a=xe8BsctaAAAA:8 a=pj4PKCVLILdwjZI-OhAA:9 a=h_QB7Te8ATPkMsA_9dQA:7 a=VJlrcZzTK5WfXzrtizlMl-qJGcoA:4 a=rPt6xJ-oxjAA:10 Message-ID: <4862789A.6040705@christopherschultz.net> Date: Wed, 25 Jun 2008 12:55:54 -0400 From: Christopher Schultz User-Agent: Thunderbird 2.0.0.14 (Windows/20080421) MIME-Version: 1.0 To: Tomcat Users List Subject: Re: <4860E82D.4060603@ice-sa.com> <4860FA34.8080205@christopherschultz.net> <48610546.4040506@ice-sa.com> <48613892.9070507@christopherschultz.net> <48620E44.7090707@ice-sa.com> In-Reply-To: <48620E44.7090707@ice-sa.com> X-Enigmail-Version: 0.95.6 Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 8bit X-Virus-Checked: Checked by ClamAV on apache.org -----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1 André, André Warnier wrote: | It has been recently shown in a thread in this same forum | that one does not normally need a filter, and I would submit that using | a filter as indicated will corrupt data in some instances. I disagree. The filter is required for clients which silently submit UTF-8. In that case, the server defaults to ISO-8859-1 and your data is corrupted. Your demand that nobody should use GET parameters is unreasonable. Ergo, the filter is required. | In the article at | > http://wiki.apache.org/tomcat/FAQ/CharacterEncoding | there is also a problem in the form shown under the title Then fix it. You are a member of this community, too, now ;) | How can I test if my configuration will work correctly? Create a URL that contains encoded UTF-8 characters that would displayed differently if interpreted as ISO-8859-1. This is not difficult to do. For instance, http://my.server.com/something?psi=ψ | As demonstrated by a recent thread here also, the
tag as shown, | is missing a | enctype="multipart/form-data" | attribute. The default form enctype is application/x-www-form-urlencoded, which is fine when no form elements are present (see http://www.w3.org/TR/html401/interact/forms.html#h-17.3). | This will cause Tomcat to misinterpret the form data in some cases. | One could also argue that adding an attribute | accept-charset="UTF-8" | would make it even more failsafe. Fair enough. I'm not sure this affects GET requests generated by forms, though. Also, not all parameters in GET URLs are from forms: some are normal links (and are often problematic). | In addition, the article also repeats a mistake often seen, which is to | tell people that it's ok to send form data via a GET and use non | US-ASCII data. This is a receipe for problems, see the first mentioned | article at java.sun.com. The only reason it's a recipe for problems is because clients are inconsistent about their use of character encoding in URLs. Non-ASCII characters are fine as long as the client and server agree on the encoding (which is sometimes problematic). Don't confuse the issue of non-ascii characters in URLs (which is fine) with the inability of clients and servers to communicate the character encoding properly (which is not fine). | Now, I know that these are Wiki articles and can be corrected by anyone, | but isn't that a problem ? For better or worse, these articles are used | as reference by Tomcat users. See your own response above. | If someone goes ahead and posts incorrect technical stuff there, there | is a problem, no ? Yes. If something is wrong, it should be fixed. We might only find out that it's broken because someone reads it and finds a problem. Given your passion for the Truth-with-a-capital-T, please correct the article. Someone in the future may re-correct it if your version of the truth turns out to be ... lacking. | I mean that I, as a mere user, don't feel at ease going ahead and | modifying the Wiki article of someone else unilaterally, nor of posting | another one saying the previous one is all wrong. But maybe there | should be some form of authoritative control of the accuracy of what is | posted there ? The Wiki is a wiki so that the documentation can grow organically, rather than having to wait until some blessed Tomcat developer gets around to fixing the documentation. The power has been placed into your hands for a reason. Wikis keep versions, so if you replace everything with porn, it'll just get reverted and you'll get booted off. Given that you will likely be making a positive contribution, I'm sure your changes will stick. You have to abandon the "us versus them" mentality that you have about you and the rest of the community. Most of the active users on this list are not Tomcat developers. There is no "them". There is only "us". - -chris -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.9 (MingW32) Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org iEYEARECAAYFAkhieJoACgkQ9CaO5/Lv0PAsdACgqgKUeQEB+5y+hGWePFNEfpfk l/AAoKEItRcDZfU1BQmPss8qZ5OXc/Hu =cy91 -----END PGP SIGNATURE----- --------------------------------------------------------------------- To start a new topic, e-mail: users@tomcat.apache.org To unsubscribe, e-mail: users-unsubscribe@tomcat.apache.org For additional commands, e-mail: users-help@tomcat.apache.org