Return-Path: Delivered-To: apmail-jakarta-httpcomponents-dev-archive@www.apache.org Received: (qmail 4996 invoked from network); 10 Mar 2007 06:11:31 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 10 Mar 2007 06:11:31 -0000 Received: (qmail 85290 invoked by uid 500); 10 Mar 2007 06:11:39 -0000 Delivered-To: apmail-jakarta-httpcomponents-dev-archive@jakarta.apache.org Received: (qmail 85268 invoked by uid 500); 10 Mar 2007 06:11:38 -0000 Mailing-List: contact httpcomponents-dev-help@jakarta.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: "HttpComponents Project" Delivered-To: mailing list httpcomponents-dev@jakarta.apache.org Received: (qmail 85259 invoked by uid 99); 10 Mar 2007 06:11:38 -0000 Received: from herse.apache.org (HELO herse.apache.org) (140.211.11.133) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 09 Mar 2007 22:11:38 -0800 X-ASF-Spam-Status: No, hits=0.0 required=10.0 tests= X-Spam-Check-By: apache.org Received: from [140.211.11.4] (HELO brutus.apache.org) (140.211.11.4) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 09 Mar 2007 22:11:29 -0800 Received: from brutus (localhost [127.0.0.1]) by brutus.apache.org (Postfix) with ESMTP id 6C012714062 for ; Fri, 9 Mar 2007 22:11:09 -0800 (PST) Message-ID: <25617922.1173507069438.JavaMail.jira@brutus> Date: Fri, 9 Mar 2007 22:11:09 -0800 (PST) From: "Ralf Hauser (JIRA)" To: httpcomponents-dev@jakarta.apache.org Subject: [jira] Commented: (HTTPCLIENT-642) browser encoded UTF-8 character gets truncated by URI upon escaping In-Reply-To: <9565412.1173462669391.JavaMail.jira@brutus> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable X-Virus-Checked: Checked by ClamAV on apache.org [ https://issues.apache.org/jira/browse/HTTPCLIENT-642?page=3Dcom.atlas= sian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12479807 ]= =20 Ralf Hauser commented on HTTPCLIENT-642: ---------------------------------------- when doing the same e.g. on apache httpd plus typo3, I get=20 %26%2333021%3B that also properly decodes with=20 URLDecoder.decode() but if I were to escape # as %23 before URI escaping, I'll end up with %252= 3, hmm... > browser encoded UTF-8 character gets truncated by URI upon escaping > ------------------------------------------------------------------- > > Key: HTTPCLIENT-642 > URL: https://issues.apache.org/jira/browse/HTTPCLIENT-642 > Project: HttpComponents HttpClient > Issue Type: Bug > Affects Versions: 3.0.1 > Reporter: Ralf Hauser > > a mozilla 1.5.0.10 get request of an iso-8859-1 form where a user inadver= tently entered a chinese character arrives at my tomcat like=20 > String url=3D "/hp/index.php?address=3Daddr&email=3Dhauser@acm.org&name= =3DRalf&#33021; GMX&subject=3DNewsletter" > the chinese charcter =E8=83=BD being encoded as &#33021; > URI uri =3D new URI(url, false, "ISO-8859= -1"); > =09=09=09=09GetMethod httpGet =3D new GetMethod(uri.getEscapedURI()); > =09=09=09=09log.debug(httpGet.getURI()); > "/hp/index.php?address=3Daddr&email=3Dhauser@acm.org&name=3DRalf&" > How should I deal with that until the v4 is out? Will that no longer happ= en there? > =20 > see also HTTPCLIENT-577 --=20 This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online. --------------------------------------------------------------------- To unsubscribe, e-mail: httpcomponents-dev-unsubscribe@jakarta.apache.org For additional commands, e-mail: httpcomponents-dev-help@jakarta.apache.org