Return-Path: Delivered-To: apmail-httpd-apreq-dev-archive@www.apache.org Received: (qmail 53450 invoked from network); 10 Mar 2005 22:47:08 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (209.237.227.199) by minotaur-2.apache.org with SMTP; 10 Mar 2005 22:47:08 -0000 Received: (qmail 78175 invoked by uid 500); 10 Mar 2005 22:47:07 -0000 Delivered-To: apmail-httpd-apreq-dev-archive@httpd.apache.org Received: (qmail 78145 invoked by uid 500); 10 Mar 2005 22:47:07 -0000 Mailing-List: contact apreq-dev-help@httpd.apache.org; run by ezmlm Precedence: bulk List-Post: List-Help: List-Unsubscribe: List-Subscribe: Delivered-To: mailing list apreq-dev@httpd.apache.org Received: (qmail 78129 invoked by uid 99); 10 Mar 2005 22:47:06 -0000 X-ASF-Spam-Status: No, hits=-0.0 required=10.0 tests=SPF_HELO_PASS,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (hermes.apache.org: domain of gcaa-apreq-dev@m.gmane.org designates 80.91.229.2 as permitted sender) Received: from main.gmane.org (HELO ciao.gmane.org) (80.91.229.2) by apache.org (qpsmtpd/0.28) with ESMTP; Thu, 10 Mar 2005 14:47:05 -0800 Received: from list by ciao.gmane.org with local (Exim 4.43) id 1D9WNT-0000P0-Dm for apreq-dev@httpd.apache.org; Thu, 10 Mar 2005 23:43:36 +0100 Received: from adsl-3-9-19.mia.bellsouth.net ([65.3.9.19]) by main.gmane.org with esmtp (Gmexim 0.1 (Debian)) id 1AlnuQ-0007hv-00 for ; Thu, 10 Mar 2005 23:43:35 +0100 Received: from joe+gmane by adsl-3-9-19.mia.bellsouth.net with local (Gmexim 0.1 (Debian)) id 1AlnuQ-0007hv-00 for ; Thu, 10 Mar 2005 23:43:35 +0100 X-Injected-Via-Gmane: http://gmane.org/ Mail-Followup-To: apreq-dev@httpd.apache.org To: apreq-dev@httpd.apache.org From: Joe Schaefer Subject: Re: today's code review.. Date: Thu, 10 Mar 2005 17:40:47 -0500 Lines: 27 Message-ID: <87u0njktu8.fsf@gemini.sunstarsys.com> References: <20050309230726.GA30466@roonstrasse.net> <87oedsmju4.fsf@gemini.sunstarsys.com> Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii X-Complaints-To: usenet@sea.gmane.org X-Gmane-NNTP-Posting-Host: adsl-3-9-19.mia.bellsouth.net Mail-Copies-To: never User-Agent: Gnus/5.1007 (Gnus v5.10.7) Emacs/21.3 (gnu/linux) Cancel-Lock: sha1:HD23cV7hM7hkoWSdJtwdi435KZc= Sender: news X-Gmane-MailScanner: Found to be clean X-Gmane-MailScanner: Found to be clean X-MailScanner-From: gcaa-apreq-dev@m.gmane.org X-MailScanner-To: apreq-dev@httpd.apache.org X-Virus-Checked: Checked X-Spam-Rating: minotaur-2.apache.org 1.6.2 0/1000/N Joe Schaefer writes: > Max Kellermann writes: [...] >> - we should define what apreq_decode() does when it sees 8 bit ASCII >> escapes and unicode escapes in one string; the current behaviour >> (ignoring that) isn't a good solution. Maybe mark with that utf8 >> flag? (could also solve the previous item on my list) > > Hmm, tough question. If we start marking things as utf8, we need to > do it accurately. Can you be sure the unicode escapes don't come from > some other UTF encoding? IIRC the current url_decode implementation treats %uXXXX as the numeric 16 bit codepoint, and it decodes that into a utf8 character. The %uXXXX comes from the escape() function of ecma-262 B.2.1, so we can be confident about the utf8-ness of the result whenever such an encoding is detected (in the query string, obviously). But beyond that, I'm not sure what we can do right now in apreq. The %uXXXX escapes are rare enough that I don't think we should adjust our decode-APIs around them. -- Joe Schaefer