httpd-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From André Malo ...@perlig.de>
Subject Re: How do I handle Unicode inside XML request bits?
Date Tue, 13 Sep 2005 15:59:26 GMT
* William A. Rowe, Jr. wrote:

> Brandon Fosdick wrote:
> > Joe Orton wrote:
> >>The 𐀀 character will be passed through in its four byte UTF-8
> >>form (which is 0xf4 0x80 0x80 0x80 I think)
>
> FYI - 65536 isn't a valid ucs-2 character; it is, however, a valid ucs-4
> character.
>
> That might be part of the origin of your issues, try 65535 as a MAX_VAL
> for ucs-2 (which would be a three-byte utf-8 value.)
>
> 65536 cannot be mapped to utf-8, but it can be mapped as a four byte
> utf-16 sequence.

Sure, it can. The utf-8 sequence is "\xf0\x90\x80\x80".

nd
-- 
Flhacs wird im Usenet grundsätzlich alsfhc geschrieben. Schreibt man
lafhsc nicht slfach, so ist das schlichtweg hclafs. Hingegen darf man
rihctig ruhig rhitcgi schreiben, weil eine shcalfe Schreibweise bei
irhictg nicht als shflac angesehen wird.       -- Hajo Pflüger in dnq

Mime
View raw message