httpd-apreq-dev mailing list archives

From Boris Zentner <...@2bz.de>
Subject Re: Apache::Request, APR::Table and UTF8
Date Tue, 05 Oct 2004 23:52:11 GMT

Hi,

On 05.10.2004 at 19:18, Joe Schaefer wrote:

> David Wheeler <david@kineticode.com> writes:
>
>> On Oct 5, 2004, at 9:20 AM, Joe Schaefer wrote:
>>
>>> We could use a three-bit field for marking the charset:
>>>
>>>   0 - unknown
>>>   1 - ASCII
>>>   2 - UTF-8
>>>   3 - UTF-16
>>>   [ room for 4 more iso? charsets ]
>>

Perhaps it is better to split the byte into two nibbles: bits 0:3 for
the charset and bits 4:7 for flags. This leaves room for 16 charset
values, which is possible because the charset is exclusive, i.e. if it
is ASCII it cannot also be UTF-8.

0000 unknown
0001 ascii
0010 utf8
0011 utf16
0100 ...
...
1111 ...
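
A rough sketch of how that split could look in C, just to illustrate the
idea. All names here (charset_id, pack_meta, meta_charset, meta_flags) are
made up for this example and are not existing apreq identifiers. It also
assumes bit 0 is the least significant bit, so the charset sits in the low
nibble and the flags in the high nibble.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical charset ids; the values follow the table above. */
enum charset_id {
    CHARSET_UNKNOWN = 0x0,
    CHARSET_ASCII   = 0x1,
    CHARSET_UTF8    = 0x2,
    CHARSET_UTF16   = 0x3
    /* 0x4 .. 0xF are left unassigned, as in the table */
};

#define CHARSET_MASK 0x0Fu  /* bits 0:3 hold the charset (assumed low nibble) */
#define FLAGS_SHIFT  4      /* bits 4:7 hold the flags */

/* Combine one charset id and up to four flag bits into a single byte. */
static uint8_t pack_meta(enum charset_id cs, unsigned flags)
{
    assert((unsigned)cs <= 0x0Fu && flags <= 0x0Fu);
    return (uint8_t)((flags << FLAGS_SHIFT) | ((unsigned)cs & CHARSET_MASK));
}

/* The charset is a single value, not a bit set: mask off the low nibble. */
static enum charset_id meta_charset(uint8_t meta)
{
    return (enum charset_id)(meta & CHARSET_MASK);
}

/* The flags are independent bits: shift the high nibble down. */
static unsigned meta_flags(uint8_t meta)
{
    return (unsigned)meta >> FLAGS_SHIFT;
}
```

With this layout, pack_meta(CHARSET_UTF8, 0x5) gives the byte 0x52. Since
the charset is stored as a number rather than as one bit per charset, the
four bits cover 16 values instead of 4.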

--
Boris

