xml-general mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Alex Amies <AAm...@access360.com>
Subject RE: Why are characters 2 bytes in Xalan?
Date Fri, 27 Apr 2001 23:38:15 GMT
Actually Unicode characters are not necessarily 2 byte.  The Unicode
standard does not specify how many bytes are used to represent 
characters.  However, the XML and related standards say that parsers
should support UTF-8.  UTF-8 will use 1 byte to store ascii characters,
2 bytes to represent more exotic European characters, and 3 bytes to
represent Chinese / Japanese / Korean characters, and some other 
even more exotic characters, such as what archeaologists study.


-----Original Message-----
From: Martin Stricker [mailto:shugal@gmx.de]
Sent: Friday, April 27, 2001 3:01 PM
To: general@xml.apache.org
Subject: Re: Why are characters 2 bytes in Xalan?


Lee Chin wrote:
> 
> Why are characters 2 bytes as opposed to 1 byte in
> xalan?

Xalan uses Unicode http://www.unicode.org/ . Unicode can use 65536
different characters because it's 2 Byte. ANSI can only use 256
different characters using 1 Byte. Unicode is the try to get all these
different character sets like latin, greek, cyrillic, various asian
character sets etc. into *one* character set, thus beeing able to
display different character sets in one document without the need to
switch the font or the character set property.

Best regards,
Martin Stricker
-- 
Homepage: http://www.martin-stricker.de/
Registered Linux user #210635: http://counter.li.org/

---------------------------------------------------------------------
In case of troubles, e-mail:     webmaster@xml.apache.org
To unsubscribe, e-mail:          general-unsubscribe@xml.apache.org
For additional commands, e-mail: general-help@xml.apache.org

---------------------------------------------------------------------
In case of troubles, e-mail:     webmaster@xml.apache.org
To unsubscribe, e-mail:          general-unsubscribe@xml.apache.org
For additional commands, e-mail: general-help@xml.apache.org


Mime
View raw message