harmony-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Tony Wu" <wuyue...@gmail.com>
Subject Re: [classlib][luni][charset]Strange behavior of UnicodeBig
Date Tue, 17 Oct 2006 09:03:23 GMT
Thank you Andrew,
I think I got the point. The j.l.String of RI uses the encoding of IO
whereas Charset.forName use another of NIO.

And the new problem is shall we follow the spec[1] to support the two
suites of charset implemetation? I just have a look and find we does
not support some Canonical Name for java.io and java.lang API such as
UnicodeBigUnmarked,UnicodeLittleUnmarked,UnicodeBig,Unicodelittle,etc.

[1] http://java.sun.com/j2se/1.5.0/docs/guide/intl/encoding.doc.html

On 10/17/06, Andrew Zhang <zhanghuangzhu@gmail.com> wrote:
> On 10/17/06, Andrew Zhang <zhanghuangzhu@gmail.com> wrote:
> >
> >
> >
> > On 10/17/06, Leo Li <liyilei1979@gmail.com> wrote:
> > >
> > > I think Harmony is more reasonable.
> > >
> > > As spec says, if  Charset.forName("UnicodeBig") throws
> > > .UnsupportedCharsetException then no support for the named charset is
> > > available in this instance of the Java virtual machine. Then how can we
> > > get
> > > new String(b, "UnicodeBig") without throwing UnsupportedCharsetException
> > > on
> > > the same jvm? The spec for String(byte[] bytes,String charsetName) also
> > > says
> > > if the named charset is not supported, UnsupportedCharsetException
> > > should be
> > > thrown out.
> >
> >
> > UNICODEBIG is a java alias for UTF-16BE. I think we'd better support such
> > mapping in String and follow RI.
> >
>
> You can find the encoding set from spec. [1]
>
> [1] http://java.sun.com/j2se/1.5.0/docs/guide/intl/encoding.doc.html
>
>  On 10/17/06, Tony Wu <wuyuehao@gmail.com> wrote:
> > > >
> > > > Hi all,
> > > > I found this when I tried to debug the failure tests of ant on
> > > > harmony. Note the output of testcases below.
> > > >
> > > > import java.io.UnsupportedEncodingException;
> > > > import java.nio.charset.Charset ;
> > > > import junit.framework.TestCase;
> > > >
> > > > public class TestCharset extends TestCase {
> > > >    public void test1() throws UnsupportedEncodingException {
> > > >        byte[] b = new byte[] { 'a', 'b', 'c' };
> > > >        String s = new String(b, "UnicodeBig");
> > > >        assertEquals("abc", s);
> > > >    }
> > > >
> > > >    public void test2() {
> > > >        Charset.forName("UnicodeBig");
> > > >    }
> > > > }
> > > >
> > > > RI:
> > > > test1: junit.framework.ComparisonFailure: expected:<abc> but was:<>
> > > > test2: java.nio.charset.UnsupportedCharsetException: UnicodeBig
> > > >
> > > > Harmony:
> > > > test1:java.nio.charset.UnsupportedCharsetException: UnicodeBig
> > > > test2:
> > > > java.nio.charset.UnsupportedCharsetException: The unsupported charset
> > > > name is "UnicodeBig"
> > > >
> > > > seems RI can recognize the *UnicodeBig* in Constructor of j.l.String,
> > > > whereas Harmony does not support this alias at all.
> > > >
> > > > Do you have any concern about that?
> > > > --
> > > > Tony Wu
> > > > China Software Development Lab, IBM
> > > >
> > > > ---------------------------------------------------------------------
> > > > Terms of use : http://incubator.apache.org/harmony/mailing.html
> > > > To unsubscribe, e-mail: harmony-dev-unsubscribe@incubator.apache.org
> > > > For additional commands, e-mail: harmony-dev-help@incubator.apache.org
> > > >
> > > >
> > >
> > >
> > > --
> > > Leo Li
> > > China Software Development Lab, IBM
> > >
> > >
> >
> >
> > --
> > Best regards,
> > Andrew Zhang
>
>
>
>
> --
> Best regards,
> Andrew Zhang
>
>


-- 
Tony Wu
China Software Development Lab, IBM

---------------------------------------------------------------------
Terms of use : http://incubator.apache.org/harmony/mailing.html
To unsubscribe, e-mail: harmony-dev-unsubscribe@incubator.apache.org
For additional commands, e-mail: harmony-dev-help@incubator.apache.org


Mime
View raw message