Message-ID: <4A5D381D.3090709@gmail.com>
Date: Wed, 15 Jul 2009 09:59:57 +0800
From: Regis
To: dev@harmony.apache.org
Subject: Re: Shall we change our file.encoding
References: <5948b71e0907140250h2ba70787mec98fb2295baa5eb@mail.gmail.com>
 <5c8e69f0907140312l5e53bbccxc3dea83f6480f20d@mail.gmail.com>
 <5948b71e0907140351j4683e031q6861a33a5073eac8@mail.gmail.com>
 <222B26C1-4029-4832-AC33-DC4C78DBD368@gmail.com>
 <5948b71e0907140839k58a391fan54f4a477de1bca9c@mail.gmail.com>
In-Reply-To: <5948b71e0907140839k58a391fan54f4a477de1bca9c@mail.gmail.com>

Charles Lee wrote:
> Hi Nathan,
>
> If the file encoding derives from the OS, there must be a bug somewhere,
> because on my Linux machine the locale is en_US.UTF-8 and our default codec
> is still ISO8859-1. Do you know where we can find that code?

Classlib expects the VM to do this and set the property, but the VM doesn't,
so we have to do it ourselves.

>
> On Tue, Jul 14, 2009 at 10:17 PM, Nathan Beyer wrote:
>
>> Are we talking about Windows or Linux? The default file encoding should
>> derive from the OS. I believe that's defined by the specs.
>>
>> Sent from my iPhone
>>
>>
>> On Jul 14, 2009, at 5:51 AM, Charles Lee wrote:
>>
>>> On Tue, Jul 14, 2009 at 6:12 PM, Jimmy,Jing Lv wrote:
>>>
>>>> Hi,
>>>> Charles, I believe UTF-8 is the default encoding for RI, and it sounds
>>>> reasonable.
>>>> BTW, it may run into some compatibility problems, maybe we need to run
>>>> more tests to verify?
>>>>
>>>> 2009/7/14 Charles Lee
>>>>
>>>>> Hi guys:
>>>>> I am running the ant junit test cases and hitting some encoding
>>>>> problems. I think they may be caused by the different default
>>>>> encodings of RI and harmony. My locale is en_US.UTF-8; the RI default
>>>>> is UTF-8 but harmony's is 8859-1. Then I came across HARMONY-3736
>>>>> and the two diffs attached to that issue. It seems we always get
>>>>> 8859-1, because: (correct me if wrong :-)
>>>>>
>>>>> 1. we removed the set code in the vm, so we will always get null if
>>>>> we call the vm method
>>>>> 2. we set file.encoding in libglob.c; if we got null from the vm, we
>>>>> set
>>>
>>> Sorry, it should be luniglob.c
>>>
>>>>> 8859-1.
>>>>> 3. we cannot set file.encoding at run time.
>>>>>
>>>>> ant uses UTF-8 to encode file names that contain non-ASCII characters.
>>>>> So why do we use iso8859-1 as our unchangeable default?
>>>>> The wiki page http://en.wikipedia.org/wiki/ISO8859-1 says "In
>>>>> computing applications, encodings that provide full UCS support (such
>>>>> as UTF-8 and UTF-16) are finding increasing favor over encodings based
>>>>> on ISO 8859-1." Should we simply change iso8859-1 to utf-8?
>>>>>
>>>>> --
>>>>> Yours sincerely,
>>>>> Charles Lee
>>>>
>>>> --
>>>>
>>>> Best Regards!
>>>>
>>>> Jimmy, Jing Lv
>>>> China Software Development Lab, IBM
>>>
>>> --
>>> Yours sincerely,
>>> Charles Lee
>

-- 
Best Regards,
Regis.
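
The fallback order discussed in this thread (use the value supplied by the VM if there is one, otherwise derive the charset from the OS locale, and only then fall back to ISO-8859-1) can be sketched roughly as below. This is an illustration only, not Harmony's actual code: the class DefaultEncodingSketch and its two methods are hypothetical names, and reading the locale from the LANG environment variable is an assumption about a POSIX-style system. The real logic under discussion lives in native code (luniglob.c).

```java
import java.nio.charset.Charset;

// Hypothetical sketch of the fallback order discussed in the thread.
public class DefaultEncodingSketch {

    // Extracts the codeset from a POSIX-style locale string such as
    // "en_US.UTF-8" or "de_DE.UTF-8@euro". Returns null if none is present.
    public static String charsetFromLocale(String locale) {
        if (locale == null) {
            return null;
        }
        int dot = locale.indexOf('.');
        if (dot < 0) {
            return null; // e.g. "C" or "en_US": no codeset part
        }
        String codeset = locale.substring(dot + 1);
        int at = codeset.indexOf('@');
        if (at >= 0) {
            codeset = codeset.substring(0, at); // drop "@modifier"
        }
        return codeset.isEmpty() ? null : codeset;
    }

    // Picks the default encoding: VM value first, then the locale's
    // codeset if the runtime supports it, then ISO-8859-1 as a last resort.
    public static String resolve(String vmValue, String locale) {
        if (vmValue != null && !vmValue.isEmpty()) {
            return vmValue;
        }
        String fromLocale = charsetFromLocale(locale);
        if (fromLocale != null) {
            try {
                if (Charset.isSupported(fromLocale)) {
                    return fromLocale;
                }
            } catch (IllegalArgumentException e) {
                // Malformed charset name in the locale: fall through.
            }
        }
        return "ISO-8859-1";
    }

    public static void main(String[] args) {
        // Assumption: LANG carries the locale, as on a typical Linux shell.
        System.out.println("resolved from LANG: "
                + resolve(null, System.getenv("LANG")));
        System.out.println("file.encoding:      "
                + System.getProperty("file.encoding"));
        System.out.println("defaultCharset():   " + Charset.defaultCharset());
    }
}
```

Running main on a machine with LANG=en_US.UTF-8 lets you compare what this sketch would pick against what the running VM actually reports for file.encoding.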