Date: Thu, 04 Sep 2003 02:25:06 +0200
From: "A. Rothman"
To: POI Developers List
Subject: Re: unicode support
Message-id: <000a01c3727a$fe418830$0100a8c0@endor>

1. How do I post the changes (and the new TestUnicodeString class)?

2. The contract of the SSTRecord
   addString( final String string, final boolean useUTF16 ) method will
   change a bit: the flag will still force UTF-16 when set, but when it
   is not set, the 8-bit (compressed) representation will be used only
   if the string can be encoded as 8-bit chars; otherwise the string
   will be written as 16-bit as well. I hope this doesn't have any
   implications (except for progress :-) ).

----- Original Message -----
From: "Andrew C. Oliver"
To: "POI Developers List"
Sent: Wednesday, September 03, 2003 10:36 PM
Subject: Re: unicode support

> If you ruin it...the unit tests should fail. :-)
>
> On 9/3/03 5:27 PM, "A. Rothman" wrote:
>
> > I had performance in mind as well, but then I saw there
> >
> > String unicodeString = new
> > String(getString().getBytes("Unicode"),"Unicode");
> >
> > which is exactly the same performance-wise, only this doesn't do
> > anything (since the unicode conversion neither loses nor gains any
> > data - it just decomposes and recomposes the string). Also,
> > "Unicode" appears neither in the JVM required encodings list nor in
> > the Sun supported encodings...it may be very JVM dependent.
> >
> > I'd still like to hear if anyone knows what that code section does
> > before I ruin anything :-)
> >
> > ----- Original Message -----
> > From: "Andrew C. Oliver"
> > To: "POI Developers List"
> > Sent: Wednesday, September 03, 2003 10:18 PM
> > Subject: Re: unicode support
> >
> >> We've done detection before...it's always a huge performance and
> >> memory hog. In any case, I don't think this method has been tried.
> >> So go for it and see what happens.
> >>
> >> -Andy
> >>
> >> On 9/3/03 5:18 PM, "A. Rothman" wrote:
> >>
> >>> Hey guys,
> >>>
> >>> After experiencing some unicode trouble with HSSF today, namely
> >>> setting the cell encoding before setting its text (or lack
> >>> thereof...), I figured since Java is unicode based it's not too
> >>> friendly to require users to set unicode flags explicitly in
> >>> order for unicode to work.
> >>> I traced the problem down to UnicodeString.serialize(), and found
> >>> some rather strange code that seems to do nothing (decomposing
> >>> and re-creating a string, try and catch blocks that are
> >>> identical...anyone have any ideas?), and thought we should have
> >>> the serializer (or perhaps the constructor?) detect the case
> >>> where the String contains non-ASCII/ISO-Latin chars and set the
> >>> encoding automatically. This can be as simple as a
> >>>
> >>> if (str.equals(new String(str.getBytes("iso8859_1"),"iso8859_1"))) // string can be compressed
> >>> ...
> >>>
> >>> What do you say? Any implications I didn't think of?
> >>>
> >>>
> >>> -Amichai
> >>>
> >>
> >> --
> >> Andrew C. Oliver
> >> http://www.superlinksoftware.com/poi.jsp
> >> Custom enhancements and Commercial Implementation for Jakarta POI
> >>
> >> http://jakarta.apache.org/poi
> >> For Java and Excel, Got POI?
> >>
> >> The views expressed in this email are those of the author and are
> >> almost definitely not shared by the Apache Software Foundation, its
> >> board or its general membership. In fact they probably most
> >> definitively disagree with everything espoused in the above email.
> >>
> >>
> >> ---------------------------------------------------------------------
> >> To unsubscribe, e-mail: poi-dev-unsubscribe@jakarta.apache.org
> >> For additional commands, e-mail: poi-dev-help@jakarta.apache.org
> >>
>
> --
> Andrew C. Oliver
> http://www.superlinksoftware.com/poi.jsp
> Custom enhancements and Commercial Implementation for Jakarta POI
>
> http://jakarta.apache.org/poi
> For Java and Excel, Got POI?
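
For reference, the detection idea from the quoted thread can be sketched
roughly as follows. This is only an illustration against a modern JDK,
not the POI code: the class and method names are made up, and
StandardCharsets (Java 7 and later) is used in place of the "iso8859_1"
and "Unicode" charset names from the thread, on the assumption that
"Unicode" behaves like UTF-16 there. The sketch shows the ISO-8859-1
round-trip test, an equivalent per-char scan, and why a UTF-16 round
trip leaves a well-formed string unchanged.

```java
import java.nio.charset.StandardCharsets;

// Hypothetical helper, not part of POI: it only illustrates the idea.
final class Latin1Detection {

    // Round-trip check as proposed in the thread: encode to ISO-8859-1
    // and decode again. Chars that ISO-8859-1 cannot represent are
    // replaced during encoding, so the decoded string no longer equals
    // the input when something would be lost.
    static boolean fitsInLatin1ByRoundTrip(String s) {
        byte[] bytes = s.getBytes(StandardCharsets.ISO_8859_1);
        return s.equals(new String(bytes, StandardCharsets.ISO_8859_1));
    }

    // Same answer without the intermediate byte array: ISO-8859-1
    // covers exactly the code points U+0000 to U+00FF.
    static boolean fitsInLatin1ByScan(String s) {
        for (int i = 0; i < s.length(); i++) {
            if (s.charAt(i) > 0xFF) {
                return false;
            }
        }
        return true;
    }

    // UTF-16 can represent every Java char, so for well-formed strings
    // this round trip just rebuilds an equal string.
    static String utf16RoundTrip(String s) {
        byte[] bytes = s.getBytes(StandardCharsets.UTF_16);
        return new String(bytes, StandardCharsets.UTF_16);
    }

    public static void main(String[] args) {
        String latin = "caf\u00e9";                 // fits in 8 bits
        String hebrew = "\u05e9\u05dc\u05d5\u05dd"; // needs 16 bits

        System.out.println(fitsInLatin1ByRoundTrip(latin));        // true
        System.out.println(fitsInLatin1ByRoundTrip(hebrew));       // false
        System.out.println(fitsInLatin1ByScan(hebrew));            // false
        System.out.println(utf16RoundTrip(hebrew).equals(hebrew)); // true
    }
}
```

The per-char scan avoids allocating a byte array and a second String
for every call, which may matter given the performance concern raised
above, but neither variant has been measured here.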