Return-Path: Delivered-To: apmail-lucene-java-dev-archive@www.apache.org Received: (qmail 9024 invoked from network); 5 Jan 2009 21:11:10 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 5 Jan 2009 21:11:10 -0000 Received: (qmail 71353 invoked by uid 500); 5 Jan 2009 21:11:08 -0000 Delivered-To: apmail-lucene-java-dev-archive@lucene.apache.org Received: (qmail 71311 invoked by uid 500); 5 Jan 2009 21:11:08 -0000 Mailing-List: contact java-dev-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-dev@lucene.apache.org Delivered-To: mailing list java-dev@lucene.apache.org Received: (qmail 71302 invoked by uid 99); 5 Jan 2009 21:11:08 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 05 Jan 2009 13:11:08 -0800 X-ASF-Spam-Status: No, hits=1.2 required=10.0 tests=SPF_NEUTRAL X-Spam-Check-By: apache.org Received-SPF: neutral (athena.apache.org: local policy) Received: from [209.85.217.10] (HELO mail-gx0-f10.google.com) (209.85.217.10) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 05 Jan 2009 21:10:59 +0000 Received: by gxk3 with SMTP id 3so1280263gxk.5 for ; Mon, 05 Jan 2009 13:10:36 -0800 (PST) Received: by 10.90.32.14 with SMTP id f14mr9824142agf.118.1231189836080; Mon, 05 Jan 2009 13:10:36 -0800 (PST) Received: from ?10.17.4.4? (pool-173-48-164-75.bstnma.fios.verizon.net [173.48.164.75]) by mx.google.com with ESMTPS id 38sm52348623agd.18.2009.01.05.13.10.34 (version=TLSv1/SSLv3 cipher=RC4-MD5); Mon, 05 Jan 2009 13:10:34 -0800 (PST) Message-Id: From: Michael McCandless To: java-dev@lucene.apache.org In-Reply-To: <49626480.50608@gmail.com> Content-Type: text/plain; charset=US-ASCII; format=flowed; delsp=yes Content-Transfer-Encoding: 7bit Mime-Version: 1.0 (Apple Message framework v929.2) Subject: Re: TestIndexInput test failures on jdk 1.6/linux after r641303 Date: Mon, 5 Jan 2009 16:10:33 -0500 References: <4961F4F8.9050701@gmail.com> <4962105A.8080709@gmail.com> <49621826.6050309@gmail.com> <49626480.50608@gmail.com> X-Mailer: Apple Mail (2.929.2) X-Virus-Checked: Checked by ClamAV on apache.org In fact I think the 2 test cases of the "modified UTF8 null bytes" are just bogus, because they are using Java's UTF8 charset decoder to construct a String when (as Ken points out) the byte sequence 0xC0 0x80 is illegal UTF8. I'll remove those 2 test cases. Mike Mark Miller wrote: > I think your on the right tack Ken. Don't know enough about Unicode > myself, but I was looking at this this morning, and what you say > somewhat jives with what I saw. > > I don't think you can just flip that switch though - the index > format will not match what its trying to read (having been written > in the new format). Which is why that couldn't have been intended to > test reading the old format, unless it was mistake (it was also > added when the format changed, so its not like it was left around). > > Possibly a mistake that works when your Unicode support is older? > (Im on Ubuntu 8.10 - not sure what that means to my Unicode level) > That unicode version comment looks very interesting - no one really > noticing this problem in America (thats mentioned it), and I think > Sami is in Europe. > > McCandless knows whats wrong I sure (he did that patch), but hes > either busy fixing it, or ordering another margarita in tijuana. > > In either case, I'm sure the issue will be resolved soon. > > - Mark > > Ken Krugler wrote: >>> Ok, it's not a java 1.6 thing it's something else. I also found a >>> box that runs that test ok. >> >> From what I can tell, this is the test that's failing: >> >> http://www.krugle.org/kse/entfiles/lucene/apache.org/java/trunk/src/test/org/apache/lucene/index/TestIndexInput.java#89 >> >> This is verifying that the "Modified UTF-8 null bytes" sequence is >> handled properly, from line 63 in the same file. >> >> I think this is the old, deprecated format for pre-2.4 indexes. >> >> So shouldn't there be a call to setModifiedUTF8StringsMode()? And >> since this is a one-way setting of the preUTF8Strings flag, It >> feels like this should be in a separate test. >> >> Without this call, you'll get the result of calling the String >> class's default constructor with an ill-formed UTF-8 sequence (for >> Unicode 3.1 or later), since 0xC0 0x80 isn't the shortest form for >> the u0000 code point. >> >> -- Ken >> >> >>> Mark Miller wrote: >>>> Hey Sami, I've been running tests quite a bit recently with >>>> Ubuntu 8.10 and OpenJDK 6 on a 64-bit machine, and I have not >>>> seen it once. >>>> Just tried again with Sun JDK 6 and 5 32-bit as well, and I am >>>> still not seeing it. >>>> >>>> Odd. >>>> >>>> - Mark >>>> >>>> Sami Siren wrote: >>>>> I am constantly seeing following error when running "ant test": >>>>> >>>>> [junit] Testcase: >>>>> testRead(org.apache.lucene.index.TestIndexInput): FAILED >>>>> [junit] expected:<[]> but was:<[??]> >>>>> [junit] junit.framework.ComparisonFailure: expected:<[]> but >>>>> was:<[??]> >>>>> [junit] at >>>>> org >>>>> .apache.lucene.index.TestIndexInput.testRead(TestIndexInput.java: >>>>> 89) >>>>> >>>>> on both intel and amd architectures running linux. >>>>> >>>>> java on AMD: >>>>> java version "1.6.0_11" >>>>> Java(TM) SE Runtime Environment (build 1.6.0_11-b03) >>>>> Java HotSpot(TM) 64-Bit Server VM (build 11.0-b16, mixed mode) >>>>> >>>>> java on Intel: >>>>> java version "1.6.0_0" >>>>> IcedTea6 1.4 (fedora-7.b12.fc10-x86_64) Runtime Environment >>>>> (build 1.6.0_0-b12) >>>>> OpenJDK 64-Bit Server VM (build 10.0-b19, mixed mode) >>>>> >>>>> java version "1.6.0_11" >>>>> Java(TM) SE Runtime Environment (build 1.6.0_11-b03) >>>>> Java HotSpot(TM) 64-Bit Server VM (build 11.0-b16, mixed mode) >>>>> >>>>> java version "1.6.0_11" >>>>> Java(TM) SE Runtime Environment (build 1.6.0_11-b03) >>>>> Java HotSpot(TM) Server VM (build 11.0-b16, mixed mode) >>>>> >>>>> Anyone else seeing this? >>>>> >>>>> -- >>>>> Sami Siren >>>>> >>>>> --------------------------------------------------------------------- >>>>> To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org >>>>> For additional commands, e-mail: java-dev-help@lucene.apache.org >>>>> >>>> >>>> >>>> --------------------------------------------------------------------- >>>> To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org >>>> For additional commands, e-mail: java-dev-help@lucene.apache.org >>>> >>> >>> >>> --------------------------------------------------------------------- >>> To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org >>> For additional commands, e-mail: java-dev-help@lucene.apache.org >> >> > > > --------------------------------------------------------------------- > To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org > For additional commands, e-mail: java-dev-help@lucene.apache.org > --------------------------------------------------------------------- To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org For additional commands, e-mail: java-dev-help@lucene.apache.org