Return-Path: Delivered-To: apmail-lucene-java-dev-archive@www.apache.org Received: (qmail 34584 invoked from network); 11 Apr 2006 23:49:45 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (209.237.227.199) by minotaur.apache.org with SMTP; 11 Apr 2006 23:49:45 -0000 Received: (qmail 38420 invoked by uid 500); 11 Apr 2006 23:49:43 -0000 Delivered-To: apmail-lucene-java-dev-archive@lucene.apache.org Received: (qmail 38390 invoked by uid 500); 11 Apr 2006 23:49:43 -0000 Mailing-List: contact java-dev-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-dev@lucene.apache.org Delivered-To: mailing list java-dev@lucene.apache.org Received: (qmail 38379 invoked by uid 99); 11 Apr 2006 23:49:42 -0000 Received: from asf.osuosl.org (HELO asf.osuosl.org) (140.211.166.49) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 11 Apr 2006 16:49:42 -0700 X-ASF-Spam-Status: No, hits=0.0 required=10.0 tests= X-Spam-Check-By: apache.org Received-SPF: pass (asf.osuosl.org: local policy) Received: from [12.154.210.214] (HELO rectangular.com) (12.154.210.214) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 11 Apr 2006 16:49:41 -0700 Received: from [67.189.26.9] (helo=[10.0.1.2]) by rectangular.com with esmtpa (Exim 4.44) id 1FTT5w-000M4P-3h for java-dev@lucene.apache.org; Tue, 11 Apr 2006 17:20:28 -0700 Mime-Version: 1.0 (Apple Message framework v749.3) In-Reply-To: <4728322A-AD68-4388-955C-F6BED7861B6F@rectangular.com> References: <4728322A-AD68-4388-955C-F6BED7861B6F@rectangular.com> Content-Type: text/plain; charset=US-ASCII; delsp=yes; format=flowed Message-Id: Content-Transfer-Encoding: 7bit From: Marvin Humphrey Subject: Re: bytecount as prefix Date: Tue, 11 Apr 2006 16:49:18 -0700 To: java-dev@lucene.apache.org X-Mailer: Apple Mail (2.749.3) X-Virus-Checked: Checked by ClamAV on apache.org X-Spam-Rating: minotaur.apache.org 1.6.2 0/1000/N On Apr 11, 2006, at 12:05 PM, Marvin Humphrey wrote: > TestRangeFilter. A phantom blank Term shows up out of nowhere in the middle of the merge process. When you stick a System.err.println into TermInfosWriter's writeTerm, you ordinarily see it adding Terms in proper sort order: [junit] TINFO: : [junit] TINFO: body:body [junit] TINFO: id:000000000000 [junit] TINFO: rand:-00953139433 [junit] TINFO: : [junit] TINFO: body:body [junit] TINFO: id:000000000001 [junit] TINFO: rand:000015869780 Here's several docs being merged together: [junit] TINFO: : [junit] TINFO: body:body [junit] TINFO: id:000000000009 [junit] TINFO: rand:-00563669564 [junit] TINFO: : [junit] TINFO: body:body [junit] TINFO: id:000000000000 [junit] TINFO: id:000000000001 [junit] TINFO: id:000000000002 [junit] TINFO: id:000000000003 [junit] TINFO: id:000000000004 [junit] TINFO: id:000000000005 [junit] TINFO: id:000000000006 [junit] TINFO: id:000000000007 [junit] TINFO: id:000000000008 [junit] TINFO: id:000000000009 [junit] TINFO: rand:-00072576061 [junit] TINFO: rand:-00260794310 [junit] TINFO: rand:-00563669564 [junit] TINFO: rand:-00953139433 [junit] TINFO: rand:-01094000683 [junit] TINFO: rand:-01481464619 [junit] TINFO: rand:-02099458317 [junit] TINFO: rand:000015869780 [junit] TINFO: rand:001019870061 [junit] TINFO: rand:001565603387 [junit] TINFO: : [junit] TINFO: body:body [junit] TINFO: id:000000000010 [junit] TINFO: rand:001271292228 At some point, late in the merge process, this happens: [junit] TermInfosWriter: rand:-00449774276 [junit] TermInfosWriter: rand:-00467363681 [junit] TermInfosWriter: rand:-00479945420 [junit] TermInfosWriter: rand:-00506239929 [junit] TermInfosWriter: : // Huh???? [junit] TermInfosWriter: rand:-00512006124 [junit] TermInfosWriter: rand:-00526876979 // <- look at this number [junit] TermInfosWriter: rand:-00531589361 [junit] TermInfosWriter: rand:-00563669564 [junit] TermInfosWriter: rand:-00638261924 Here's the first few terms coming off of a Term Enum, later. As you can see, the sort order is messed up. That's because the .tis stream has gotten out of sync somehow. [junit] TERMS: [junit] rand:26876979 // <- the last few digits of that number from earlier [junit] rand:31589361 [junit] rand:63669564 [junit] rand:638261924 [junit] rand:733778983 [junit] rand:770310547 [junit] rand:806409190 [junit] rand:849606785 [junit] rand:869935672 [junit] rand:927974448 [junit] rand:953139433 [junit] rand:954514004 [junit] rand:961290394 [junit] rand:1067018129 [junit] rand:1081398108 [junit] rand:1094000683 [junit] rand:1139978555 [junit] rand:1231799109 I'm stumped for now. Marvin Humphrey Rectangular Research http://www.rectangular.com/ --------------------------------------------------------------------- To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org For additional commands, e-mail: java-dev-help@lucene.apache.org