Return-Path: Delivered-To: apmail-lucene-java-user-archive@www.apache.org Received: (qmail 5337 invoked from network); 30 Jun 2006 12:36:57 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (209.237.227.199) by minotaur.apache.org with SMTP; 30 Jun 2006 12:36:57 -0000 Received: (qmail 54704 invoked by uid 500); 30 Jun 2006 12:36:50 -0000 Delivered-To: apmail-lucene-java-user-archive@lucene.apache.org Received: (qmail 54677 invoked by uid 500); 30 Jun 2006 12:36:50 -0000 Mailing-List: contact java-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-user@lucene.apache.org Delivered-To: mailing list java-user@lucene.apache.org Received: (qmail 54666 invoked by uid 99); 30 Jun 2006 12:36:50 -0000 Received: from asf.osuosl.org (HELO asf.osuosl.org) (140.211.166.49) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 30 Jun 2006 05:36:50 -0700 X-ASF-Spam-Status: No, hits=0.0 required=10.0 tests= X-Spam-Check-By: apache.org Received-SPF: pass (asf.osuosl.org: local policy) Received: from [212.4.36.66] (HELO mail.agderweb.no) (212.4.36.66) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 30 Jun 2006 05:36:49 -0700 Received: from lettmelk [212.4.54.130] by mail.agderweb.no with ESMTP (SMTPD-8.22) id A6CE04D8; Fri, 30 Jun 2006 14:19:26 +0200 Date: Fri, 30 Jun 2006 14:36:17 +0200 To: java-user@lucene.apache.org Subject: Re: Are there any problems with the hits.length() in luc 1.4? Reply-To: aleksander.stensby@integrasco.no From: "Aleksander M. Stensby" Organization: Integrasco A/S Content-Type: text/plain; format=flowed; delsp=yes; charset=iso-8859-15 MIME-Version: 1.0 References: <8834A84C87A2C148AD46921BB8BFC97C01F1CF8C@S1SE1MAIL.emea1.ad.group> Content-Transfer-Encoding: 7bit Message-ID: In-Reply-To: <8834A84C87A2C148AD46921BB8BFC97C01F1CF8C@S1SE1MAIL.emea1.ad.group> User-Agent: Opera Mail/9.00 (Win32) X-Antivirus: avast! (VPS 0626-3, 29.06.2006), Outbound message X-Antivirus-Status: Clean X-Declude-Sender: aleksander.stensby@integrasco.no [212.4.54.130] X-Declude-Spoolname: D16ce01a90000a56c.smd X-Virus-Checked: Checked by ClamAV on apache.org X-Spam-Rating: minotaur.apache.org 1.6.2 0/1000/N Maybe you didn't close / open your writer/reader and that you where searching with a snapshot of the old index. And that after you had closed the writer and opened a new reader you got the correct result..? On Fri, 30 Jun 2006 14:22:40 +0200, Marcus Falck wrote: > Hi, > > > I'm indexing around 200 million articles in lucene. I have for the > moment added around 600000 articles. > > > Using this technique: > > 5000 docs in RAMDir > > Flush RAMDir to FSDir to create segmentfile with 5000. > > Mergefactor 10. > > > I'm searching using the multisearcher. > > > When I had around 490 000 docs all new documents appeared to be in the > index but the count for the word "kan" was just around 8000 hits. > > When the large merge occurred at 500.000 docs the "kan" count increased > to 65000 hits. > > > Any ideas`? > > > / > > Regards > > Marcus > > -- Aleksander M. Stensby Software Developer Integrasco A/S aleksander.stensby@integrasco.no Tlf.: +47 41 22 82 72 --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org For additional commands, e-mail: java-user-help@lucene.apache.org