Return-Path: X-Original-To: apmail-lucene-java-user-archive@www.apache.org Delivered-To: apmail-lucene-java-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id B6181485D for ; Tue, 21 Jun 2011 16:28:52 +0000 (UTC) Received: (qmail 13577 invoked by uid 500); 21 Jun 2011 16:28:50 -0000 Delivered-To: apmail-lucene-java-user-archive@lucene.apache.org Received: (qmail 13516 invoked by uid 500); 21 Jun 2011 16:28:50 -0000 Mailing-List: contact java-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-user@lucene.apache.org Delivered-To: mailing list java-user@lucene.apache.org Received: (qmail 13508 invoked by uid 99); 21 Jun 2011 16:28:50 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 21 Jun 2011 16:28:50 +0000 X-ASF-Spam-Status: No, hits=1.3 required=5.0 tests=RCVD_IN_DNSWL_LOW,SPF_NEUTRAL,URI_HEX X-Spam-Check-By: apache.org Received-SPF: neutral (athena.apache.org: local policy) Received: from [209.85.212.48] (HELO mail-vw0-f48.google.com) (209.85.212.48) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 21 Jun 2011 16:28:45 +0000 Received: by vws7 with SMTP id 7so3098590vws.35 for ; Tue, 21 Jun 2011 09:28:24 -0700 (PDT) MIME-Version: 1.0 Received: by 10.52.98.73 with SMTP id eg9mr1847163vdb.110.1308673704184; Tue, 21 Jun 2011 09:28:24 -0700 (PDT) Received: by 10.52.169.130 with HTTP; Tue, 21 Jun 2011 09:28:24 -0700 (PDT) In-Reply-To: <1308561202025-3085456.post@n3.nabble.com> References: <1308561202025-3085456.post@n3.nabble.com> Date: Tue, 21 Jun 2011 12:28:24 -0400 Message-ID: Subject: Re: About IndexReader.reopen with very similar indexes From: Michael McCandless To: java-user@lucene.apache.org Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable Reopening is based entirely on the latest segments_N file present in the in= dex. Lucene loads that file and checks if it refers to any new segments not already open and if so opens those new ones. And segments in common with what the reader already has open (ie same segment name) are simply reused. Lucene doesn't look at file modification times, etc. Mike McCandless http://blog.mikemccandless.com On Mon, Jun 20, 2011 at 5:13 AM, Marc Sturlese wr= ote: > Hey there, > I have a doubt about the behaviour of IndexReader.reopen. > I have a tomcat server holding a lucene index over an IndexSearcher. If I > move the index.folder to index.folder.old and another index, let's say > index.folder.2 to index.folder and then I reopen readers, something weird > happen if the first index and the second have very similar size and are > built from scratch. It seems that when I get the new reader and compare w= ith > the new one: > > =A0IndexReader reader =3D ... > =A0... > =A0IndexReader newReader =3D r.reopen(); > =A0if (newReader !=3D reader) { > =A0... =A0 =A0 // reader was reopened > =A0 reader.close(); > =A0} > =A0reader =3D newReader; > =A0... > > Lucene does not detect that are different indexes. > Here you can see both indexes (have same number of files and names are > similar, but sizes are a bit different, as contained documents are not > exactly the same). > *This does not happen if indexes have much more differents sizes (and so, > file names will not be equal, ex: _4.fdt, etc) > > Index1: > -rw-r--r-- =A0 1 marc =A0admin =A0269289634 15 Jun 15:52 _3.fdt > -rw-r--r-- =A0 1 marc =A0admin =A0 =A02066764 15 Jun 15:52 _3.fdx > -rw-r--r-- =A0 1 marc =A0admin =A0 =A0 =A0 =A0463 15 Jun 15:52 _3.fnm > -rw-r--r-- =A0 1 marc =A0admin =A0 40358787 15 Jun 15:52 _3.frq > -rw-r--r-- =A0 1 marc =A0admin =A0 =A01033384 15 Jun 15:52 _3.nrm > -rw-r--r-- =A0 1 marc =A0admin =A0 27014923 15 Jun 15:52 _3.prx > -rw-r--r-- =A0 1 marc =A0admin =A0 =A0 234797 15 Jun 15:52 _3.tii > -rw-r--r-- =A0 1 marc =A0admin =A0 19322234 15 Jun 15:52 _3.tis > -rw-r--r-- =A0 1 marc =A0admin =A0 =A0 =A0 =A0 20 15 Jun 15:52 segments.g= en > -rw-r--r-- =A0 1 marc =A0admin =A0 =A0 =A0 =A0298 15 Jun 15:52 segments_2 > > Index2: > -rw-r--r-- =A0 1 marc =A0admin =A0269044254 15 Jun 15:52 _3.fdt > -rw-r--r-- =A0 1 marc =A0admin =A0 =A02068116 15 Jun 15:52 _3.fdx > -rw-r--r-- =A0 1 marc =A0admin =A0 =A0 =A0 =A0463 15 Jun 15:52 _3.fnm > -rw-r--r-- =A0 1 marc =A0admin =A0 40320465 15 Jun 15:52 _3.frq > -rw-r--r-- =A0 1 marc =A0admin =A0 =A01034060 15 Jun 15:52 _3.nrm > -rw-r--r-- =A0 1 marc =A0admin =A0 26967519 15 Jun 15:52 _3.prx > -rw-r--r-- =A0 1 marc =A0admin =A0 =A0 235895 15 Jun 15:52 _3.tii > -rw-r--r-- =A0 1 marc =A0admin =A0 19372446 15 Jun 15:52 _3.tis > -rw-r--r-- =A0 1 marc =A0admin =A0 =A0 =A0 =A0 20 15 Jun 15:52 segments.g= en > -rw-r--r-- =A0 1 marc =A0admin =A0 =A0 =A0 =A0298 15 Jun 15:52 segments_2 > > Can someone explain me the lucene criteria to decide if a segment has > changed or not? > Thanks in advance. > > > -- > View this message in context: http://lucene.472066.n3.nabble.com/About-In= dexReader-reopen-with-very-similar-indexes-tp3085456p3085456.html > Sent from the Lucene - Java Users mailing list archive at Nabble.com. > > --------------------------------------------------------------------- > To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org > For additional commands, e-mail: java-user-help@lucene.apache.org > > --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org For additional commands, e-mail: java-user-help@lucene.apache.org