Return-Path: Delivered-To: apmail-jackrabbit-users-archive@minotaur.apache.org Received: (qmail 21081 invoked from network); 28 Sep 2010 12:20:22 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 28 Sep 2010 12:20:22 -0000 Received: (qmail 51498 invoked by uid 500); 28 Sep 2010 12:20:22 -0000 Delivered-To: apmail-jackrabbit-users-archive@jackrabbit.apache.org Received: (qmail 51201 invoked by uid 500); 28 Sep 2010 12:20:18 -0000 Mailing-List: contact users-help@jackrabbit.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: users@jackrabbit.apache.org Delivered-To: mailing list users@jackrabbit.apache.org Received: (qmail 51192 invoked by uid 99); 28 Sep 2010 12:20:17 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 28 Sep 2010 12:20:17 +0000 X-ASF-Spam-Status: No, hits=0.0 required=10.0 tests=FREEMAIL_FROM,RCVD_IN_DNSWL_NONE,SPF_PASS,T_TO_NO_BRKTS_FREEMAIL X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of ianboston@gmail.com designates 74.125.82.50 as permitted sender) Received: from [74.125.82.50] (HELO mail-ww0-f50.google.com) (74.125.82.50) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 28 Sep 2010 12:20:08 +0000 Received: by wwc33 with SMTP id 33so1092277wwc.19 for ; Tue, 28 Sep 2010 05:19:48 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:received:received:sender:content-type :mime-version:subject:from:in-reply-to:date :content-transfer-encoding:message-id:references:to:x-mailer; bh=nCYZMSYq6nBB569+CAnTqRoUamWOta6OPYup/uQZANA=; b=RVF+nJP1gHwzJqZ9QJ4xlHoPSdz6mvSojZJXNbJysk/7KXnCK0dOeCMlnB8WCH762f H32USE0JI97M3TmFIzF82hgLarhPyDjFhy6vKTh0VJ8XpZkAQVUvg42MWtbE95W8xhux IhI9GN6P6/luKv+c+4VyyrQP375judzgQjRbY= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=sender:content-type:mime-version:subject:from:in-reply-to:date :content-transfer-encoding:message-id:references:to:x-mailer; b=io6h4v3pN5NQsVRSdwRm/kEBm9WgzdgQzkiLonaHxiYaPwo4F3Eie2X/zKN2Hk3Mo8 k1xTsBaU8eVBwXrE3VUlIqobv93zum/87eDLYxl7nVGMln1tRxi7ORtApGlm2pFrm3LU fCUt413AQCAwLWaCeewAXg2GqYwNmde9B4aaA= Received: by 10.216.159.6 with SMTP id r6mr976445wek.55.1285676388160; Tue, 28 Sep 2010 05:19:48 -0700 (PDT) Received: from dhcp-36.caret.private (ginger.caret.cam.ac.uk [131.111.21.21]) by mx.google.com with ESMTPS id k46sm4445898weq.10.2010.09.28.05.19.33 (version=TLSv1/SSLv3 cipher=RC4-MD5); Tue, 28 Sep 2010 05:19:38 -0700 (PDT) Sender: Ian Boston Content-Type: text/plain; charset=us-ascii Mime-Version: 1.0 (Apple Message framework v1081) Subject: Re: Lucene consistency in clustered environment From: Ian Boston In-Reply-To: <4CA1CD76.9010103@rug.nl> Date: Tue, 28 Sep 2010 13:19:32 +0100 Content-Transfer-Encoding: quoted-printable Message-Id: <33BB0FEA-96AD-4EA0-B5EE-4AE0C212B693@tfd.co.uk> References: <4CA1CD76.9010103@rug.nl> To: users@jackrabbit.apache.org X-Mailer: Apple Mail (2.1081) X-Virus-Checked: Checked by ClamAV on apache.org Did you setup the machines as a Jackrabbit cluster as per [1] right from = the start ? And do you have a complete Journal right from the first operation in the = JCR (or local state snapshots) The local state of the Lucene index is dependent on every node in the = cluster replaying every event from the Journal to ensure that they all = contain the same content. The Journal is also used for sharedItem state = invalidation which will ensure that stale items do not get into = responses. HTH Ian 1 http://wiki.apache.org/jackrabbit/Clustering On 28 Sep 2010, at 12:11, Dennis van der Laan wrote: > Hi all, >=20 > We are using Jackrabbit 1.6.1 in a production environment. It is > clustered across 6 machines, which all store the Lucene indexes on a > local disk. After using this setup for a couple of months, we are = seeing > the Lucene indexes differ per machine. Not only the size of the = indexes, > but some documents seem to be indexed on one machine, but not on = another > machine. So doing a full-text search (xpath with a ' contains' clause) > will have different results depending on the machine the query is run = on. > I am a complete Lucene novice, so my question is: is this > non-deterministic behaviour a characteristic of Lucene (and should we > rebuild all indexes on a regular basis to keep them in sync) or is > something going wrong here? >=20 > Thanks for any help! > Beste regards, > Dennis >=20 > --=20 > Dennis van der Laan, MSc > Centre for Information Technology > University of Groningen >=20