Return-Path: X-Original-To: apmail-incubator-lucy-user-archive@www.apache.org Delivered-To: apmail-incubator-lucy-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 1575997B1 for ; Fri, 18 Nov 2011 18:14:47 +0000 (UTC) Received: (qmail 92511 invoked by uid 500); 18 Nov 2011 18:14:47 -0000 Delivered-To: apmail-incubator-lucy-user-archive@incubator.apache.org Received: (qmail 92484 invoked by uid 500); 18 Nov 2011 18:14:47 -0000 Mailing-List: contact lucy-user-help@incubator.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: lucy-user@incubator.apache.org Delivered-To: mailing list lucy-user@incubator.apache.org Received: (qmail 92476 invoked by uid 99); 18 Nov 2011 18:14:46 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 18 Nov 2011 18:14:46 +0000 X-ASF-Spam-Status: No, hits=-0.7 required=5.0 tests=FREEMAIL_FROM,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of gorankent@gmail.com designates 209.85.212.47 as permitted sender) Received: from [209.85.212.47] (HELO mail-vw0-f47.google.com) (209.85.212.47) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 18 Nov 2011 18:14:41 +0000 Received: by vbbey12 with SMTP id ey12so422830vbb.6 for ; Fri, 18 Nov 2011 10:14:20 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type:content-transfer-encoding; bh=qx6gUJ9Hzu1TBVRcmhnlsX4r1nkCRYZvKe1ifl20uqU=; b=pUxEnYIC/RbV7a4ZxF2xztq7DH6rC/aIy8oPjDorglyE0QqJgbaSaXpq6YutqlQYNJ WTwG/mK2xG65HzZbyP12jeuIDcRrNRZwb7dWecXjJ+xYSKOUhltGqW5K29Rci1Md0cJf 6ossZA7W+kH4FZaLtnYz/rAA2ZiZq+t1sy2Bc= MIME-Version: 1.0 Received: by 10.52.89.3 with SMTP id bk3mr4542434vdb.92.1321640060831; Fri, 18 Nov 2011 10:14:20 -0800 (PST) Received: by 10.52.188.10 with HTTP; Fri, 18 Nov 2011 10:14:20 -0800 (PST) In-Reply-To: <20111118151820.GA19347@rectangular.com> References: <20111117061703.GA27162@rectangular.com> <20111117070813.GA27239@rectangular.com> <20111118151820.GA19347@rectangular.com> Date: Fri, 18 Nov 2011 20:14:20 +0200 Message-ID: From: goran kent To: lucy-user@incubator.apache.org Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable Subject: Re: [lucy-user] Couldn't completely remove 'seg_N' On Fri, Nov 18, 2011 at 5:18 PM, Marvin Humphrey w= rote: >> The lockfile contains: >> { >> =A0 "host": "host6", >> =A0 "name": "write", >> =A0 "pid": "24342" >> } > > OK, all that looks correct. =A0Also, since the lockfile is still there an= d > definitely corresponds to the process that crashed, we can assume that no > other process has messed with the index directory since. > > Question: is there a seg_2 folder in the index dir? =A0If so, is there an= ything > inside it? Yes, seg_2 was there - it also had the same timestamp as the lockfile, implying it had been created by the same process that created the lockfile itself. Secondly, no it was empty. > It could be NFS cache consistency: a deletion operation succeeds, and the= item > is really gone from the NFS drive, but the local cache of the NFS client > doesn't get updated in time and a subsequent check on whether the item ex= ists > returns an incorrect result. > > =A0 =A0http://nfs.sourceforge.net/#faq_a8 > > =A0 =A0Perfect cache coherency among disparate NFS clients is very expens= ive to > =A0 =A0achieve, so NFS settles for something weaker that satisfies the > =A0 =A0requirements of most everyday types of file sharing. > > A tremendous amount of energy has gone into making NFS mimic local file s= ystem > behaviors as closely as possible, both by the NFS devs and by us (see > ) = but > it's a very hard problem and compromises are impossible to avoid. Some food for thought, thanks. I'll start looking into my index store servers and their NFS exports. > Best practice would be to avoid writing to Lucy indexes on NFS drives if > possible. =A0Read performance is going to be lousy anyway unless you make= the > NFS mount read-only. Just too much data (and we needed redundancy), and the load needs to be spread across as many storage nodes as possible (we have separate source-store and index-store servers). When all the indexing machines are grinding away in unison, sucking from the source-store servers (via NFS), and writing to the index-store servers (also via NFS), the load can get quite high and some NFS tweaking has been done on the source/target servers. Merging nodes reads indexes from the index-store servers (NFS) and writes them on the search-server (NFS) nodes themselves. Searching is then always local. The load at the moment is negligible, so I know it's not causing a problem with NFS - however, with NFS you never know, so I'll be focusing on that next. Thanks for your comments!