Return-Path: Delivered-To: apmail-lucene-java-user-archive@www.apache.org Received: (qmail 22594 invoked from network); 14 Dec 2010 15:46:24 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 14 Dec 2010 15:46:24 -0000 Received: (qmail 41118 invoked by uid 500); 14 Dec 2010 15:46:22 -0000 Delivered-To: apmail-lucene-java-user-archive@lucene.apache.org Received: (qmail 40758 invoked by uid 500); 14 Dec 2010 15:46:19 -0000 Mailing-List: contact java-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-user@lucene.apache.org Delivered-To: mailing list java-user@lucene.apache.org Received: (qmail 40750 invoked by uid 99); 14 Dec 2010 15:46:18 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 14 Dec 2010 15:46:18 +0000 X-ASF-Spam-Status: No, hits=1.5 required=10.0 tests=FREEMAIL_FROM,HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS,T_TO_NO_BRKTS_FREEMAIL X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of erickerickson@gmail.com designates 209.85.216.48 as permitted sender) Received: from [209.85.216.48] (HELO mail-qw0-f48.google.com) (209.85.216.48) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 14 Dec 2010 15:46:13 +0000 Received: by qwh6 with SMTP id 6so873171qwh.35 for ; Tue, 14 Dec 2010 07:45:52 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:received:received:in-reply-to :references:date:message-id:subject:from:to:content-type; bh=2Qyi150n9LhW61R0OF5NLik6JWsYAz2Hbs8HwyIaLNA=; b=GAKbj2xyQS5iiIdW+IKw1wai7PKstjQmZdS3nhHS58++POtYc3rJ7QaXWacrmwJCwG JjJCqLcfVW2VrYIlB9n8jwk+aUBCXSIWNe7gbNpSgqjJah6CSdLpM8jtTEXYBWUslPqG vhMf4JrevwJ4Vxq7ncXUqsZooNhtCmTGK7IFU= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; b=LQ7AfBA2Q5tpxBcTXpnNmTrky7Jyj28zdgHCHmSKj03Sx4je/gtsLtqH9NmkuiYppU /eIpdPdYWgadJ9BZG49aAdjC4SiJql+Hie1r3z6AsFbWOtapN5EMpEx04vEt4HRSOz92 yWuh2vHaabYqKtthg0q5Xmty9bJk7S2i/8HC8= MIME-Version: 1.0 Received: by 10.229.241.13 with SMTP id lc13mr85505qcb.190.1292341552213; Tue, 14 Dec 2010 07:45:52 -0800 (PST) Received: by 10.229.235.208 with HTTP; Tue, 14 Dec 2010 07:45:52 -0800 (PST) In-Reply-To: References: Date: Tue, 14 Dec 2010 10:45:52 -0500 Message-ID: Subject: Re: Forcing specific index file names From: Erick Erickson To: java-user@lucene.apache.org Content-Type: multipart/alternative; boundary=0016363b8efc25efe8049760b75f --0016363b8efc25efe8049760b75f Content-Type: text/plain; charset=ISO-8859-1 I'm missing something here. You mention "two versions of a data set in a directory tree structure". The Lucene indexes will have different names if they have been merged. Usually this is a result of changing the data, issuing an optimize, etc. That is, the data *is* different so it seems perfectly appropriate to consider them new... Lucene never changes an existing segments file once it is committed. It only merges segments then deletes the old ones. So if the file names are different, then it seems that renaming them wouldn't be what you really want. So either it really is an XY problem (as in "I really don't think you want to do that") or I completely misunderstand what you're trying to do. Best Erick On Tue, Dec 14, 2010 at 9:46 AM, Earl Hood wrote: > On Tue, Dec 14, 2010 at 12:53 AM, Chris Hostetter > wrote: > > > > : It is possible to always have Lucene end up with the > > : same set of index filenames for each index generation > > : process? > > > > this smells like an XY problem .... why do you car what the file names > > are? that's an implementtaion detail of lucene -- the directory as a > whole > > is the index -- what are you trying to do that you are concerned about > > wanting to "rename" the files? > > I have to create patch sets against two version of a data > set in a directory tree structure, and the data set contains > a lucene index. > > However, if the filenames are not consistent for the index, > then the delta program thinks they are completely new > files vs just doing an xdelta on the index data. > > If renaming is not possible, the delta program will > have to have lucene awareness about variations in > the filenames between two versions of a data set > tree. I guess I will have to do this if I am going > to be lectured about how to develop software. > > From a design perspective, I figured if the process > that builds the data sets and the lucene index can > be modified to make sure the lucene index files are > consistently named, the delta computation program > can stay agnostic about lucene and just do a basic > tree differencing algorithm. > > --ewh > > --------------------------------------------------------------------- > To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org > For additional commands, e-mail: java-user-help@lucene.apache.org > > --0016363b8efc25efe8049760b75f--