Return-Path: Delivered-To: apmail-lucene-java-user-archive@www.apache.org Received: (qmail 3305 invoked from network); 11 Mar 2010 20:01:48 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 11 Mar 2010 20:01:48 -0000 Received: (qmail 63951 invoked by uid 500); 11 Mar 2010 20:01:12 -0000 Delivered-To: apmail-lucene-java-user-archive@lucene.apache.org Received: (qmail 63904 invoked by uid 500); 11 Mar 2010 20:01:11 -0000 Mailing-List: contact java-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-user@lucene.apache.org Delivered-To: mailing list java-user@lucene.apache.org Received: (qmail 63896 invoked by uid 99); 11 Mar 2010 20:01:11 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 11 Mar 2010 20:01:11 +0000 X-ASF-Spam-Status: No, hits=0.7 required=10.0 tests=SPF_NEUTRAL X-Spam-Check-By: apache.org Received-SPF: neutral (nike.apache.org: local policy) Received: from [74.125.83.48] (HELO mail-gw0-f48.google.com) (74.125.83.48) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 11 Mar 2010 20:01:03 +0000 Received: by gwaa11 with SMTP id a11so214152gwa.35 for ; Thu, 11 Mar 2010 12:00:42 -0800 (PST) MIME-Version: 1.0 Received: by 10.151.17.33 with SMTP id u33mr493412ybi.298.1268337642439; Thu, 11 Mar 2010 12:00:42 -0800 (PST) In-Reply-To: <2bfbd3ee1003111035j6e32181fs26d9cf14af80df99@mail.gmail.com> References: <2bfbd3ee1003111035j6e32181fs26d9cf14af80df99@mail.gmail.com> Date: Thu, 11 Mar 2010 15:00:42 -0500 Message-ID: <9ac0c6aa1003111200o6eab509cob2d8cca56f3b9f7d@mail.gmail.com> Subject: Re: Can 2.3 read indexes created by 2.4? From: Michael McCandless To: java-user@lucene.apache.org Content-Type: text/plain; charset=ISO-8859-1 X-Virus-Checked: Checked by ClamAV on apache.org Urgh, I failed to update the opening in fileformats.html (describing what's changed on each version). We also had a change in 3.0, from removing compressed fields. I'll fix... But: 2.3 can't read indexes created with 2.4 (and in general older Lucene releases very likely will not be able to read indexes created by newer versions, when there's been an index format change). Mike On Thu, Mar 11, 2010 at 1:35 PM, Nathanael D. Jones wrote: > Lucene 2.4 introduced a change not documented on the File Formats page > > *LUCENE-510: The index now stores strings as true UTF-8 bytes (previously it > was Java's modified UTF-8). If any text, either stored fields or a token, > has illegal UTF-16 surrogate characters, these characters are now silently > replaced with the Unicode replacement character U+FFFD. This is a change to > the index file format.* > *(Marvin Humphrey via Mike McCandless)* > > > Is there a reason this change isn't documened on the File Formats page of > any 2.4+ doc release? > > From the 3.0.1 docs: > > *http://lucene.apache.org/java/3_0_1/fileformats.html* > *"**Compatibility notes are provided in this document, describing how file > formats have changed from prior versions.* > > *In version 2.1, the file format was changed to allow lock-less commits (ie, > no more commit lock). The change is fully backwards compatible: you can open > a pre-2.1 index for searching or adding/deleting of docs. When the new > segments file is saved (committed), it will be written in the new file > format (meaning no specific "upgrade" process is needed). But note that once > a commit has occurred, pre-2.1 Lucene will not be able to read the index.* > > *In version 2.3, the file format was changed to allow segments to share a > single set of doc store (vectors & stored fields) files. This allows for > faster indexing in certain cases. The change is fully backwards compatible > (in the same way as the lock-less commits change in 2.1)."* > > > But no mention of the unicode change in version 2.4... was it somehow > forwards compatible? > > Can I read indexes created in 2.4 with a 2.3 compliant reader? > (Specifically, I'm interested in creating indexes with Lucene 3 and reading > them with CLucene on an iPhone.) > > Thanks, > > Nathanael > --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org For additional commands, e-mail: java-user-help@lucene.apache.org