From: Andrzej Bialecki
Date: Mon, 27 Mar 2006 19:31:21 +0200
To: java-user@lucene.apache.org
Subject: Re: Lucene indexing on Hadoop distributed file system
Message-ID: <44282169.8070800@getopt.org>
In-Reply-To: <44280FEF.30100@apache.org>
References: <158e71710603252353uace38b4k810589d80745d6ad@mail.gmail.com> <44280FEF.30100@apache.org>

Doug Cutting wrote:
> Igor Bolotin wrote:
>> If somebody is interested - I can post our changes in TermInfosWriter
>> and SegmentTermEnum code, although they are pretty trivial.
>
> Please submit this as a patch attached to a bug report.
>
> I contemplated making this change to Lucene myself, when writing
> Nutch's FsDirectory, but thought that no one else would ever be
> interested in using it. Now that's been proven wrong!
>
> Note that any change to the file format must be back-compatible.

This could be solved by putting a marker value in the first 8 bytes
(== -1L), which would indicate that the real length is at the end. This
way the new implementation will be able to read old indexes.

-- 
Best regards,
Andrzej Bialecki     <><
 ___. ___ ___ ___ _ _   __________________________________
[__ || __|__/|__||\/|  Information Retrieval, Semantic Web
___|||__||  \|  ||  |  Embedded Unix, System Integration
http://www.sigram.com  Contact: info at sigram dot com
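
For illustration, the marker idea above could be sketched roughly as follows. This is only a sketch under stated assumptions, not the actual TermInfosWriter or SegmentTermEnum code: the class name, the method names and the toy layout (an 8-byte header followed by 8-byte entries) are all hypothetical. The only parts taken from the message are the -1L marker in the first 8 bytes and the real length stored at the end.

```java
import java.io.ByteArrayOutputStream;
import java.io.DataOutputStream;
import java.io.IOException;
import java.nio.ByteBuffer;

// Hypothetical toy format, NOT Lucene's real file layout.
class TrailingLengthSketch {
    // Marker from the message: -1L in the first 8 bytes means
    // "the real length is stored in the last 8 bytes".
    static final long LENGTH_AT_END = -1L;

    // Old layout: [length:8][entries...]. The writer must know the
    // length up front, or seek back to the start to fill it in.
    static byte[] writeOldStyle(long[] entries) throws IOException {
        ByteArrayOutputStream bytes = new ByteArrayOutputStream();
        DataOutputStream out = new DataOutputStream(bytes);
        out.writeLong(entries.length);
        for (long e : entries) {
            out.writeLong(e);
        }
        out.flush();
        return bytes.toByteArray();
    }

    // New layout: [-1:8][entries...][length:8]. Strictly append-only,
    // so it also works on a file system without seek-on-write.
    static byte[] writeNewStyle(long[] entries) throws IOException {
        ByteArrayOutputStream bytes = new ByteArrayOutputStream();
        DataOutputStream out = new DataOutputStream(bytes);
        out.writeLong(LENGTH_AT_END);
        for (long e : entries) {
            out.writeLong(e);
        }
        out.writeLong(entries.length);
        out.flush();
        return bytes.toByteArray();
    }

    // Reader that accepts both layouts: a header other than the marker
    // is an old-style length; the marker redirects to the trailer.
    static long readLength(byte[] file) {
        // ByteBuffer is big-endian by default, matching DataOutputStream.
        ByteBuffer buf = ByteBuffer.wrap(file);
        long header = buf.getLong(0);
        if (header != LENGTH_AT_END) {
            return header;
        }
        return buf.getLong(file.length - Long.BYTES);
    }

    public static void main(String[] args) throws IOException {
        long[] entries = {10L, 20L, 30L};
        System.out.println(readLength(writeOldStyle(entries))); // prints 3
        System.out.println(readLength(writeNewStyle(entries))); // prints 3
    }
}
```

The marker works only because a real length can never be negative, so -1L cannot be mistaken for an old-style header. Old readers, on the other hand, would still not understand the new layout; the scheme gives backward compatibility (new code reads old indexes), not forward compatibility.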