From java-user-return-36606-apmail-lucene-java-user-archive=lucene.apache.org@lucene.apache.org Tue Oct 14 13:37:30 2008 Return-Path: Delivered-To: apmail-lucene-java-user-archive@www.apache.org Received: (qmail 56080 invoked from network); 14 Oct 2008 13:37:30 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 14 Oct 2008 13:37:30 -0000 Received: (qmail 4054 invoked by uid 500); 14 Oct 2008 13:37:24 -0000 Delivered-To: apmail-lucene-java-user-archive@lucene.apache.org Received: (qmail 4024 invoked by uid 500); 14 Oct 2008 13:37:24 -0000 Mailing-List: contact java-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-user@lucene.apache.org Delivered-To: mailing list java-user@lucene.apache.org Received: (qmail 4013 invoked by uid 99); 14 Oct 2008 13:37:24 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 14 Oct 2008 06:37:24 -0700 X-ASF-Spam-Status: No, hits=-0.0 required=10.0 tests=SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: local policy) Received: from [140.203.201.101] (HELO mx2.nuigalway.ie) (140.203.201.101) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 14 Oct 2008 13:36:17 +0000 X-IronPort-AV: E=Sophos;i="4.33,409,1220223600"; d="scan'208";a="95892068" Received: from exbe1.ac.nuigalway.ie (HELO EVS1.ac.nuigalway.ie) ([10.132.157.11]) by mx2.nuigalway.ie with ESMTP; 14 Oct 2008 14:35:43 +0100 Received: from EVS1.ac.nuigalway.ie ([10.132.157.14]) by EVS1.ac.nuigalway.ie with Microsoft SMTPSVC(6.0.3790.3959); Tue, 14 Oct 2008 14:35:41 +0100 Received: from [10.2.18.102] ([140.203.154.11]) by EVS1.ac.nuigalway.ie over TLS secured channel with Microsoft SMTPSVC(6.0.3790.3959); Tue, 14 Oct 2008 14:35:41 +0100 Message-ID: <48F4A02D.3000702@deri.org> Date: Tue, 14 Oct 2008 14:35:41 +0100 From: Renaud Delbru User-Agent: Thunderbird 2.0.0.17 (X11/20080925) MIME-Version: 1.0 To: java-user@lucene.apache.org Subject: Re: Modification of positional information encoding References: In-Reply-To: Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit X-OriginalArrivalTime: 14 Oct 2008 13:35:41.0217 (UTC) FILETIME=[C0421110:01C92E01] X-Virus-Checked: Checked by ClamAV on apache.org Hi Michael, Michael McCandless wrote: > Also, this issue was just opened: > > > https://issues.apache.org/jira/browse/LUCENE-1419 > > which would make it possible for classes in the same package > (oal.index) to use their own indexing chain. With that fix, if you > make your own classes in oal.index package, and perhaps subclass the > above classes, you could then create your own indexing chain for > indexing? If you take that approach, please report back so we can > learn how to improve Lucene for these very advanced customizations! > As a first impression, what will be handy in order to customize postings list will be to make an abstract class FreqProxTermsWriter, that separates segment creation and term information serialisation. This class will implement the generic logic for flushing and appending postings, but will delegate to subclasses the way you write doc + freq and prox + payload info. A first idea will be to have the following abstract methods: - writeMinState : called by appendPostings, and define how to serialise one FreqProxFieldMergeState - writeDocFreq : called by writeMinState, and define how to serialise docs and freq - writeProx: called by writeMinState and define how to serialise positions and payloads I think other parts of the FreqProxTermsWriter can stay generic. What do you think ? Regards. -- Renaud Delbru --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org For additional commands, e-mail: java-user-help@lucene.apache.org