Return-Path: Delivered-To: apmail-lucene-dev-archive@www.apache.org Received: (qmail 99560 invoked from network); 24 Mar 2011 17:25:47 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 24 Mar 2011 17:25:47 -0000 Received: (qmail 10554 invoked by uid 500); 24 Mar 2011 17:25:45 -0000 Delivered-To: apmail-lucene-dev-archive@lucene.apache.org Received: (qmail 10496 invoked by uid 500); 24 Mar 2011 17:25:45 -0000 Mailing-List: contact dev-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@lucene.apache.org Delivered-To: mailing list dev@lucene.apache.org Received: (qmail 10485 invoked by uid 99); 24 Mar 2011 17:25:45 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 24 Mar 2011 17:25:45 +0000 X-ASF-Spam-Status: No, hits=-1999.7 required=5.0 tests=ALL_TRUSTED,T_RP_MATCHES_RCVD,URIBL_RHS_DOB X-Spam-Check-By: apache.org Received: from [140.211.11.116] (HELO hel.zones.apache.org) (140.211.11.116) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 24 Mar 2011 17:25:43 +0000 Received: from hel.zones.apache.org (hel.zones.apache.org [140.211.11.116]) by hel.zones.apache.org (Postfix) with ESMTP id CB1B04CBA8 for ; Thu, 24 Mar 2011 17:25:05 +0000 (UTC) Date: Thu, 24 Mar 2011 17:25:05 +0000 (UTC) From: "Michael McCandless (JIRA)" To: dev@lucene.apache.org Message-ID: <1101492702.8819.1300987505828.JavaMail.tomcat@hel.zones.apache.org> In-Reply-To: <1304404801.5883.1300883165746.JavaMail.tomcat@hel.zones.apache.org> Subject: [jira] [Commented] (LUCENE-2985) Build SegmentCodecs incrementally for consistent codecIDs during indexing MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 X-Virus-Checked: Checked by ClamAV on apache.org [ https://issues.apache.org/jira/browse/LUCENE-2985?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13010778#comment-13010778 ] Michael McCandless commented on LUCENE-2985: -------------------------------------------- Patch looks great! I wonder if we should pass the segmentCodecsBuilder to FieldInfos? This way, FieldInfos.add/update could set the codecID, instead of caller doing it after the fact (in DocFieldProcessorPerThread)? > Build SegmentCodecs incrementally for consistent codecIDs during indexing > ------------------------------------------------------------------------- > > Key: LUCENE-2985 > URL: https://issues.apache.org/jira/browse/LUCENE-2985 > Project: Lucene - Java > Issue Type: Improvement > Components: Codecs, Index > Affects Versions: CSF branch, 4.0 > Reporter: Simon Willnauer > Assignee: Simon Willnauer > Fix For: CSF branch, 4.0 > > Attachments: LUCENE-2985.patch > > > currently we build the SegementCodecs during flush which is fine as long as no codec needs to know which fields it should handle. This will change with DocValues or when we expose StoredFields / TermVectors via Codec (see LUCENE-2621 or LUCENE-2935). The other downside it that we don't have a consistent view of which codec belongs to which field during indexing and all FieldInfo instances are unassigned (set to -1). Instead we should build the SegmentCodecs incrementally as fields come in so no matter when a codec needs to be selected to process a document / field we have the right codec ID assigned. -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org For additional commands, e-mail: dev-help@lucene.apache.org