From: "Michael McCandless (JIRA)"
To: java-dev@lucene.apache.org
Date: Wed, 17 Feb 2010 10:55:27 +0000 (UTC)
Subject: [jira] Commented: (LUCENE-1761) low level Field metadata is never removed from index
Message-ID: <125754015.332331266404127969.JavaMail.jira@brutus.apache.org>
In-Reply-To: <1099488265.1248492531830.JavaMail.jira@brutus>

[
https://issues.apache.org/jira/browse/LUCENE-1761?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12834730#action_12834730 ]

Michael McCandless commented on LUCENE-1761:
--------------------------------------------

I don't think there will be much perf loss.

Each dead field will cause a FieldInfo instance to be created (which is very small).

Norms won't be loaded unless something explicitly asks for them. E.g., if you do a search against the dead field, that will create the 1 byte per doc array. If you do a sort against the dead field, FieldCache will be populated (which is silly since the values will all be null/0).

But if no searching is done against the fields I believe there's very little cost.

But we really should fix merging to purge fields that don't occur anymore...

> low level Field metadata is never removed from index
> ----------------------------------------------------
>
>                 Key: LUCENE-1761
>                 URL: https://issues.apache.org/jira/browse/LUCENE-1761
>             Project: Lucene - Java
>          Issue Type: Bug
>          Components: Index
>    Affects Versions: 2.2, 2.3, 2.3.1, 2.3.2, 2.4, 2.4.1
>            Reporter: Hoss Man
>            Priority: Minor
>         Attachments: LUCENE-1761.patch
>
>
> With heterogeneous docs, or an index whose fields evolve over time, field names that are no longer used (i.e., all docs that ever referenced them have been deleted) still show up when you use IndexReader.getFieldNames.
> It seems logical that segment merging should only preserve metadata about fields that actually exist in the new segment, but even after deleting all documents from an index and optimizing, the old field names are still present.
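
To make the difference between the reported behaviour and the suggested fix concrete, here is a minimal toy model in Java. It is only a sketch under stated assumptions: `Doc`, `Segment`, `mergeKeepingAllFields` and `mergePurgingDeadFields` are hypothetical names invented for illustration and are not Lucene classes or methods, and real merging works on FieldInfos and postings rather than on in-memory sets. The model assumes the reported behaviour amounts to taking the union of the input segments' field names, while the suggested behaviour keeps only names used by at least one surviving document.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Set;

// Toy model only: Doc and Segment are hypothetical types, not Lucene classes.
final class DeadFieldPurgeSketch {

    /** A document reduced to its field names and a deleted flag. */
    static final class Doc {
        final boolean deleted;
        final Set<String> fields;

        Doc(boolean deleted, String... fields) {
            this.deleted = deleted;
            this.fields = new LinkedHashSet<>(Arrays.asList(fields));
        }
    }

    /** A segment: its documents plus the per-segment field-name metadata. */
    static final class Segment {
        final List<Doc> docs;
        final Set<String> fieldNames;

        Segment(List<Doc> docs, Set<String> fieldNames) {
            this.docs = docs;
            this.fieldNames = fieldNames;
        }

        /** Builds a freshly written segment whose metadata lists every field of every doc. */
        static Segment of(Doc... docs) {
            Set<String> names = new LinkedHashSet<>();
            for (Doc d : docs) {
                names.addAll(d.fields);
            }
            return new Segment(new ArrayList<>(Arrays.asList(docs)), names);
        }
    }

    /** Collects the non-deleted docs of all input segments. */
    static List<Doc> liveDocs(List<Segment> segments) {
        List<Doc> live = new ArrayList<>();
        for (Segment s : segments) {
            for (Doc d : s.docs) {
                if (!d.deleted) {
                    live.add(d);
                }
            }
        }
        return live;
    }

    /** Assumed model of the reported behaviour: metadata is the union of the inputs' metadata. */
    static Segment mergeKeepingAllFields(List<Segment> segments) {
        Set<String> names = new LinkedHashSet<>();
        for (Segment s : segments) {
            names.addAll(s.fieldNames);
        }
        return new Segment(liveDocs(segments), names);
    }

    /** Suggested behaviour: keep only field names used by at least one surviving doc. */
    static Segment mergePurgingDeadFields(List<Segment> segments) {
        List<Doc> live = liveDocs(segments);
        Set<String> names = new LinkedHashSet<>();
        for (Doc d : live) {
            names.addAll(d.fields);
        }
        return new Segment(live, names);
    }

    public static void main(String[] args) {
        // "oldField" is referenced only by a deleted doc, so it is dead after the merge.
        Segment a = Segment.of(new Doc(true, "id", "oldField"), new Doc(false, "id", "title"));
        Segment b = Segment.of(new Doc(false, "id", "body"));
        List<Segment> input = Arrays.asList(a, b);
        System.out.println("union of metadata: " + mergeKeepingAllFields(input).fieldNames);
        System.out.println("purged metadata:   " + mergePurgingDeadFields(input).fieldNames);
    }
}
```

In this model both merges produce the same live documents; only the surviving field-name metadata differs, which matches the point above that a dead field costs little as long as nothing searches or sorts on it.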