From: "Wojtek Piaseczny (JIRA)"
To: dev@lucene.apache.org
Reply-To: dev@lucene.apache.org
Date: Fri, 18 Jun 2010 18:35:24 -0400 (EDT)
Message-ID: <22578223.87081276900524537.JavaMail.jira@thor>
Subject: [jira] Updated: (SOLR-1782) stats.facet assumes FieldCache.StringIndex - fails horribly on multivalued fields

     [ https://issues.apache.org/jira/browse/SOLR-1782?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Wojtek Piaseczny updated SOLR-1782:
-----------------------------------

    Attachment: SOLR-1782.patch

Added getValues method to UnInvertedField
class. This might be very slow with many big terms (I happen to have 0 big terms on the fields I'll be using this on). Both code paths that use StringIndex now use UnInvertedField if multi-valued fields are detected.

> stats.facet assumes FieldCache.StringIndex - fails horribly on multivalued fields
> ---------------------------------------------------------------------------------
>
>                 Key: SOLR-1782
>                 URL: https://issues.apache.org/jira/browse/SOLR-1782
>             Project: Solr
>          Issue Type: Bug
>          Components: search
>    Affects Versions: 1.4
>         Environment: reproduced on Win2k3 using 1.5.0-dev solr ($Id: CHANGES.txt 906924 2010-02-05 12:43:11Z noble $)
>            Reporter: Gerald DeConto
>         Attachments: index.rar, SOLR-1782.patch, SOLR-1782.test.patch
>
>
> The StatsComponent assumes any field specified in the stats.facet param can be faceted using FieldCache.DEFAULT.getStringIndex. This can cause problems with a variety of field types, but in the case of multivalued fields it can either produce erroneous stats when the number of distinct values is small, or throw ArrayIndexOutOfBoundsException when the number of distinct values is greater than the number of documents.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
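For readers unfamiliar with why a one-ordinal-per-document index breaks on multivalued fields, the two failure modes in the issue description can be reproduced with a toy model. This is only a sketch: it does not use Lucene or Solr, and the class `SingleValuedOrdinalSketch`, the nested `OrdIndex`, and the `build` method are hypothetical names invented for illustration. The assumption being modelled is a structure that stores exactly one term ordinal per document and sizes its term table from the document count.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Map;
import java.util.SortedMap;
import java.util.TreeMap;

// Toy model only: NOT Lucene's FieldCache code. All names here are hypothetical.
class SingleValuedOrdinalSketch {

    /** One ordinal per document, plus an ordinal -> term lookup table. */
    static final class OrdIndex {
        final String[] lookup; // ordinal -> term; slot 0 means "no value"
        final int[] order;     // docId -> ordinal (a single slot per document)

        OrdIndex(String[] lookup, int[] order) {
            this.lookup = lookup;
            this.order = order;
        }
    }

    /** Builds the index under the assumption that each doc has at most one term. */
    static OrdIndex build(List<List<String>> docs) {
        int maxDoc = docs.size();

        // Invert the documents: term -> list of docIds, terms in sorted order.
        SortedMap<String, List<Integer>> postings = new TreeMap<>();
        for (int doc = 0; doc < maxDoc; doc++) {
            for (String value : docs.get(doc)) {
                postings.computeIfAbsent(value, k -> new ArrayList<>()).add(doc);
            }
        }

        // Sized for "at most one distinct term per document".
        String[] lookup = new String[maxDoc + 1];
        int[] order = new int[maxDoc];

        int ord = 1;
        for (Map.Entry<String, List<Integer>> e : postings.entrySet()) {
            lookup[ord] = e.getKey();  // out of bounds once distinct terms exceed maxDoc
            for (int doc : e.getValue()) {
                order[doc] = ord;      // a later term silently overwrites an earlier one
            }
            ord++;
        }
        return new OrdIndex(lookup, order);
    }

    public static void main(String[] args) {
        // Few distinct values: no exception, but doc 0 loses its value "a".
        List<List<String>> few = Arrays.asList(
                Arrays.asList("a", "b"), Arrays.asList("a"), Arrays.asList("b"));
        OrdIndex idx = build(few);
        System.out.println("doc 0 is counted only under: " + idx.lookup[idx.order[0]]);

        // More distinct values than documents: the lookup table overflows.
        List<List<String>> many = Arrays.asList(
                Arrays.asList("a", "b", "c"), Arrays.asList("d"));
        try {
            build(many);
        } catch (ArrayIndexOutOfBoundsException ex) {
            System.out.println("overflow: " + ex.getClass().getSimpleName());
        }
    }
}
```

In the first case document 0 is attributed only to "b", so any per-facet statistics for "a" would miss it. In the second case four distinct terms do not fit a table sized for two documents. A multivalued-aware structure has to keep every ordinal for each document rather than a single slot.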