Return-Path:
Delivered-To: apmail-lucene-dev-archive@www.apache.org
Received: (qmail 60629 invoked from network); 29 Nov 2010 11:48:05 -0000
Received: from unknown (HELO mail.apache.org) (140.211.11.3)
  by 140.211.11.9 with SMTP; 29 Nov 2010 11:48:05 -0000
Received: (qmail 83649 invoked by uid 500); 29 Nov 2010 11:48:03 -0000
Delivered-To: apmail-lucene-dev-archive@lucene.apache.org
Received: (qmail 83194 invoked by uid 500); 29 Nov 2010 11:48:03 -0000
Mailing-List: contact dev-help@lucene.apache.org; run by ezmlm
Precedence: bulk
List-Help:
List-Unsubscribe:
List-Post:
List-Id:
Reply-To: dev@lucene.apache.org
Delivered-To: mailing list dev@lucene.apache.org
Received: (qmail 83185 invoked by uid 99); 29 Nov 2010 11:48:02 -0000
Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230)
  by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 29 Nov 2010 11:48:02 +0000
X-ASF-Spam-Status: No, hits=-2000.0 required=10.0 tests=ALL_TRUSTED
X-Spam-Check-By: apache.org
Received: from [140.211.11.22] (HELO thor.apache.org) (140.211.11.22)
  by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 29 Nov 2010 11:48:00 +0000
Received: from thor (localhost [127.0.0.1])
  by thor.apache.org (8.13.8+Sun/8.13.8) with ESMTP id oATBlc4s018105
  for ; Mon, 29 Nov 2010 11:47:39 GMT
Message-ID: <18168528.14851291031258736.JavaMail.jira@thor>
Date: Mon, 29 Nov 2010 06:47:38 -0500 (EST)
From: "Anarkii (JIRA)"
To: dev@lucene.apache.org
Subject: [jira] Commented: (SOLR-1782) stats.facet assumes FieldCache.StringIndex - fails horribly on multivalued fields
MIME-Version: 1.0
Content-Type: text/plain; charset=utf-8
Content-Transfer-Encoding: 7bit
X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394
X-Virus-Checked: Checked by ClamAV on apache.org

    [ https://issues.apache.org/jira/browse/SOLR-1782?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12964706#action_12964706 ]

Anarkii commented on SOLR-1782:
-------------------------------

Is there any update on this issue?
Or will Wojtek's fix be merged into trunk? I'm trying to run stats.facet on a field that contains multiple tokens, and I hit the same problem, which stems from getStringIndex. For example, when I run stats.facet on a field "tags" that can contain multiple tokens, only one of the tags is considered.

> stats.facet assumes FieldCache.StringIndex - fails horribly on multivalued fields
> ---------------------------------------------------------------------------------
>
>                 Key: SOLR-1782
>                 URL: https://issues.apache.org/jira/browse/SOLR-1782
>             Project: Solr
>          Issue Type: Bug
>          Components: search
>    Affects Versions: 1.4
>         Environment: reproduced on Win2k3 using 1.5.0-dev solr ($Id: CHANGES.txt 906924 2010-02-05 12:43:11Z noble $)
>            Reporter: Gerald DeConto
>         Attachments: index.rar, SOLR-1782.2.patch, SOLR-1782.patch, SOLR-1782.test.patch
>
>
> The StatsComponent assumes any field specified in the stats.facet param can be faceted using FieldCache.DEFAULT.getStringIndex. This can cause problems with a variety of field types, but in the case of multivalued fields it can either produce erroneous stats when the number of distinct values is small, or throw an ArrayIndexOutOfBoundsException when the number of distinct values is greater than the number of documents.

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org
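
To illustrate the symptom described in the comment (only one tag per document being counted), here is a minimal plain-Java sketch. It does not use Lucene or Solr and is not the code from any attached patch. The class `StatsFacetSketch`, its methods, and the sample data are all hypothetical; the "single slot" model is a simplifying assumption about how a one-ordinal-per-document index behaves when several terms map to the same document.

```java
import java.util.*;

// Hypothetical illustration only: models a one-value-per-document index in plain Java.
final class StatsFacetSketch {

    /** Builds one slot per document; a later term overwrites an earlier one (assumed model). */
    static String[] buildSingleSlotIndex(List<List<String>> docTags) {
        // Walk distinct terms in sorted order, as a term dictionary would.
        TreeSet<String> terms = new TreeSet<>();
        for (List<String> tags : docTags) {
            terms.addAll(tags);
        }
        String[] slot = new String[docTags.size()];
        for (String term : terms) {
            for (int doc = 0; doc < docTags.size(); doc++) {
                if (docTags.get(doc).contains(term)) {
                    slot[doc] = term; // any earlier term for this document is lost
                }
            }
        }
        return slot;
    }

    /** Sums a per-document stat by facet value using only the single slot per document. */
    static Map<String, Double> sumBySingleSlot(List<List<String>> docTags, double[] stat) {
        String[] slot = buildSingleSlotIndex(docTags);
        Map<String, Double> sums = new TreeMap<>();
        for (int doc = 0; doc < slot.length; doc++) {
            if (slot[doc] != null) {
                sums.merge(slot[doc], stat[doc], Double::sum);
            }
        }
        return sums;
    }

    /** Sums the same stat while visiting every distinct value of every document. */
    static Map<String, Double> sumByAllValues(List<List<String>> docTags, double[] stat) {
        Map<String, Double> sums = new TreeMap<>();
        for (int doc = 0; doc < docTags.size(); doc++) {
            for (String tag : new LinkedHashSet<>(docTags.get(doc))) {
                sums.merge(tag, stat[doc], Double::sum);
            }
        }
        return sums;
    }

    public static void main(String[] args) {
        List<List<String>> docTags = Arrays.asList(
                Arrays.asList("java", "search"),
                Arrays.asList("search"),
                Arrays.asList("java", "solr"));
        double[] price = {10.0, 20.0, 30.0};
        System.out.println("single slot: " + sumBySingleSlot(docTags, price));
        System.out.println("all values:  " + sumByAllValues(docTags, price));
    }
}
```

With this sample data the single-slot version reports no entry for "java" at all, while the all-values version sums 40.0 for it. That gap is the kind of erroneous stats the issue describes for multivalued fields.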