lucene-dev mailing list archives

From "Wojtek Piaseczny (JIRA)" <j...@apache.org>
Subject [jira] Updated: (SOLR-1782) stats.facet assumes FieldCache.StringIndex - fails horribly on multivalued fields
Date Tue, 22 Jun 2010 00:14:56 GMT

     [ https://issues.apache.org/jira/browse/SOLR-1782?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Wojtek Piaseczny updated SOLR-1782:
-----------------------------------

    Attachment: SOLR-1782.2.patch

The first patch was unusably slow with ~1M documents. The new patch uses both UnInvertedField and
FieldCache.DocTermsIndex for multi-valued facet fields in StatsComponent. getValues has been renamed
to getTermNumbers to reflect the change.

> stats.facet assumes FieldCache.StringIndex - fails horribly on multivalued fields
> ---------------------------------------------------------------------------------
>
>                 Key: SOLR-1782
>                 URL: https://issues.apache.org/jira/browse/SOLR-1782
>             Project: Solr
>          Issue Type: Bug
>          Components: search
>    Affects Versions: 1.4
>         Environment: reproduced on Win2k3 using 1.5.0-dev solr ($Id: CHANGES.txt 906924 2010-02-05 12:43:11Z noble $)
>            Reporter: Gerald DeConto
>         Attachments: index.rar, SOLR-1782.2.patch, SOLR-1782.patch, SOLR-1782.test.patch
>
>
> The StatsComponent assumes that any field specified in the stats.facet param can be faceted
> using FieldCache.DEFAULT.getStringIndex. This causes problems with a variety of field
> types. For multivalued fields in particular, it either produces incorrect stats when
> the number of distinct values is small, or throws ArrayIndexOutOfBoundsException when
> the number of distinct values is greater than the number of documents.
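
To illustrate the two failure modes described above, here is a minimal sketch of a one-term-per-document index. It is a simplified model, not Lucene's actual FieldCache code: the class SingleValuedIndexSketch and its lookup/order arrays are hypothetical names, and the sizing of the term table to maxDoc + 1 is an assumption made to mirror the "at most one term per document" expectation.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

/** Simplified model of a one-term-per-document index (hypothetical, not Lucene code). */
final class SingleValuedIndexSketch {
    final String[] lookup; // ord -> term; slot 0 is reserved for "no value"
    final int[] order;     // docId -> ord of the single term recorded for that doc

    SingleValuedIndexSketch(List<List<String>> docs) {
        int maxDoc = docs.size();
        // Assumption: at most one distinct term per document, so maxDoc + 1 slots suffice.
        lookup = new String[maxDoc + 1];
        order = new int[maxDoc];

        // Invert the documents: term -> ids of the docs containing it, in sorted term order.
        TreeMap<String, List<Integer>> postings = new TreeMap<>();
        for (int doc = 0; doc < maxDoc; doc++) {
            for (String term : docs.get(doc)) {
                postings.computeIfAbsent(term, k -> new ArrayList<>()).add(doc);
            }
        }

        int ord = 1;
        for (Map.Entry<String, List<Integer>> e : postings.entrySet()) {
            // Overflows the table once there are more distinct terms than documents.
            lookup[ord] = e.getKey();
            for (int doc : e.getValue()) {
                // A multi-valued doc is overwritten here: only its last term in sort order survives.
                order[doc] = ord;
            }
            ord++;
        }
    }

    public static void main(String[] args) {
        // Few distinct values: doc 0 holds "red" and "blue", but "blue" is silently dropped.
        SingleValuedIndexSketch small = new SingleValuedIndexSketch(
                List.of(List.of("red", "blue"), List.of("red")));
        System.out.println("doc 0 resolves to: " + small.lookup[small.order[0]]);

        // More distinct values (4) than documents (2): the term table overflows.
        try {
            new SingleValuedIndexSketch(List.of(List.of("a", "b", "c"), List.of("d")));
        } catch (ArrayIndexOutOfBoundsException ex) {
            System.out.println("overflow: " + ex.getMessage());
        }
    }
}
```

In this model, any per-facet statistic computed through order[] attributes each document to a single term, which is why the stats come out wrong rather than failing outright when the distinct-value count is small.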

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org

