lucene-solr-dev mailing list archives

From "Yonik Seeley (JIRA)" <j...@apache.org>
Subject [jira] Commented: (SOLR-1030) Facet counts are not correct (or total document count is not correct as they do not match) on some searches
Date Fri, 20 Feb 2009 19:39:04 GMT

    [ https://issues.apache.org/jira/browse/SOLR-1030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12675447#action_12675447 ]

Yonik Seeley commented on SOLR-1030:
------------------------------------

Yes, duplicate docs on shards would indeed cause facet counts to be too high.
Duplicate docs are an error condition that we can handle relatively gracefully, but not without
some inconsistencies.
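
To illustrate why duplicates inflate facet counts, here is a minimal sketch of a toy merge step. It is a simplified model under stated assumptions, not Solr's actual merge code: the function name `merge_shard_responses` and the response layout are hypothetical, and it assumes every matching document ID is visible to the coordinator. The point is only that per-shard facet counts are aggregates that can be summed but not de-duplicated, while document IDs can be compared by unique key.

```python
from collections import Counter


def merge_shard_responses(shard_responses):
    """Toy model of a distributed merge (hypothetical, not Solr's code)."""
    facet_counts = Counter()
    seen_ids = set()
    for response in shard_responses:
        # Facet counts arrive as per-shard totals, so they can only be summed;
        # nothing in a bare count says which documents produced it.
        facet_counts.update(response["facets"])
        # Document IDs can be compared, so a repeated unique key is dropped.
        seen_ids.update(response["doc_ids"])
    return {"num_found": len(seen_ids), "facets": dict(facet_counts)}


# "doc2" is indexed on both shards, which is the error condition.
shard_a = {"doc_ids": ["doc1", "doc2"], "facets": {"Journal Article": 2}}
shard_b = {"doc_ids": ["doc2", "doc3"], "facets": {"Journal Article": 2}}

merged = merge_shard_responses([shard_a, shard_b])
print(merged["num_found"], merged["facets"]["Journal Article"])
```

In this toy case the merged total is 3 documents while the facet count is 4, the same kind of mismatch as in the report: the facet count exceeds the document count by the number of duplicates the merge was able to detect.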

> Facet counts are not correct (or total document count is not correct as they do not match) on some searches
> -----------------------------------------------------------------------------------------------------------
>
>                 Key: SOLR-1030
>                 URL: https://issues.apache.org/jira/browse/SOLR-1030
>             Project: Solr
>          Issue Type: Bug
>          Components: search
>    Affects Versions: 1.4
>            Reporter: Jayson Minard
>
> -There isn't much detailed evidence for this one yet, but hopefully it rings a bell with someone who made changes in this area recently...-
> -Since updating to the tip from our previous use of the tip from around Jan 9, 2009- (seems to be previous to r733656 as well) we are now seeing facet counts that no longer match the total document count. This is through distributed search, and I have not verified that it happens only on distributed vs. single-shard search, so it could be on both.
> For example, on a single-valued field with one facet value set as an fq filter, combined with a text search on the simple term "science", the following is the facet count:
> 8,294,284
> And the total document count for the same results is:
> 8,294,274
> Some debug info (not sure why the filter query is repeated more than once, but that shouldn't be harmful):
> {code}
> querystring	  	(science)
> QParser	  	OldLuceneQParser
> filter_queries	  	[sys_content_type:("Journal Article"), sys_content_type:("Journal Article"),
> sys_content_type:("Journal Article"), sys_content_type:("Journal Article"), 
> sys_content_type:("Journal Article"), sys_content_type:("Journal Article"), 
> sys_content_type:("Journal Article"), sys_content_type:("Journal Article"), 
> sys_content_type:("Journal Article"), sys_content_type:("Journal Article")]
> rawquerystring	  	(science)
> {code}

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

