lucene-solr-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Jayson Minard (JIRA)" <j...@apache.org>
Subject [jira] Commented: (SOLR-1030) Facet counts are not correct (or total document count is not correct as they do not match) on some searches
Date Fri, 20 Feb 2009 20:37:01 GMT

    [ https://issues.apache.org/jira/browse/SOLR-1030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12675466#action_12675466
] 

Jayson Minard commented on SOLR-1030:
-------------------------------------

Yeah, we had the assumption that the source data did not have duplicate docs across shards,
turns out that it does.  Otherwise we would have checked that first.  I'll keep an eye on
this one for a bit, but most likely just the duplicate doc issue.

> Facet counts are not correct (or total document count is not correct as they do not match)
on some searches
> -----------------------------------------------------------------------------------------------------------
>
>                 Key: SOLR-1030
>                 URL: https://issues.apache.org/jira/browse/SOLR-1030
>             Project: Solr
>          Issue Type: Bug
>          Components: search
>    Affects Versions: 1.4
>            Reporter: Jayson Minard
>
> -There isn't much detailed evidence for this one yet, but hopefully it rings a bell with
someone who made changes in this area recently...-
> -Since updating to the tip from our previous use of the tip from around Jan 9, 2009-
(seems to be previous to r733656 as well) we are now seeing facet counts no longer match total
document count.  This is through distributed search and I have not verified that it only happens
on distributed vs. single shard search so it could be on both.
> For example, on a single valued field with one facet value set as a fq filter, combined
with a text search on a simple term "science", the following is the facet count:
> 8,294,284
> And the total document count for the same results is:
> 8,294,274
> some debug info (not sure why the filter query is replicated more than once, but that
shouldn't be harmful):
> {code}
> uerystring	  	(science)
> QParser	  	OldLuceneQParser
> filter_queries	  	[sys_content_type:("Journal Article"), sys_content_type:("Journal Article"),

> sys_content_type:("Journal Article"), sys_content_type:("Journal Article"), 
> sys_content_type:("Journal Article"), sys_content_type:("Journal Article"), 
> sys_content_type:("Journal Article"), sys_content_type:("Journal Article"), 
> sys_content_type:("Journal Article"), sys_content_type:("Journal Article")]
> rawquerystring	  	(science)
> {code}

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message