lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Shawn Heisey <s...@elyograg.org>
Subject Re: numFound is changing when start parameter changed
Date Fri, 27 Jun 2014 18:59:40 GMT
On 6/26/2014 8:47 PM, CONAN wrote:
> I use solr4.4 , 2 shards and 2 replicas  and I found a problem on solrCloud search.
>
>
> If I perform a query  with start=0 and rows=10 and say fq=ownerId:123 , I get numFound=225.
> If I simply change the start param to start=6, I get numFound=223.
> and i change the start param to start=10 , i get numFound=219.
>
>
> i searhed google and get "You probably have duplicates (docs on different shards with
the same id)."
>
>
> how is that happen? is that a bug? and how to fix it? 

I think you mentioned this problem on IRC as well.

What is the "router" on your collection?  Is it implicit, or something
like compositeId?  You can find this by opening the admin UI, clicking
Cloud, then Tree, and finding your collection under /collections.  Click
on the collection and look on the right side for the "router" entry.

If you have implicit routing or you are using some other method (such as
shard keys) to determine which shard contains a given document ID, then
this problem would happen because you or your indexing program sent the
same ID to different shards.

If you're using compositeId so that document routing is automatic, then
it should be impossible for this to happen ... unless you have manually
adjusted your clusterstate and changed the hash ranges.

Thanks,
Shawn


Mime
View raw message