lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Mikhail Khludnev <mkhlud...@griddynamics.com>
Subject Re: docFreq vs no docs returned
Date Mon, 10 Aug 2015 09:46:16 GMT
Hello Martin,

I think two docs has been deleted. Try to force merge to 1 segments and
repeat observation. Usually numFound(*:*) = maxDocs - numDeleted.

On Mon, Aug 10, 2015 at 11:05 AM, Martin Leopold <leopold@eluence.com>
wrote:

> Hi,
> I'm new to Solr and I'm trying to understand how relevancy is computed
> in Solr. I've run across the following difference that puzzles me: I'm
> seeing a difference between the "docFreq" returned by the debugQuery
> output and the number of documents for that search term.
>
> As understand docFreq it counts the number of documents that contain
> each term for a given field - so to me that should match the number of
> documents returned in a search.
>
> E.g. Searching on a particular term below yields the following debug
> output - 12 documents are found, but docFreq is 14. The total sum of all
> occurrences of the term is somewhat higher.
>
>   "response": {
>     "numFound": 12,
> ...
>           "description": "weight(content_da:bil in 12)
> ...
>               "description": "fieldWeight in 12, product of:",
>
> ...
>                   "description": "idf(docFreq=14, maxDocs=124616)"
> ...
>                   "description": "fieldNorm(doc=12)"
>
> Can someone enlighten me on what I'm missing?
>
> Br,
> Martin
>
>
>


-- 
Sincerely yours
Mikhail Khludnev
Principal Engineer,
Grid Dynamics

<http://www.griddynamics.com>
<mkhludnev@griddynamics.com>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message