lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Doug Cutting <cutt...@lucene.com>
Subject Re: Lucene Scoring Behavior
Date Thu, 18 Sep 2003 03:15:23 GMT
Hmm.  This makes no sense to me.  Can you supply a reproducible 
standalone test case?

Doug

Terry Steichen wrote:
> Doug,
> 
> (1) No, I did *not* boost the pub_date field, either in the indexing process
> or in the query itself.
> 
> (2) And, each pub_date field of each document (which is in XML format)
> contains only one instance of the date string.
> 
> (3) And only the pub_date field itself is indexed.  There are other
> attributes of this field that may contain the date string, but they aren't
> indexed - that is, they are not included in the instantiated Document class.
> 
> Regards,
> 
> Terry
> 
> ----- Original Message -----
> From: "Doug Cutting" <cutting@lucene.com>
> To: "Lucene Users List" <lucene-user@jakarta.apache.org>
> Sent: Wednesday, September 17, 2003 5:51 PM
> Subject: Re: Lucene Scoring Behavior
> 
> 
> 
>>Terry Steichen wrote:
>>
>>>  0.03125 = fieldNorm(field=pub_date, doc=90992)
>>>  1.0 = fieldNorm(field=pub_date, doc=90970)
>>
>>It looks like the fieldNorm's are what differ, not the IDFs.  These are
>>the product of the document and/or field boost, and 1/sqrt(numTerms)
>>where numTerms is the number of terms in the "pub_date" field of the
>>document.  Thus if each document is only assigned one date, and you
>>didn't boost the field or the document when you indexed it, this should
>>be 1.0.  But if the document has two dates, then this would be
>>1/sqrt(2).  Or if you boosted this document pub_date field, then this
>>will have whatever boost you provided.
>>
>>So, did you boost anything when indexing?  Or could a single document
>>have two or more different values for pub_date?  Either would explain
> 
> this.
> 
>>Doug
>>
>>
>>---------------------------------------------------------------------
>>To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org
>>For additional commands, e-mail: lucene-user-help@jakarta.apache.org
>>
> 
> 
> 
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org
> For additional commands, e-mail: lucene-user-help@jakarta.apache.org
> 


Mime
View raw message