lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Erick Erickson" <erickerick...@gmail.com>
Subject Re: Termfreq
Date Wed, 03 Dec 2008 15:52:35 GMT
I'm not much of an expert on term frequencies and scoring,
but would you really want the score calculated for a document
to be  affected by the occurrence of terms in a field you did
NOT search on?

I sure wouldn't,


Best
Erick

On Wed, Dec 3, 2008 at 10:44 AM, Gustavo Corral <gustavo.corral@gmail.com>wrote:

> Hi list,
>
> I hope this is not a silly question, but I should ask.
>
> I developed a IR system for XML documents with Lucene and I was checking
> the
> explain() output for some queries, but I don't understand this part:
>
> 0.121383816 = fieldWeight(title:efecto in 1), product of:
> 1.0 = tf(termFreq(title:efecto)=1)
> 0.7768564 = idf(docFreq=4)
>
> It suppose tf refears to the term's frequency in the document, but I know
> there are more than one occurrences of this term in this document, so I
> noted that termFreq stores the frequency of a term in the document, but
> just
> for the given field, like if Lucene takes a field like a whole document. Is
> this correct, and if that is the case is this a good practice for IR?
>
> Thanks for any help or a good explanation.
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message