lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Gustavo Corral" <gustavo.cor...@gmail.com>
Subject Re: Termfreq
Date Wed, 03 Dec 2008 17:58:31 GMT
Yes, of course it makes sense. I was just confused about the documentation
for the Similarity function.

On Wed, Dec 3, 2008 at 9:52 AM, Erick Erickson <erickerickson@gmail.com>wrote:

> I'm not much of an expert on term frequencies and scoring,
> but would you really want the score calculated for a document
> to be  affected by the occurrence of terms in a field you did
> NOT search on?
>
> I sure wouldn't,
>
>
> Best
> Erick
>
> On Wed, Dec 3, 2008 at 10:44 AM, Gustavo Corral <gustavo.corral@gmail.com
> >wrote:
>
> > Hi list,
> >
> > I hope this is not a silly question, but I should ask.
> >
> > I developed a IR system for XML documents with Lucene and I was checking
> > the
> > explain() output for some queries, but I don't understand this part:
> >
> > 0.121383816 = fieldWeight(title:efecto in 1), product of:
> > 1.0 = tf(termFreq(title:efecto)=1)
> > 0.7768564 = idf(docFreq=4)
> >
> > It suppose tf refears to the term's frequency in the document, but I know
> > there are more than one occurrences of this term in this document, so I
> > noted that termFreq stores the frequency of a term in the document, but
> > just
> > for the given field, like if Lucene takes a field like a whole document.
> Is
> > this correct, and if that is the case is this a good practice for IR?
> >
> > Thanks for any help or a good explanation.
> >
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message