lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Chuck Williams" <ch...@manawiz.com>
Subject RE: Contribution: better multi-field searching
Date Fri, 22 Oct 2004 05:57:36 GMT
Those are interesting papers, especially the one by Robertson.  There are subtle variations
in the specific form of idf, but in all of the models presented the term is linear, not quadratic.
 Robertson's theoretical arguments justify a linear term.

  > -----Original Message-----
  > From: Antonio Gulli [mailto:gulli@di.unipi.it]
  > Sent: Friday, October 22, 2004 9:59 AM
  > To: Lucene Developers List
  > Subject: Re: Contribution: better multi-field searching
  > 
  > 
  > >
  > > If someone can demonstrate that an alternate formulation produces
  > > superior results for most applications, then we should of course
  > > change the default implementation.  But just noting that there's a
  > > factor which is equal to idf^2 in each element of the sum does not do
  > > this.
  > 
  > Dont think that there is a magic formula, but found these papers
  > interesting.
  > http://www.emeraldinsight.com/rpsv/cgi-bin/emft.pl
  > 
  > Title: Understanding inverse document frequency: on theoretical
  > arguments
  > for IDF
  > Author: Stephen Robertson
  > Pages: 503-520
  > 
  > Title: IDF term weighting and IR research lessons
  > Author: Karen Spärck Jones
  > Pages: 521-523
  > 
  > 
  > 
  > ---------------------------------------------------------------------
  > To unsubscribe, e-mail: lucene-dev-unsubscribe@jakarta.apache.org
  > For additional commands, e-mail: lucene-dev-help@jakarta.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: lucene-dev-unsubscribe@jakarta.apache.org
For additional commands, e-mail: lucene-dev-help@jakarta.apache.org


Mime
View raw message