lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Paul Elschot <paul.elsc...@xs4all.nl>
Subject Re: Study Group (WAS Re: Normalized Scoring)
Date Sun, 06 Feb 2005 12:26:24 GMT
On Sunday 06 February 2005 10:14, Kelvin Tan wrote:
> Wouldn't it be great if we can form a study-group of Lucene folks who want 
to take the "next step"? I feel uneasy posting non-Lucene specific questions 
to dev or user even if its related to IR.
>
> Feels to me like there could be a couple like us, who didn't do a 
dissertation in IR, but would like a more indepth knowledge for practical 
purposes. Basically, the end result is that we are able to tune or extend 
lucene by using the Expert api (classes marked as Expert). Perhaps a possible 
outcome is a tuning tutorial for advanced users who already know how to use 
Lucene.
> 
> What do you think?
> 

I learned a lot by adding some javadocs to such classes. I suppose Doug
added the Expert markings, but I don't know their precise purpose.

Tuning the scoring is difficult because one needs to avoid the trap of
optimizing for the test collection and test queries at hand.
The interplays between query structure, coord(), idf() and tf() 
add to the complexity.

As long as the discussion is on possible additions/improvements
/tunings/extensions to Lucene, I think lucene-dev is a good platform.
For example, there is some code in bugzilla for variations
on idf(): http://issues.apache.org/bugzilla/show_bug.cgi?id=32942
and tf(): http://issues.apache.org/bugzilla/show_bug.cgi?id=31784
and the MultiFieldQuery things are here:
http://issues.apache.org/bugzilla/show_bug.cgi?id=32674

> k
> 
> On Sat, 5 Feb 2005 22:10:26 -0800 (PST), Otis Gospodnetic wrote:
> > Exactly.  Luckily, since then I've learned a bit from lucene-dev
> > discussions and side IR readings, so some of the topics are making
> > more sense now.

One could collect and annotate more references here:
http://wiki.apache.org/jakarta-lucene/InformationRetrieval

Regards,
Paul Elschot


---------------------------------------------------------------------
To unsubscribe, e-mail: lucene-dev-unsubscribe@jakarta.apache.org
For additional commands, e-mail: lucene-dev-help@jakarta.apache.org


Mime
View raw message