lucene-dev mailing list archives

From "José Ramón Pérez Agüera"<>
Subject Re: implementation of the state-of-the-art retrieval models for lucene?
Date Fri, 02 Feb 2007 09:54:36 GMT
Dear Hui,

I'm a Ph.D. student at the Complutense University of Madrid (Spain),
where I am also a teaching assistant in the Department of Artificial
Intelligence. I have been working with Lucene for two years, and I'm very
interested in re-implementing certain classes (TermQuery, TermScorer,
DefaultSimilarity) to adapt them to state-of-the-art information
retrieval models such as BM25, LM, DFR, etc. I'm also working on an
evaluation module for Lucene that works with TREC collections and
similar test collections.

I think it would be a good idea to create a Lucene subproject to
develop new IR models and other tools aimed at the IR community.
I would be very interested in this, and I think it would be valuable
not only for the IR community but also for the Lucene community.

What do you think about this idea?



----- Original message -----
From: Hui Fang <>
Date: Friday, February 2, 2007 5:45 am
Subject: implementation of the state-of-the-art retrieval models for lucene?

> Dear all,
> My primary research interest is information retrieval, with a focus on
> developing effective and robust retrieval models. I am happy to send my
> first email to the Lucene community.
> Lucene and Nutch are really useful IR systems. But I think that the
> current retrieval function implemented in Lucene does not perform as
> well as other state-of-the-art retrieval functions in terms of
> effectiveness. I have implemented some state-of-the-art models (such as
> pivoted normalization, Okapi and axiomatic retrieval models) on top of
> Lucene, and evaluated these models and the default model implemented in
> Lucene using standard IR evaluation methodology. Experiments show that
> the state-of-the-art retrieval functions outperform the default one.
> Actually, this is one assignment my advisor and I designed for our IR
> course.
> After posting this assignment online, quite a few IR researchers
> contacted us and asked for the code of our implementations. So, we think
> that it might be beneficial to everyone in the Lucene community and the
> IR research community if we could contribute our implementation of the
> state-of-the-art retrieval functions to Lucene.
> I think that our contribution could help improve the retrieval
> performance for both Lucene and Nutch.
> What do you think?
> Thanks,
> -Hui

José Ramón Pérez Agüera

Dept. de Ingeniería del Software e Inteligencia Artificial
Despacho 411 tlf. 913947599
Facultad de Informática
Universidad Complutense de Madrid
