lucene-dev mailing list archives

From Stephen Green <Stephen.Gr...@sun.com>
Subject Re: Open Source Relevance
Date Wed, 21 May 2008 12:26:37 GMT
Grant Ingersoll wrote:

> Cool, hadn't seen that.

Hi folks.  Long time lurker (in RSS), first time mailer.  I just  
wanted to say that (obviously) I think this is a great idea and we  
should try to push it a little further along.  I posted a bit more  
about it in my blog this morning:

http://blogs.sun.com/searchguy/entry/open_source_trec_trecmentum

The practical upshot:  I'd be more than happy to participate in this  
and to try to get data sources and queries from Sun or elsewhere.  I'd  
also be up for trying to find some place to host the collections and  
maybe even try to figure out some way that we could get computing  
resources to run the evaluations.  No guarantees on that (I'm sure a  
Sun Lawyer's ears are burning somewhere right now, just for me having  
said that!), but I'm willing to tilt at that windmill.

TREC had a huge impact on the academic and commercial IR communities  
and I think an OSTREC (see, it's already got a cool acronym!) could  
benefit all of us (it would give us bragging rights if nothing else :-)

This is not directly related to Lucene development, so we might want  
to take further discussions off-list.

> On May 20, 2008, at 1:01 PM, Steven A Rowe wrote:
>
> > On 05/19/2008 at 3:58 PM, Grant Ingersoll wrote:
> >> I think it is time the open source search community (and
> >> I don't mean just Lucene) develop and publish a set of
> >> TREC-style relevance judgments for freely available data
> >> that is easily obtained from the Internet.
> >
> > Stephen Green, Minion developer at Sun, whose posts comparing Minion
> > and Lucene were recently mentioned on the solr-user mailing list[1],
> > has similar ideas.  From
> > <http://blogs.sun.com/searchguy/entry/minion_and_lucene_performance>:
> >
> >   I think it would be a good idea for all of the open
> >   source engines to get together, find a nice open document
> >   collection (the Apache mailing list archives and their
> >   associated searches?) and build a nice set of regression
> >   tests and some pooled relevance sets so that we can test
> >   retrieval performance without having to rely on the TREC
> >   data.
> >


Steve
-- 
Stephen Green                      //   Stephen.Green@sun.com
Principal Investigator             \\   http://blogs.sun.com/searchguy
Aura Project                       //   Voice: +1 781-442-0926
Sun Microsystems Labs              \\   Fax:   +1 781-442-1692





