lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Dave Kor" <>
Subject Re: Benchmarking on GOV2
Date Mon, 29 May 2006 09:33:23 GMT

On 5/29/06, Sebastiano Vigna <> wrote:
> Dear Lucene developers,
> I'd be interested in doing some benchmarking on (at least) Lucene,
> Egothor and MG4J. There is no actual data around on publicly available
> collections, and it would be nice to have some more objective data on
> efficiency for a significantly large collection.

I was wondering if you have seen the TREC 2004 paper by Giuseppe
Attardi, Andrea Esuli and Chirag Pate from the University of Pisa,
Italy, titled "Using Clustering and Blade Clusters in the TeraByte

In the paper, three search engines (including Lucene) was benchmarked
on the GOV2 corpus.

Dave Kor
Center for Information Mining and Extraction
School of Computing
National University of Singapore.

To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message