lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Andrzej Bialecki ...@getopt.org>
Subject Re: Benchmarking on GOV2
Date Mon, 29 May 2006 17:34:18 GMT
Otis Gospodnetic wrote:
> OG: But Andrzej, you already wrote that indexing benchmark tool (which we never put anywhere
in SVN, I'm afraid) that works on some freely available Reuters corpus, I believe.  Why couldn't
that be adapted for testing Lucene, Egothor, and MG4J?
>   

Hmm, yes, indeed I have ... It was so long ago I nearly forgot about it. 
:) I need to dust it off and see if it's of any use. It used the 
20newsgroups corpus (~19,000 items). It could use the Reuters corpus, 
just the parser would have to be implemented.

-- 
Best regards,
Andrzej Bialecki     <><
 ___. ___ ___ ___ _ _   __________________________________
[__ || __|__/|__||\/|  Information Retrieval, Semantic Web
___|||__||  \|  ||  |  Embedded Unix, System Integration
http://www.sigram.com  Contact: info at sigram dot com



---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-dev-help@lucene.apache.org


Mime
View raw message