lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Andrzej Bialecki ...@getopt.org>
Subject Re: Benchmarking on GOV2
Date Mon, 29 May 2006 17:58:05 GMT
Marvin Humphrey wrote:
>
> On May 29, 2006, at 10:34 AM, Andrzej Bialecki wrote:
>
>>  It could use the Reuters corpus
>
> Has anyone used existing categorization data associated with the 
> Reuters corpus to build a benchmarker that measured IR precision 
> and/or recall?

That would be RCV1 or RCV2, right? AFAIK the Reuters-21578 has no such 
information ... The use of RCV1/RCV2 is subject to a more stringent 
license than Reuters-21578, so that few people would be able to actually 
run the benchmarks.

-- 
Best regards,
Andrzej Bialecki     <><
 ___. ___ ___ ___ _ _   __________________________________
[__ || __|__/|__||\/|  Information Retrieval, Semantic Web
___|||__||  \|  ||  |  Embedded Unix, System Integration
http://www.sigram.com  Contact: info at sigram dot com



---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-dev-help@lucene.apache.org


Mime
View raw message