mahout-commits mailing list archives

Subject [CONF] Apache Lucene Mahout > Collections
Date Thu, 22 Apr 2010 19:57:02 GMT
Space: Apache Lucene Mahout
Page: Collections

Edited by Grant Ingersoll:
TODO: Organize these somehow, add one-line blurbs
Organize by usage? (classification, recommendation etc.)

*Collections of Collections*
[UCI Machine Learning Repo|]

*Categorization Data*
[10 years of CLEF Data|] (Approximately 160k categorized docs)
There is a newer beta version here: (Approximately 320k categorized docs)

*Recommendation Data*
[Netflix Prize/Dataset|]
[Book usage and recommendation data from the University of Huddersfield|]
[|] - Non-commercial use only

*General Resources*


[4 Universities Data Set|]


