hadoop-mapreduce-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Thomas Graves (JIRA)" <j...@apache.org>
Subject [jira] [Created] (MAPREDUCE-5395) Update Teragen algorithm
Date Tue, 16 Jul 2013 13:12:48 GMT
Thomas Graves created MAPREDUCE-5395:

             Summary: Update Teragen algorithm
                 Key: MAPREDUCE-5395
                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5395
             Project: Hadoop Map/Reduce
          Issue Type: Improvement
          Components: examples
    Affects Versions: 0.23.7
            Reporter: Thomas Graves

The Teragen algorithm is no longer up to date with the sortbenchmark.org gensort tool used
for the official sort benchmark.  The new algorithm is supposed to generate data that isn't
very compressible. 

Also the new version of gensort can generate skewed data so we should add that option to teragen

This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

View raw message