hadoop-common-dev mailing list archives

From "Nigel Daley (JIRA)" <j...@apache.org>
Subject [jira] Resolved: (HADOOP-877) we should automate checks of the output of the sort example program
Date Tue, 13 Feb 2007 16:18:05 GMT

     [ https://issues.apache.org/jira/browse/HADOOP-877?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Nigel Daley resolved HADOOP-877.
--------------------------------

    Resolution: Duplicate
      Assignee:     (was: Nigel Daley)

Duplicate of HADOOP-333

> we should automate checks of the output of the sort example program
> -------------------------------------------------------------------
>
>                 Key: HADOOP-877
>                 URL: https://issues.apache.org/jira/browse/HADOOP-877
>             Project: Hadoop
>          Issue Type: Improvement
>          Components: test
>    Affects Versions: 0.10.0
>            Reporter: Owen O'Malley
>
> Since we are using the sort example program to do smoke tests on new versions of Hadoop,
> it would be nice to have some checks of the output. The checks that I've considered:
>   1. count the number of records on input & output
>   2. compute the md5 of each key/value and xor across all of the rows
>   3. use a map/reduce job to merge the input and output directories and make sure that
>      each key/value appears on both input and output
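
Checks 1 and 2 above could be sketched roughly as follows. This is only an illustration in Python, not code from Hadoop or from the issue: the function name checksum, the in-memory list of (key, value) byte pairs, and the length-prefix framing of the key are all assumptions made for the example. Because XOR is order-independent, the same value should come out for the unsorted input and the sorted output.

```python
import hashlib

def checksum(records):
    """Return (record_count, xor_of_md5s) for an iterable of (key, value) byte pairs.

    Hypothetical helper for illustration; not part of Hadoop.
    """
    count = 0
    acc = 0
    for key, value in records:
        h = hashlib.md5()
        # Assumption: length-prefix the key so that (b"ab", b"c") and
        # (b"a", b"bc") do not produce the same digest.
        h.update(len(key).to_bytes(4, "big"))
        h.update(key)
        h.update(value)
        # XOR is commutative and associative, so record order does not matter.
        acc ^= int.from_bytes(h.digest(), "big")
        count += 1
    return count, acc

# Usage: compare the job's input against its (sorted) output.
job_input = [(b"b", b"2"), (b"a", b"1"), (b"c", b"3")]
job_output = sorted(job_input)
print(checksum(job_input) == checksum(job_output))
```

One caveat with this sketch: identical records cancel each other under XOR, so swapping one pair of duplicate records for a different pair of duplicates would leave both the count and the XOR unchanged. A full comparison along the lines of check 3 would be needed to catch that case.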

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

