hadoop-mapreduce-issues mailing list archives

From "Adam Kramer (JIRA)" <j...@apache.org>
Subject [jira] Commented: (MAPREDUCE-1750) Make #rows avail. to reducers as environment variable
Date Mon, 03 May 2010 06:46:56 GMT

    [ https://issues.apache.org/jira/browse/MAPREDUCE-1750?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12863252#action_12863252 ]

Adam Kramer commented on MAPREDUCE-1750:

Sorry for being unclear. Yes, this is for streaming. I agree that that is a good first step.

And yes, I meant the number of values, not the number of keys. Thanks!

> Make #rows avail. to reducers as environment variable
> -----------------------------------------------------
>                 Key: MAPREDUCE-1750
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1750
>             Project: Hadoop Map/Reduce
>          Issue Type: Wish
>            Reporter: Adam Kramer
>            Priority: Minor
> Given that there is a sort phase between the copy phase and the reduce phase, it seems like there is a chance for counting during sort.
> It would be nice if my reducers could have access to an environment variable, say, mapred.reduce.rows, that contained the number of rows present for this reducer (as counted during the sort step).
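
For illustration only, here is a minimal sketch of how a streaming reducer might consume such a value if it existed. The variable mapred.reduce.rows is only proposed in this issue and is not an existing Hadoop setting. The sketch assumes it would reach the script as mapred_reduce_rows, since streaming exports job configuration names with the dots replaced by underscores. The helper names expected_rows and count_rows are hypothetical.

```python
import os
import sys

# Hypothetical name: the issue only proposes "mapred.reduce.rows".
# Streaming exports configuration keys with "." replaced by "_".
ROWS_ENV_VAR = "mapred_reduce_rows"


def expected_rows(environ=os.environ):
    """Return the proposed row count, or None if it is unset or not an integer."""
    raw = environ.get(ROWS_ENV_VAR)
    if raw is None:
        return None
    try:
        return int(raw)
    except ValueError:
        return None


def count_rows(lines):
    """Fallback when the variable is absent: count rows by reading all of them."""
    return sum(1 for _ in lines)


if __name__ == "__main__":
    total = expected_rows()
    if total is None:
        # Without the proposed variable, the reducer has to scan its whole input.
        total = count_rows(sys.stdin)
    print(total)
```

With the variable set, the reducer would know its row count before reading any input; without it, the only option is the full pass over stdin shown in the fallback.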

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.
