hadoop-common-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Bryan Pendleton (JIRA)" <j...@apache.org>
Subject [jira] Commented: (HADOOP-403) close method in a Mapper should be provided with OutputCollector and a Reporter
Date Mon, 31 Jul 2006 22:45:16 GMT
    [ http://issues.apache.org/jira/browse/HADOOP-403?page=comments#action_12424687 ] 
            
Bryan Pendleton commented on HADOOP-403:
----------------------------------------

+1 on this one. Reporter or Progressable objects should be passed into *any* call in this
project, but I've also run into the need to output "after all maps", with no hook currently
available to do it.

> close method in a Mapper should be provided with OutputCollector and a Reporter
> -------------------------------------------------------------------------------
>
>                 Key: HADOOP-403
>                 URL: http://issues.apache.org/jira/browse/HADOOP-403
>             Project: Hadoop
>          Issue Type: Improvement
>          Components: mapred
>    Affects Versions: 0.5.0
>         Environment: all
>            Reporter: Milind Bhandarkar
>         Assigned To: Milind Bhandarkar
>             Fix For: 0.5.0
>
>
> For mappers with side-effects, or mappers that work as aggregators (i.e. no output on
individual key-value pairs, but an aggregate output at the end of all key-value pairs), output
should be performed in the close method. For this purpose, we need to supply output collector
and reporter to the close method of Mapper. This involves interface change, though. Thoughts
?

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators: http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Mime
View raw message