hadoop-mapreduce-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Tom White (JIRA)" <j...@apache.org>
Subject [jira] Commented: (MAPREDUCE-370) Change org.apache.hadoop.mapred.lib.MultipleOutputs to use new api.
Date Mon, 24 Aug 2009 10:17:59 GMT

    [ https://issues.apache.org/jira/browse/MAPREDUCE-370?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12746801#action_12746801
] 

Tom White commented on MAPREDUCE-370:
-------------------------------------

Could the counter name be based on the named output, rather than the base filename?

bq. if user doesn't give unique name for the output file, there are chances that output will
be garbled.

This is true, but like MultipleOutputFormat it would be up to the application to give unique
names to the output files. Most users would use the simpler form that takes a named output
and lets MultipleOutputs construct the output filename {{{namedOutput}-(m|r)-{part-number}}},
but this change I'm proposing would allow advanced users to control the precise filename of
the outputs.

> Change org.apache.hadoop.mapred.lib.MultipleOutputs to use new api.
> -------------------------------------------------------------------
>
>                 Key: MAPREDUCE-370
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-370
>             Project: Hadoop Map/Reduce
>          Issue Type: Sub-task
>            Reporter: Amareshwari Sriramadasu
>            Assignee: Amareshwari Sriramadasu
>             Fix For: 0.21.0
>
>         Attachments: patch-370-1.txt, patch-370.txt
>
>


-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message