hadoop-mapreduce-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Tom White (JIRA)" <j...@apache.org>
Subject [jira] Commented: (MAPREDUCE-370) Change org.apache.hadoop.mapred.lib.MultipleOutputs to use new api.
Date Mon, 24 Aug 2009 11:12:59 GMT

    [ https://issues.apache.org/jira/browse/MAPREDUCE-370?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12746816#action_12746816
] 

Tom White commented on MAPREDUCE-370:
-------------------------------------

bq. I think these users can override FileOutputFormat.getDefaultWorkFile to control the precise
filename.

This is true. So to have complete control over the output filename you would call the write
method with a base output path of the name you want (possibly using the key and value to construct
it). You would then override FileOutputFormat.getDefaultWorkFile() to omit the {m,r}-nnnnn
suffix.

We could make this slightly easier in the future perhaps (by putting it in the MultipleOutputs
API, for example), but I think the current approach is reasonable.

> Change org.apache.hadoop.mapred.lib.MultipleOutputs to use new api.
> -------------------------------------------------------------------
>
>                 Key: MAPREDUCE-370
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-370
>             Project: Hadoop Map/Reduce
>          Issue Type: Sub-task
>            Reporter: Amareshwari Sriramadasu
>            Assignee: Amareshwari Sriramadasu
>             Fix For: 0.21.0
>
>         Attachments: patch-370-1.txt, patch-370-2.txt, patch-370.txt
>
>


-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message