hadoop-mapreduce-issues mailing list archives

From "Harsh J (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (MAPREDUCE-4616) Improvement to MultipleOutputs javadocs
Date Fri, 12 Oct 2012 16:05:03 GMT

    [ https://issues.apache.org/jira/browse/MAPREDUCE-4616?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13475100#comment-13475100 ]

Harsh J commented on MAPREDUCE-4616:

Thanks, Arun! And sorry I missed this after that review, Tony; my apologies.
> Improvement to MultipleOutputs javadocs
> ---------------------------------------
>                 Key: MAPREDUCE-4616
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4616
>             Project: Hadoop Map/Reduce
>          Issue Type: Improvement
>          Components: documentation
>    Affects Versions: 1.0.3
>            Reporter: Tony Burton
>            Assignee: Tony Burton
>            Priority: Minor
>             Fix For: 2.0.3-alpha
>         Attachments: MAPREDUCE-4616.patch, MAPREDUCE-4616.patch
> In the new API, MultipleOutputs makes it possible to segment output into directories:
> call MultipleOutputs.write(KEYOUT key, VALUEOUT value, String baseOutputPath) in the Reducer
> to determine the output directory, and use LazyOutputFormat in the job-level config to
> suppress the normal output [e.g. use LazyOutputFormat.setOutputFormatClass(job, TextOutputFormat.class);
> instead of job.setOutputFormatClass(TextOutputFormat.class);].
> This recreates the functionality previously provided in the old API by using MultipleTextOutputFormat.
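
The approach described in the issue summary could look roughly like the sketch below. It is only an illustration, not the text of the attached patch: the class names SegmentedOutput and SegmentingReducer, the basePathFor helper, the "<key>/part" layout, and the use of KeyValueTextInputFormat with the identity mapper are all assumptions made for the example. Job.getInstance is the 2.x factory method; on 1.x the Job constructor would be used instead.

```java
import java.io.IOException;

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Job;
import org.apache.hadoop.mapreduce.Reducer;
import org.apache.hadoop.mapreduce.lib.input.FileInputFormat;
import org.apache.hadoop.mapreduce.lib.input.KeyValueTextInputFormat;
import org.apache.hadoop.mapreduce.lib.output.FileOutputFormat;
import org.apache.hadoop.mapreduce.lib.output.LazyOutputFormat;
import org.apache.hadoop.mapreduce.lib.output.MultipleOutputs;
import org.apache.hadoop.mapreduce.lib.output.TextOutputFormat;

public class SegmentedOutput {

    // Hypothetical helper: one subdirectory per key under the job output
    // directory. The task suffix (e.g. -r-00000) is appended by Hadoop.
    static String basePathFor(String key) {
        return key + "/part";
    }

    public static class SegmentingReducer extends Reducer<Text, Text, Text, Text> {
        private MultipleOutputs<Text, Text> outputs;

        @Override
        protected void setup(Context context) {
            outputs = new MultipleOutputs<Text, Text>(context);
        }

        @Override
        protected void reduce(Text key, Iterable<Text> values, Context context)
                throws IOException, InterruptedException {
            for (Text value : values) {
                // The third argument selects the directory and file prefix.
                outputs.write(key, value, basePathFor(key.toString()));
            }
        }

        @Override
        protected void cleanup(Context context)
                throws IOException, InterruptedException {
            // Closing MultipleOutputs closes the underlying record writers.
            outputs.close();
        }
    }

    public static void main(String[] args) throws Exception {
        Job job = Job.getInstance(new Configuration(), "segmented-output");
        job.setJarByClass(SegmentedOutput.class);

        // Assumption: tab-separated text input, so the default identity
        // mapper already emits (Text, Text) pairs.
        job.setInputFormatClass(KeyValueTextInputFormat.class);
        job.setReducerClass(SegmentingReducer.class);
        job.setOutputKeyClass(Text.class);
        job.setOutputValueClass(Text.class);

        // Used instead of job.setOutputFormatClass(TextOutputFormat.class)
        // so that output files are only created once a record is written.
        LazyOutputFormat.setOutputFormatClass(job, TextOutputFormat.class);

        FileInputFormat.addInputPath(job, new Path(args[0]));
        FileOutputFormat.setOutputPath(job, new Path(args[1]));
        System.exit(job.waitForCompletion(true) ? 0 : 1);
    }
}
```

Because the reducer never writes through context.write(), the lazy output format leaves no empty default part files in the job output directory, and each key's records should end up under a subdirectory named after that key.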

