hadoop-mapreduce-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Tony Burton (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (MAPREDUCE-4616) Improvement to MultipleOutputs javadocs
Date Tue, 11 Sep 2012 15:39:08 GMT

     [ https://issues.apache.org/jira/browse/MAPREDUCE-4616?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Tony Burton updated MAPREDUCE-4616:
-----------------------------------

    Status: Patch Available  (was: Open)

Documentation changes to describe how to use MultipleOutputs and LazyOutputFormat to mimic
behaviour in the now-deprecated MultipleTextOutputFormat (and similar)
                
> Improvement to MultipleOutputs javadocs
> ---------------------------------------
>
>                 Key: MAPREDUCE-4616
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4616
>             Project: Hadoop Map/Reduce
>          Issue Type: Improvement
>          Components: documentation
>    Affects Versions: 1.0.3
>            Reporter: Tony Burton
>            Priority: Minor
>              Labels: hadoop, mapreduce
>             Fix For: trunk
>
>         Attachments: MAPREDUCE-4616.patch
>
>
> In the new API, and using MultipleOutputs it is possible to segment output into directories
by using MultipleOutputs.write(KEYOUT key, VALUEOUT value, String baseOutputPath) in the Reducer
to determine the output directory, and by using LazyOutputFormat at the job-level config to
suppress normal output [eg use LazyOutputFormat.setOutputFormatClass(job, TextOutputFormat.class);
instead of job.setOutputFormatClass(TextOutputFormat.class);]
> This recreates the functionality previously provided in the old API by using MultipleTextOutputFormat
(etc)

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Mime
View raw message