hadoop-mapreduce-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Jay Booth (JIRA)" <j...@apache.org>
Subject [jira] Updated: (MAPREDUCE-1170) MultipleInputs doesn't work with new API in 0.20 branch
Date Mon, 02 Nov 2009 23:04:59 GMT

     [ https://issues.apache.org/jira/browse/MAPREDUCE-1170?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Jay Booth updated MAPREDUCE-1170:
---------------------------------

    Attachment: MAPREDUCE-1170.patch

New patch fixes ClassCastException in LineRecordReader via 
<pre>
if (split instanceof TaggedInputSplit) fileSplit = (FileSplit) ((TaggedInputSplit) split).getInputSplit();
else fileSplit = (FileSplit) split;
</pre>

The old test just added the inputs and verified they were added, didn't actually run a job,
so this error snuck through.

New test runs a job with MultipleInputs and 2 different mapper classes, ensuring that output
is correct.  Passes.

The test fails on 0.21 branch though -- I'll make a separate JIRA and post a patch for that
as well



> MultipleInputs doesn't work with new API in 0.20 branch
> -------------------------------------------------------
>
>                 Key: MAPREDUCE-1170
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1170
>             Project: Hadoop Map/Reduce
>          Issue Type: New Feature
>    Affects Versions: 0.20.1
>            Reporter: Jay Booth
>             Fix For: 0.20.2
>
>         Attachments: MAPREDUCE-1170.patch
>
>
> This patch adds support for MultipleInputs (and KeyValueTextInputFormat) in o.a.h.mapreduce.lib.input,
working with the new API.  Included passing unit test.  Include for 0.20.2?

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message