hadoop-mapreduce-issues mailing list archives

From "Tom White (JIRA)" <j...@apache.org>
Subject [jira] Updated: (MAPREDUCE-2226) TaggedInputSplit should be public
Date Mon, 28 Feb 2011 04:43:36 GMT

     [ https://issues.apache.org/jira/browse/MAPREDUCE-2226?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Tom White updated MAPREDUCE-2226:
---------------------------------

    Status: Open  (was: Patch Available)

Is it worth giving TaggedInputSplit a better name as part of making it public, e.g. MultipleInputsSplit?

Marking as open pending a new patch.


> TaggedInputSplit should be public
> ---------------------------------
>
>                 Key: MAPREDUCE-2226
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-2226
>             Project: Hadoop Map/Reduce
>          Issue Type: Improvement
>    Affects Versions: 0.20.1
>            Reporter: E. Sammer
>            Assignee: E. Sammer
>         Attachments: MAPREDUCE-2226.diff
>
>
> Currently it's not possible to get at the original InputSplits when using MultipleInputs.
> This is because TaggedInputSplit (used by DelegatingInputFormat, which MultipleInputs uses) is
> not public. This means things like the following do not work:
> {code}
> public void map(...) {
>   FileSplit fis = (FileSplit)((TaggedInputSplit) reporter.getInputSplit()).getInputSplit();
>   Path p = fis.getPath();
> }
> {code}
> This prevents users from getting at input split specific data.
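
Until the class is public, one possible workaround is to call getInputSplit() on the wrapper reflectively rather than casting to it. The following is a minimal, self-contained sketch of that idea only. It does not use Hadoop: Split, PathSplit and HiddenTaggedSplit are hypothetical stand-ins for InputSplit, FileSplit and TaggedInputSplit, and wrap() merely plays the role of the framework handing a wrapped split to the mapper. It assumes the wrapper exposes a no-argument getInputSplit() method, as the snippet above suggests.

```java
import java.lang.reflect.Method;

final class TaggedSplitUnwrapSketch {

    /** Stand-in for Hadoop's InputSplit (hypothetical, simplified). */
    interface Split {}

    /** Stand-in for FileSplit: carries only a path string. */
    static final class PathSplit implements Split {
        private final String path;
        PathSplit(String path) { this.path = path; }
        String getPath() { return path; }
    }

    /** Stand-in for the non-public TaggedInputSplit wrapper. */
    private static final class HiddenTaggedSplit implements Split {
        private final Split delegate;
        HiddenTaggedSplit(Split delegate) { this.delegate = delegate; }
        Split getInputSplit() { return delegate; }
    }

    /** Plays the role of the framework handing the mapper a wrapped split. */
    static Split wrap(Split original) {
        return new HiddenTaggedSplit(original);
    }

    /** Returns the wrapped split via reflection, or the argument itself if it is already a PathSplit. */
    static Split unwrap(Split split) {
        if (split instanceof PathSplit) {
            return split;
        }
        try {
            // Look the getter up by name, since the wrapper type cannot be named by the caller.
            Method getter = split.getClass().getDeclaredMethod("getInputSplit");
            getter.setAccessible(true);
            return (Split) getter.invoke(split);
        } catch (ReflectiveOperationException e) {
            throw new IllegalStateException("Cannot unwrap " + split.getClass().getName(), e);
        }
    }

    /** Convenience helper: unwrap and read the path in one step. */
    static String pathOf(Split split) {
        return ((PathSplit) unwrap(split)).getPath();
    }

    public static void main(String[] args) {
        Split fromFramework = wrap(new PathSplit("/data/part-00000"));
        System.out.println(pathOf(fromFramework));
    }
}
```

Reflection of this kind depends on a private method name and can break between releases, which is part of the case for making the class public instead.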

-- 
This message is automatically generated by JIRA.
-
For more information on JIRA, see: http://www.atlassian.com/software/jira

        
