hadoop-common-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Owen O'Malley (JIRA)" <j...@apache.org>
Subject [jira] Resolved: (HADOOP-433) Better access to the RecordReader
Date Thu, 24 May 2007 06:39:16 GMT

     [ https://issues.apache.org/jira/browse/HADOOP-433?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Owen O'Malley resolved HADOOP-433.
----------------------------------

       Resolution: Duplicate
    Fix Version/s: 0.13.0
         Assignee:     (was: Owen O'Malley)

This was fixed by HADOOP-1251, which added a method to the Reporter that provides the InputSplit
to the mapper.

> Better access to the RecordReader
> ---------------------------------
>
>                 Key: HADOOP-433
>                 URL: https://issues.apache.org/jira/browse/HADOOP-433
>             Project: Hadoop
>          Issue Type: Improvement
>          Components: mapred
>    Affects Versions: 0.5.0
>            Reporter: Benjamin Reed
>            Priority: Minor
>             Fix For: 0.13.0
>
>
> The record reader has access to the FileSplit which can in turn have information that
is useful to the Mapper. For example, Map processing may vary according to file name or attributes
associated with a file. Unfortunately, even using a MapRunner you only have access to the
progress wrapper of the RecordReader. To get access to the real record reader I had to use
a thread local variable which I set in RecordReader.getNext(). It would be much nicer if you
could get a reference to the real RecordReader from the RecordReader passed to MapRunner.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message