hadoop-mapreduce-issues mailing list archives

From "Ahmed Radwan (JIRA)" <j...@apache.org>
Subject [jira] [Created] (MAPREDUCE-2602) Allow setting of end-of-record delimiter for TextInputFormat (for the old API)
Date Fri, 17 Jun 2011 02:14:49 GMT
Allow setting of end-of-record delimiter for TextInputFormat (for the old API)
------------------------------------------------------------------------------

                 Key: MAPREDUCE-2602
                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-2602
             Project: Hadoop Map/Reduce
          Issue Type: Improvement
            Reporter: Ahmed Radwan
            Assignee: Ahmed Radwan


Some users still rely on the old MR API, so it would be useful to modify
org.apache.hadoop.mapred.LineRecordReader and org.apache.hadoop.mapred.TextInputFormat to
support custom (user-specified) end-of-record delimiters. This change builds on the
LineReader improvement introduced in HADOOP-7096, which enables LineReader to break lines
at user-specified delimiters.

Note: MAPREDUCE-2254 already added this improvement to the new API (but not the old API).
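
To illustrate what breaking input at a user-specified end-of-record delimiter means, here is
a minimal standalone sketch in plain Java. It is not the Hadoop LineReader or
LineRecordReader code: the class name DelimiterSketch and the method splitRecords are
hypothetical, the whole input is read into memory for simplicity, and the "|#|" delimiter
is only an example value.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration only; not part of Hadoop.
public class DelimiterSketch {

    // Splits the stream into records at each occurrence of the delimiter bytes.
    // The delimiter itself is not included in the returned records.
    public static List<String> splitRecords(InputStream in, byte[] delimiter)
            throws IOException {
        if (delimiter == null || delimiter.length == 0) {
            throw new IllegalArgumentException("delimiter must be non-empty");
        }
        // Assumption for brevity: the input fits in memory (Java 9+ readAllBytes).
        byte[] data = in.readAllBytes();
        List<String> records = new ArrayList<>();
        int start = 0; // first byte of the record currently being scanned
        int i = 0;
        while (i <= data.length - delimiter.length) {
            if (matchesAt(data, i, delimiter)) {
                records.add(new String(data, start, i - start, StandardCharsets.UTF_8));
                i += delimiter.length;
                start = i;
            } else {
                i++;
            }
        }
        // Emit the trailing record if the input does not end with a delimiter.
        if (start < data.length) {
            records.add(new String(data, start, data.length - start, StandardCharsets.UTF_8));
        }
        return records;
    }

    // Returns true if the delimiter bytes occur in data starting at offset pos.
    private static boolean matchesAt(byte[] data, int pos, byte[] delimiter) {
        for (int j = 0; j < delimiter.length; j++) {
            if (data[pos + j] != delimiter[j]) {
                return false;
            }
        }
        return true;
    }

    public static void main(String[] args) throws IOException {
        byte[] input = "first|#|second\nstill second|#|third".getBytes(StandardCharsets.UTF_8);
        byte[] delimiter = "|#|".getBytes(StandardCharsets.UTF_8);
        for (String record : splitRecords(new ByteArrayInputStream(input), delimiter)) {
            System.out.println("record: " + record);
        }
    }
}
```

Note that with a custom delimiter a newline no longer ends a record, so the second record
in the usage example spans two physical lines. A real record reader would also have to
stream the data and handle records that cross input split boundaries, which this sketch
does not attempt.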

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        
