hadoop-mapreduce-issues mailing list archives

From "Tom White (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (MAPREDUCE-2602) Allow setting of end-of-record delimiter for TextInputFormat (for the old API)
Date Wed, 22 Jun 2011 00:38:48 GMT

    [ https://issues.apache.org/jira/browse/MAPREDUCE-2602?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13052974#comment-13052974 ]

Tom White commented on MAPREDUCE-2602:
--------------------------------------

+1

> Allow setting of end-of-record delimiter for TextInputFormat (for the old API)
> ------------------------------------------------------------------------------
>
>                 Key: MAPREDUCE-2602
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-2602
>             Project: Hadoop Map/Reduce
>          Issue Type: Improvement
>            Reporter: Ahmed Radwan
>            Assignee: Ahmed Radwan
>         Attachments: MAPREDUCE-2602.patch
>
>
> Since there are users who are still using the old MR API, it will be useful to modify
> the org.apache.hadoop.mapred.LineRecordReader and org.apache.hadoop.mapred.TextInputFormat
> to be able to use custom (user-specified) end-of-record delimiters. This will make use of
> the LineReader improvement introduced in HADOOP-7096 that enables the LineReader to break
> lines at user-specified delimiters.
> Note: MAPREDUCE-2254 already added this improvement to the new API (but not the old API).
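
To illustrate what breaking input at a user-specified end-of-record delimiter means, here is a minimal, self-contained sketch in plain Java. It is not the code from MAPREDUCE-2602.patch and does not use Hadoop's LineReader; the class name RecordSplitSketch and the method splitRecords are hypothetical names chosen for this illustration, and the sample input is made up.

```java
import java.io.IOException;
import java.io.Reader;
import java.io.StringReader;
import java.io.UncheckedIOException;
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration only: not Hadoop's LineReader or LineRecordReader.
public class RecordSplitSketch {

    // Reads characters one at a time and emits a record each time the text
    // read so far ends with the given delimiter. The delimiter itself is
    // dropped from the record, as a newline would be with default settings.
    static List<String> splitRecords(Reader in, String delimiter) {
        if (delimiter.isEmpty()) {
            throw new IllegalArgumentException("delimiter must not be empty");
        }
        List<String> records = new ArrayList<>();
        StringBuilder current = new StringBuilder();
        try {
            int c;
            while ((c = in.read()) != -1) {
                current.append((char) c);
                int start = current.length() - delimiter.length();
                // True only when the buffer currently ends with the delimiter.
                if (start >= 0 && current.indexOf(delimiter, start) == start) {
                    records.add(current.substring(0, start));
                    current.setLength(0);
                }
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        // A trailing record without a closing delimiter is still returned.
        if (current.length() > 0) {
            records.add(current.toString());
        }
        return records;
    }

    public static void main(String[] args) {
        // Made-up input whose records end with "||" instead of a newline.
        Reader input = new StringReader("id=1;name=a||id=2;name=b||");
        for (String record : splitRecords(input, "||")) {
            System.out.println(record);
        }
    }
}
```

The sketch works on characters for readability; a real record reader would operate on bytes and would also have to handle records that cross input split boundaries, which is not attempted here.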

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        
