hadoop-mapreduce-issues mailing list archives

From "Harsh J (Commented) (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (MAPREDUCE-280) TextInputFormat should allow different treatment on carriage return char '\r'
Date Tue, 24 Jan 2012 20:20:40 GMT

    [ https://issues.apache.org/jira/browse/MAPREDUCE-280?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13192473#comment-13192473 ]

Harsh J commented on MAPREDUCE-280:
-----------------------------------

Sorry for not linking the JIRAs at the time; I was closing out all the very old cases.
Here you go:

MAPREDUCE-2254. Allow setting of end-of-record delimiter for TextInputFormat.
MAPREDUCE-2602. Allow setting of end-of-record delimiter for TextInputFormat (for the old API).

Both are in 0.23.0, and I came across them in CDH3 about a month ago while looking for a
way to do something similar.
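
To illustrate what a configurable end-of-record delimiter changes, here is a minimal
standalone sketch. It is not the Hadoop implementation: the class RecordSplitSketch and the
method splitRecords are hypothetical names made up for this example. As far as I know, the
new-API change reads the delimiter from the textinputformat.record.delimiter configuration
property, but please verify that against the version you run.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not Hadoop code: splits text into records on one
// explicit delimiter and leaves every other character (including '\r') as data.
class RecordSplitSketch {

    static List<String> splitRecords(String data, String delimiter) {
        if (delimiter.isEmpty()) {
            throw new IllegalArgumentException("delimiter must not be empty");
        }
        List<String> records = new ArrayList<>();
        int start = 0;
        int end;
        // Cut a record at each occurrence of the delimiter.
        while ((end = data.indexOf(delimiter, start)) >= 0) {
            records.add(data.substring(start, end));
            start = end + delimiter.length();
        }
        // Emit a trailing record that has no terminating delimiter.
        if (start < data.length()) {
            records.add(data.substring(start));
        }
        return records;
    }

    public static void main(String[] args) {
        String data = "col1\rcol2\nnext line\n";
        for (String record : splitRecords(data, "\n")) {
            // Make '\r' visible so it is clear it stayed inside the record.
            System.out.println("[" + record.replace("\r", "\\r") + "]");
        }
    }
}
```

With "\n" as the only delimiter, the '\r' in the sample input stays inside the first record
rather than ending it, which is the behavior the original request asks for.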

If you feel the above does not fix it though, please do reopen this.
                
> TextInputFormat should allow different treatment on carriage return char '\r'
> -----------------------------------------------------------------------------
>
>                 Key: MAPREDUCE-280
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-280
>             Project: Hadoop Map/Reduce
>          Issue Type: Improvement
>            Reporter: Runping Qi
>            Assignee: Owen O'Malley
>
> The current implementation treats both '\r' and '\n' as line breaks. However, in some
> cases it is desirable to use '\n' strictly as the sole line break and to treat '\r' as
> part of the data in a line.
> One way to do this is to make the readline function a member function, so that the user
> can create a subclass that overrides it with the desired behavior.


        
