hadoop-common-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Gelesh (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HADOOP-8654) TextInputFormat delimiter bug:- Input Text portion ends with & Delimiter starts with same char/char sequence
Date Mon, 15 Oct 2012 17:20:03 GMT

    [ https://issues.apache.org/jira/browse/HADOOP-8654?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13476274#comment-13476274
] 

Gelesh commented on HADOOP-8654:
--------------------------------

Thanks Arun...

                
> TextInputFormat delimiter  bug:- Input Text portion ends with & Delimiter starts
with same char/char sequence
> -------------------------------------------------------------------------------------------------------------
>
>                 Key: HADOOP-8654
>                 URL: https://issues.apache.org/jira/browse/HADOOP-8654
>             Project: Hadoop Common
>          Issue Type: Bug
>          Components: util
>    Affects Versions: 0.20.204.0, 1.0.3, 0.21.0, 2.0.0-alpha
>         Environment: Linux
>            Reporter: Gelesh
>              Labels: patch
>             Fix For: 3.0.0, 2.0.2-alpha
>
>         Attachments: HADOOP-8654.patch, MAPREDUCE-4512.txt
>
>   Original Estimate: 1m
>  Remaining Estimate: 1m
>
> TextInputFormat delimiter  bug scenario , a character sequence of the input text,  in
which the first character matches with the first character of delimiter, and the remaining
input text character sequence  matches with the entire delimiter character sequence from the
 starting position of the delimiter.
> eg   delimiter ="record";
> and Text =" record 1:- name = Gelesh e mail = gelesh.hadoop@gmail.com Location Bangalore
record 2: name = sdf  ..  location =Bangalorrecord 3: name .... " 
> Here string "=Bangalorrecord 3: " satisfy two conditions 
> 1) contains the delimiter "record"
> 2) The character / character sequence immediately before the delimiter (ie ' r ') matches
with first character (or character sequence ) of delimiter.  (ie "=Bangalor" ends with and
Delimiter starts with same character/char sequence 'r' ),
> Here the delimiter is not encountered by the program resulting in improper value text
in map that contains the delimiter   

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Mime
View raw message