hadoop-common-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Jason Lowe (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (HADOOP-8654) TextInputFormat delimiter bug:- Input Text portion ends with & Delimiter starts with same char/char sequence
Date Mon, 06 Aug 2012 15:13:03 GMT

     [ https://issues.apache.org/jira/browse/HADOOP-8654?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Jason Lowe updated HADOOP-8654:
-------------------------------

    Release Note:   (was: Dont find any  incompatible changes and features.)
    Hadoop Flags:   (was: Reviewed)

Moving to project Hadoop Common since that's where the patch needs to be applied.

In the future, please don't set the Reviewed flag unless the patch has been reviewed and approved
by someone in the community.  I see no record of that occurring, so I've cleared that flag.
 Also the Fix versions flag is intended to mark where the patch has been integrated, please
don't set this field.  If you'd like to indicate what versions you'd like to have the patch
committed to, use the Target Versions field.
                
> TextInputFormat delimiter  bug:- Input Text portion ends with & Delimiter starts
with same char/char sequence
> -------------------------------------------------------------------------------------------------------------
>
>                 Key: HADOOP-8654
>                 URL: https://issues.apache.org/jira/browse/HADOOP-8654
>             Project: Hadoop Common
>          Issue Type: Bug
>          Components: util
>    Affects Versions: 0.20.204.0, 1.0.3, 0.21.0, 2.0.0-alpha
>         Environment: Linux
>            Reporter: Gelesh
>              Labels: patch
>         Attachments: MAPREDUCE-4512.txt
>
>   Original Estimate: 1m
>  Remaining Estimate: 1m
>
> TextInputFormat delimiter  bug scenario , a character sequence of the input text,  in
which the first character matches with the first character of delimiter, and the remaining
input text character sequence  matches with the entire delimiter character sequence from the
 starting position of the delimiter.
> eg   delimiter ="record";
> and Text =" record 1:- name = Gelesh e mail = gelesh.hadoop@gmail.com Location Bangalore
record 2: name = sdf  ..  location =Bangalorrecord 3: name .... " 
> Here string "=Bangalorrecord 3: " satisfy two conditions 
> 1) contains the delimiter "record"
> 2) The character / character sequence immediately before the delimiter (ie ' r ') matches
with first character (or character sequence ) of delimiter.  (ie "=Bangalor" ends with and
Delimiter starts with same character/char sequence 'r' ),
> Here the delimiter is not encountered by the program resulting in improper value text
in map that contains the delimiter   

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Mime
View raw message