hadoop-common-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Hudson (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HADOOP-8654) TextInputFormat delimiter bug:- Input Text portion ends with & Delimiter starts with same char/char sequence
Date Fri, 17 Aug 2012 13:56:38 GMT

    [ https://issues.apache.org/jira/browse/HADOOP-8654?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13436736#comment-13436736
] 

Hudson commented on HADOOP-8654:
--------------------------------

Integrated in Hadoop-Mapreduce-trunk #1169 (See [https://builds.apache.org/job/Hadoop-Mapreduce-trunk/1169/])
    HADOOP-8654. TextInputFormat delimiter bug (Gelesh and Jason Lowe via bobby) (Revision
1373859)

     Result = FAILURE
bobby : http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1373859
Files : 
* /hadoop/common/trunk/hadoop-common-project/hadoop-common/CHANGES.txt
* /hadoop/common/trunk/hadoop-common-project/hadoop-common/src/main/java/org/apache/hadoop/util/LineReader.java
* /hadoop/common/trunk/hadoop-common-project/hadoop-common/src/test/java/org/apache/hadoop/util/TestLineReader.java

                
> TextInputFormat delimiter  bug:- Input Text portion ends with & Delimiter starts
with same char/char sequence
> -------------------------------------------------------------------------------------------------------------
>
>                 Key: HADOOP-8654
>                 URL: https://issues.apache.org/jira/browse/HADOOP-8654
>             Project: Hadoop Common
>          Issue Type: Bug
>          Components: util
>    Affects Versions: 0.20.204.0, 1.0.3, 0.21.0, 2.0.0-alpha
>         Environment: Linux
>            Reporter: Gelesh
>              Labels: patch
>             Fix For: 3.0.0, 2.2.0-alpha
>
>         Attachments: HADOOP-8654.patch, MAPREDUCE-4512.txt
>
>   Original Estimate: 1m
>  Remaining Estimate: 1m
>
> TextInputFormat delimiter  bug scenario , a character sequence of the input text,  in
which the first character matches with the first character of delimiter, and the remaining
input text character sequence  matches with the entire delimiter character sequence from the
 starting position of the delimiter.
> eg   delimiter ="record";
> and Text =" record 1:- name = Gelesh e mail = gelesh.hadoop@gmail.com Location Bangalore
record 2: name = sdf  ..  location =Bangalorrecord 3: name .... " 
> Here string "=Bangalorrecord 3: " satisfy two conditions 
> 1) contains the delimiter "record"
> 2) The character / character sequence immediately before the delimiter (ie ' r ') matches
with first character (or character sequence ) of delimiter.  (ie "=Bangalor" ends with and
Delimiter starts with same character/char sequence 'r' ),
> Here the delimiter is not encountered by the program resulting in improper value text
in map that contains the delimiter   

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Mime
View raw message