Return-Path: X-Original-To: apmail-hadoop-common-issues-archive@minotaur.apache.org Delivered-To: apmail-hadoop-common-issues-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 889AADD42 for ; Thu, 16 Aug 2012 14:37:39 +0000 (UTC) Received: (qmail 37094 invoked by uid 500); 16 Aug 2012 14:37:39 -0000 Delivered-To: apmail-hadoop-common-issues-archive@hadoop.apache.org Received: (qmail 37057 invoked by uid 500); 16 Aug 2012 14:37:39 -0000 Mailing-List: contact common-issues-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: common-issues@hadoop.apache.org Delivered-To: mailing list common-issues@hadoop.apache.org Received: (qmail 37048 invoked by uid 99); 16 Aug 2012 14:37:39 -0000 Received: from arcas.apache.org (HELO arcas) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 16 Aug 2012 14:37:39 +0000 Received: from arcas.apache.org (localhost [127.0.0.1]) by arcas (Postfix) with ESMTP id 955F42C5BE6 for ; Thu, 16 Aug 2012 14:37:38 +0000 (UTC) Date: Fri, 17 Aug 2012 01:37:38 +1100 (NCT) From: "Hudson (JIRA)" To: common-issues@hadoop.apache.org Message-ID: <2083152988.19011.1345127858612.JavaMail.jiratomcat@arcas> Subject: [jira] [Commented] (HADOOP-8654) TextInputFormat delimiter bug:- Input Text portion ends with & Delimiter starts with same char/char sequence MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/HADOOP-8654?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13435998#comment-13435998 ] Hudson commented on HADOOP-8654: -------------------------------- Integrated in Hadoop-Hdfs-trunk-Commit #2652 (See [https://builds.apache.org/job/Hadoop-Hdfs-trunk-Commit/2652/]) HADOOP-8654. TextInputFormat delimiter bug (Gelesh and Jason Lowe via bobby) (Revision 1373859) Result = SUCCESS bobby : http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1373859 Files : * /hadoop/common/trunk/hadoop-common-project/hadoop-common/CHANGES.txt * /hadoop/common/trunk/hadoop-common-project/hadoop-common/src/main/java/org/apache/hadoop/util/LineReader.java * /hadoop/common/trunk/hadoop-common-project/hadoop-common/src/test/java/org/apache/hadoop/util/TestLineReader.java > TextInputFormat delimiter bug:- Input Text portion ends with & Delimiter starts with same char/char sequence > ------------------------------------------------------------------------------------------------------------- > > Key: HADOOP-8654 > URL: https://issues.apache.org/jira/browse/HADOOP-8654 > Project: Hadoop Common > Issue Type: Bug > Components: util > Affects Versions: 0.20.204.0, 1.0.3, 0.21.0, 2.0.0-alpha > Environment: Linux > Reporter: Gelesh > Labels: patch > Fix For: 3.0.0, 2.2.0-alpha > > Attachments: HADOOP-8654.patch, MAPREDUCE-4512.txt > > Original Estimate: 1m > Remaining Estimate: 1m > > TextInputFormat delimiter bug scenario , a character sequence of the input text, in which the first character matches with the first character of delimiter, and the remaining input text character sequence matches with the entire delimiter character sequence from the starting position of the delimiter. > eg delimiter ="record"; > and Text =" record 1:- name = Gelesh e mail = gelesh.hadoop@gmail.com Location Bangalore record 2: name = sdf .. location =Bangalorrecord 3: name .... " > Here string "=Bangalorrecord 3: " satisfy two conditions > 1) contains the delimiter "record" > 2) The character / character sequence immediately before the delimiter (ie ' r ') matches with first character (or character sequence ) of delimiter. (ie "=Bangalor" ends with and Delimiter starts with same character/char sequence 'r' ), > Here the delimiter is not encountered by the program resulting in improper value text in map that contains the delimiter -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira