Message-ID: <1259330561.11961270834310897.JavaMail.jira@brutus.apache.org>
Date: Fri, 9 Apr 2010 17:31:50 +0000 (UTC)
From: "Vinod K V (JIRA)"
To: common-issues@hadoop.apache.org
Reply-To: common-issues@hadoop.apache.org
Mailing-List: contact common-issues-help@hadoop.apache.org; run by ezmlm
Subject: [jira] Commented: (HADOOP-4322) Input/Output Format for TFile
MIME-Version: 1.0
Content-Type: text/plain; charset=utf-8
Content-Transfer-Encoding: 7bit


    [ https://issues.apache.org/jira/browse/HADOOP-4322?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12855483#action_12855483 ]

Vinod K V commented on HADOOP-4322:
-----------------------------------

When this patch was tested internally, we found a problem: the job reports a negative "input bytes" count when using this output format. I don't have any context on this; I am just updating this bug with the analysis by Rahul.

{code}
ObjectFileRecordReader has a getPos() method implementation; this method returns
incorrect values for the offset.

The code flow in the framework is as follows.
=====
beforePos = getPos();
// call to the user's record reader next() method
afterPos = getPos();
// then, for the counter, we do the following:
inputByteCounter.increment(afterPos - beforePos); // this is the counter in question
=====
With ObjectFileRecordReader's getPos() method, afterPos < beforePos, which results
in a negative increment to the counter.
{code}

So this patch shouldn't be committed as is without a second look.

> Input/Output Format for TFile
> -----------------------------
>
>                 Key: HADOOP-4322
>                 URL: https://issues.apache.org/jira/browse/HADOOP-4322
>             Project: Hadoop Common
>          Issue Type: New Feature
>            Reporter: Amir Youssefi
>            Assignee: Amir Youssefi
>         Attachments: ObjectFileInputOutputFormat_1.patch
>
>
> Input/Output Format for TFile

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
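
For anyone who wants to see the counter arithmetic from the analysis above in isolation, here is a minimal standalone sketch. It does not use Hadoop classes: `Counter`, `PositionedReader`, `replay`, and `countInputBytes` are hypothetical names made up for illustration, and the loop only mirrors the beforePos/afterPos flow quoted in the comment, not the actual framework code. It assumes the reader simply replays a fixed list of positions.

```java
public class InputByteCounterSketch {

    /** Hypothetical stand-in for the framework's input byte counter. */
    static final class Counter {
        private long value;
        void increment(long delta) { value += delta; }
        long get() { return value; }
    }

    /** Hypothetical minimal reader: advance one record, report the current position. */
    public interface PositionedReader {
        boolean next();
        long getPos();
    }

    /** Reader that replays a fixed, non-empty list of positions, one per record read. */
    public static PositionedReader replay(long... positions) {
        return new PositionedReader() {
            private int index = 0;

            @Override
            public boolean next() {
                if (index + 1 >= positions.length) {
                    return false; // no more records
                }
                index++;
                return true;
            }

            @Override
            public long getPos() {
                return positions[index];
            }
        };
    }

    /** Mirrors the quoted flow: the counter grows by afterPos - beforePos per record. */
    public static long countInputBytes(PositionedReader reader) {
        Counter inputByteCounter = new Counter();
        while (true) {
            long beforePos = reader.getPos();
            boolean hasRecord = reader.next();
            long afterPos = reader.getPos();
            if (!hasRecord) {
                break;
            }
            inputByteCounter.increment(afterPos - beforePos);
        }
        return inputByteCounter.get();
    }

    public static void main(String[] args) {
        // Positions that only move forward give a non-negative total.
        System.out.println(countInputBytes(replay(0, 10, 25, 40)));
        // Positions that move backwards (afterPos < beforePos) drive the total negative.
        System.out.println(countInputBytes(replay(100, 60, 30, 0)));
    }
}
```

The point of the sketch is that the counter is only as good as getPos(): the per-record deltas telescope to the last position minus the first, so any reader whose reported position decreases while reading will produce a negative total.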