Return-Path: Delivered-To: apmail-hadoop-core-dev-archive@www.apache.org Received: (qmail 81964 invoked from network); 1 Aug 2008 17:55:03 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 1 Aug 2008 17:55:03 -0000 Received: (qmail 81221 invoked by uid 500); 1 Aug 2008 17:55:00 -0000 Delivered-To: apmail-hadoop-core-dev-archive@hadoop.apache.org Received: (qmail 81183 invoked by uid 500); 1 Aug 2008 17:55:00 -0000 Mailing-List: contact core-dev-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: core-dev@hadoop.apache.org Delivered-To: mailing list core-dev@hadoop.apache.org Received: (qmail 81172 invoked by uid 99); 1 Aug 2008 17:55:00 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 01 Aug 2008 10:55:00 -0700 X-ASF-Spam-Status: No, hits=-2000.0 required=10.0 tests=ALL_TRUSTED X-Spam-Check-By: apache.org Received: from [140.211.11.140] (HELO brutus.apache.org) (140.211.11.140) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 01 Aug 2008 17:54:05 +0000 Received: from brutus (localhost [127.0.0.1]) by brutus.apache.org (Postfix) with ESMTP id 6E492234C196 for ; Fri, 1 Aug 2008 10:54:32 -0700 (PDT) Message-ID: <425324443.1217613272450.JavaMail.jira@brutus> Date: Fri, 1 Aug 2008 10:54:32 -0700 (PDT) From: "Raghu Angadi (JIRA)" To: core-dev@hadoop.apache.org Subject: [jira] Commented: (HADOOP-3514) Reduce seeks during shuffle, by inline crcs In-Reply-To: <1777511993.1212934184996.JavaMail.jira@brutus> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-Virus-Checked: Checked by ClamAV on apache.org [ https://issues.apache.org/jira/browse/HADOOP-3514?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12619084#action_12619084 ] Raghu Angadi commented on HADOOP-3514: -------------------------------------- My nit : {{ChecksumInputStream}} and {{ChecksumOutputStream}} are in hadoop.io package seem to imply they are more general purpose checksum streams. But these don't seem so.. these are utilities for dealing with another stream that has 'checksum per record'. I would recommend 'Record' some where in the name of these classes or moving them to MR. > Reduce seeks during shuffle, by inline crcs > ------------------------------------------- > > Key: HADOOP-3514 > URL: https://issues.apache.org/jira/browse/HADOOP-3514 > Project: Hadoop Core > Issue Type: Improvement > Components: mapred > Affects Versions: 0.18.0 > Reporter: Devaraj Das > Assignee: Jothi Padmanabhan > Fix For: 0.19.0 > > Attachments: hadoop-3514-v1.patch, hadoop-3514-v2.patch, hadoop-3514.patch > > > The number of seeks can be reduced by half in the iFile if we move the crc into the iFile rather than having a separate file. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.