Message-ID: <4612446.1203102608350.JavaMail.jira@brutus>
Date: Fri, 15 Feb 2008 11:10:08 -0800 (PST)
From: "Raghu Angadi (JIRA)"
To: core-dev@hadoop.apache.org
Subject: [jira] Issue Comment Edited: (HADOOP-2758) Reduce memory copies when data is read from DFS
In-Reply-To: <29870993.1201814709005.JavaMail.jira@brutus>

[ https://issues.apache.org/jira/browse/HADOOP-2758?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12569375#action_12569375 ]

rangadi edited comment on HADOOP-2758 at 2/15/08 11:09 AM:
----------------------------------------------------------------

Comparison of a single instance of 'dfs -cat 5Gbfile > /dev/null' with 'cat 5Gbfile > /dev/null'. All the data resides locally on a 4-disk RAID0 partition:

|| min:sec || cat || dfs -cat with 0.16 || dfs -cat with the patch ||
| run 1 | 2:40 | 3:44 | 3:24 |
| run 2 | 2:56 | 3:05 | 3:51 |
| run 3 | 3:01 | 3:18 | 2:51 |

What would you conclude? Both of the obvious conclusions are incorrect:
# dfs -cat is almost as good as simple cat.
# this patch does not help much.

If we had a single-disk partition, the numbers would be even closer.

> Reduce memory copies when data is read from DFS
> -----------------------------------------------
>
>                 Key: HADOOP-2758
>                 URL: https://issues.apache.org/jira/browse/HADOOP-2758
>             Project: Hadoop Core
>          Issue Type: Improvement
>          Components: dfs
>            Reporter: Raghu Angadi
>            Assignee: Raghu Angadi
>             Fix For: 0.17.0
>
>         Attachments: HADOOP-2758.patch
>
>
> Currently the datanode and client parts of DFS perform multiple copies of data on the 'read path' (i.e. the path from storage on the datanode to the user buffer on the client). This jira reduces these copies by enhancing the data read protocol and the implementation of read on both the datanode and the client. I will describe the changes in the next comment.
> The requirement is that this fix should reduce CPU usage and should not cause a regression in any benchmark.
> It might not improve the benchmarks, since most benchmarks are not CPU bound.
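
As a rough illustration of why the timing table in the comment is hard to read at face value, the sketch below converts the min:sec entries to seconds and reports the mean and the run-to-run spread for each column. The numbers are copied from the table; the helper names `to_seconds` and `summarize` are made up for this example, and the script only summarizes wall-clock time. It says nothing about CPU usage, which is what the issue actually aims to reduce.

```python
from statistics import mean

# Wall-clock times (min:sec) copied from the table in the comment.
RUNS = {
    "cat": ["2:40", "2:56", "3:01"],
    "dfs -cat with 0.16": ["3:44", "3:05", "3:18"],
    "dfs -cat with the patch": ["3:24", "3:51", "2:51"],
}


def to_seconds(min_sec):
    """Convert a 'min:sec' string such as '2:40' to whole seconds."""
    minutes, seconds = min_sec.split(":")
    return int(minutes) * 60 + int(seconds)


def summarize(runs):
    """Return {label: (mean_seconds, spread_seconds)} for each column.

    'spread' is simply slowest run minus fastest run.
    """
    summary = {}
    for label, times in runs.items():
        secs = [to_seconds(t) for t in times]
        summary[label] = (mean(secs), max(secs) - min(secs))
    return summary


if __name__ == "__main__":
    for label, (avg, spread) in summarize(RUNS).items():
        print(f"{label:26s} mean {avg:6.1f} s   spread {spread:3d} s")
```

With these inputs the two dfs -cat columns average about 202 seconds each, while the spread within a column is 39 seconds for 0.16 and 60 seconds with the patch. Three wall-clock runs therefore cannot separate the two builds. One reading consistent with the issue description is that the test is not CPU bound, so a CPU-time measurement would be needed to see the effect of the patch.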