Return-Path: Delivered-To: apmail-lucene-hadoop-dev-archive@locus.apache.org Received: (qmail 53762 invoked from network); 2 Nov 2007 16:51:12 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 2 Nov 2007 16:51:12 -0000 Received: (qmail 16597 invoked by uid 500); 2 Nov 2007 16:51:00 -0000 Delivered-To: apmail-lucene-hadoop-dev-archive@lucene.apache.org Received: (qmail 16340 invoked by uid 500); 2 Nov 2007 16:50:59 -0000 Mailing-List: contact hadoop-dev-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: hadoop-dev@lucene.apache.org Delivered-To: mailing list hadoop-dev@lucene.apache.org Received: (qmail 16331 invoked by uid 99); 2 Nov 2007 16:50:59 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 02 Nov 2007 09:50:59 -0700 X-ASF-Spam-Status: No, hits=-99.0 required=10.0 tests=ALL_TRUSTED,FS_LARGE_PERCENT2 X-Spam-Check-By: apache.org Received: from [140.211.11.4] (HELO brutus.apache.org) (140.211.11.4) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 02 Nov 2007 16:51:11 +0000 Received: from brutus (localhost [127.0.0.1]) by brutus.apache.org (Postfix) with ESMTP id A02A9714201 for ; Fri, 2 Nov 2007 09:50:50 -0700 (PDT) Message-ID: <24149441.1194022250619.JavaMail.jira@brutus> Date: Fri, 2 Nov 2007 09:50:50 -0700 (PDT) From: "Runping Qi (JIRA)" To: hadoop-dev@lucene.apache.org Subject: [jira] Commented: (HADOOP-2144) Data node process consumes 180% cpu In-Reply-To: <4680191.1194012230760.JavaMail.jira@brutus> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-Virus-Checked: Checked by ClamAV on apache.org [ https://issues.apache.org/jira/browse/HADOOP-2144?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12539635 ] Runping Qi commented on HADOOP-2144: ------------------------------------ Overall cpu usage is 90+%. It is easy to reproduce. > Data node process consumes 180% cpu > ------------------------------------ > > Key: HADOOP-2144 > URL: https://issues.apache.org/jira/browse/HADOOP-2144 > Project: Hadoop > Issue Type: Improvement > Components: dfs > Reporter: Runping Qi > > I did a test on DFS read throughput and found that the data node > process consumes up to 180% cpu when it is under heavi load. Here are the details: > The cluster has 380+ machines, each with 3GB mem and 4 cpus and 4 disks. > I copied a 10GB file to dfs from one machine with a data node running there. > Based on the dfs block placement policy, that machine has one replica for each block of the file. > then I run 4 of the following commands in parellel: > hadoop dfs -cat thefile > /dev/null & > Since all the blocks have a local replica, all the read requests went to the local data node. > I observed that: > The data node process's cpu usage was around 180% for most of the time . > The clients's cpu usage was moderate (as it should be). > All the four disks were working concurrently with comparable read throughput. > The total read throughput was maxed at 90MB/Sec, about 60% of the expected total > aggregated max read throughput of 4 disks (160MB/Sec). Thus disks were not a bottleneck > in this case. > The data node's cpu usage seems unreasonably high. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.