Return-Path: X-Original-To: apmail-hadoop-mapreduce-user-archive@minotaur.apache.org Delivered-To: apmail-hadoop-mapreduce-user-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 8DADED82C for ; Thu, 1 Nov 2012 09:43:53 +0000 (UTC) Received: (qmail 51419 invoked by uid 500); 1 Nov 2012 09:43:48 -0000 Delivered-To: apmail-hadoop-mapreduce-user-archive@hadoop.apache.org Received: (qmail 51240 invoked by uid 500); 1 Nov 2012 09:43:48 -0000 Mailing-List: contact user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hadoop.apache.org Delivered-To: mailing list user@hadoop.apache.org Received: (qmail 51218 invoked by uid 99); 1 Nov 2012 09:43:47 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 01 Nov 2012 09:43:47 +0000 X-ASF-Spam-Status: No, hits=-0.7 required=5.0 tests=RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of harsh@cloudera.com designates 209.85.223.176 as permitted sender) Received: from [209.85.223.176] (HELO mail-ie0-f176.google.com) (209.85.223.176) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 01 Nov 2012 09:43:41 +0000 Received: by mail-ie0-f176.google.com with SMTP id k11so3794061iea.35 for ; Thu, 01 Nov 2012 02:43:20 -0700 (PDT) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=20120113; h=mime-version:in-reply-to:references:from:date:message-id:subject:to :content-type:x-gm-message-state; bh=owJobabQV2d3wOsnDwia2F4HhEzZGOe4lChGeWbgewo=; b=ht2GIq0u63fKT6P4sNZILppyvMnZaGH1PRXnmBfw60FzSRyW+byLLZiYbFToao23RD NUQZiDEmrOQ7bxC55gdWbRbmbpKeL7ycUug7OFY4p/uFUEooVk1mijvkNW/L7b3KyRzV PU+TtXnhyQg5eQvldp4seuHuldgFtSkFGLl3fsyEcCW+jCe4LmFuXm/WvQc8+nFBEGr3 iM4T8RbDKHBJVwsJkDw8zNJ/8kGNW+XQp35X6YzgvWiz4n6xAwsGNsE6sgd2AeCSzPS4 xKdpY5tT6XlW37XOQf2qTVlgs9rHThnIDXwzNukyWUWmXTgvFRg8rtENVohIaF58sPWd XWWQ== Received: by 10.43.12.132 with SMTP id pi4mr2747931icb.31.1351763000480; Thu, 01 Nov 2012 02:43:20 -0700 (PDT) MIME-Version: 1.0 Received: by 10.64.27.8 with HTTP; Thu, 1 Nov 2012 02:43:00 -0700 (PDT) In-Reply-To: References: From: Harsh J Date: Thu, 1 Nov 2012 15:13:00 +0530 Message-ID: Subject: Re: Low shuffle transfer speeds To: user@hadoop.apache.org Content-Type: text/plain; charset=ISO-8859-1 X-Gm-Message-State: ALoCoQmt7a7sO6QEAgvWapuN5t/NYacXPvhoLvm+UkOQ8q7iD0HXU/0bL8JYfhzmRo5q9X0sXm4L X-Virus-Checked: Checked by ClamAV on apache.org Hi, The reducer copies map outputs progressively (as and when they complete) unless configured otherwise. It is normal hence, for the overall average (thats what it is currently, unfortunately), to show up lower than the actual value since there are periods where the reducer is idle in waiting for further map task waves to complete. You can control the mapred.reduce.slowstart.completed.maps (5% (0.05) by default) to control the threshold of overall maps completion percentage the reducer should begin copying outputs at. An increased value, such as 80% (0.8) will let your Reducers copy more data continuously (since it does not have to wait much). On Thu, Nov 1, 2012 at 2:31 PM, john smith wrote: > Hi list, > > I have jobs that generate huge amount of intermediate data. For eg: One of > my job generates almost 12 GB map output. I have 8 datanodes/TTs and 1 > master. > > My reduce progress shows that the copy speed in range 0.55 - 1 MBps , but > normal file transfers between my datanodes generally go up to 40-50 MBps. > Why is my shuffle speed so slow? > > Also how is that number calculated ? What exactly does that signify? (Is it > the avg speed of all mappers to that particular reducer? or anything else?) > Any suggestions? > > Thanks -- Harsh J