Return-Path: X-Original-To: apmail-hadoop-hdfs-user-archive@minotaur.apache.org Delivered-To: apmail-hadoop-hdfs-user-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id DC0C6ECE2 for ; Wed, 26 Dec 2012 11:46:47 +0000 (UTC) Received: (qmail 23429 invoked by uid 500); 26 Dec 2012 11:46:42 -0000 Delivered-To: apmail-hadoop-hdfs-user-archive@hadoop.apache.org Received: (qmail 22984 invoked by uid 500); 26 Dec 2012 11:46:40 -0000 Mailing-List: contact user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hadoop.apache.org Delivered-To: mailing list user@hadoop.apache.org Received: (qmail 22946 invoked by uid 99); 26 Dec 2012 11:46:39 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 26 Dec 2012 11:46:39 +0000 X-ASF-Spam-Status: No, hits=-0.7 required=5.0 tests=RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of e.v.skaley@gmail.com designates 209.85.214.50 as permitted sender) Received: from [209.85.214.50] (HELO mail-bk0-f50.google.com) (209.85.214.50) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 26 Dec 2012 11:46:31 +0000 Received: by mail-bk0-f50.google.com with SMTP id jf3so3872724bkc.37 for ; Wed, 26 Dec 2012 03:46:09 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=x-received:message-id:date:from:user-agent:mime-version:to:cc :subject:references:in-reply-to:content-type :content-transfer-encoding; bh=8BhvrRfknwmkVOH441Tc6ULUx0YqTgCw3WfxG3cHhIU=; b=hO76qYEcVBK8Y5N6S/WvkNuefl4fZE4DjBt5QqDlu2/huZ0UzQHokapfq1bJ5Jpu86 o6vMcY4TMmgxKHtKLSxaEBbUEp88PB5x7nM7YDCKTi4ia+BeLznnYzVkainmsyNwbnk6 zaZeeCNfDYhGVyqcfrK/FJI3x0nBWRhK8/jOpA8q6HJx1Fd7EsJkNk377yfo/a50vhj9 R/ImpNhnb6hiB7f9J6ganMzL1sl/TheXGSCX7TlB6HyLJiG8Wbh2OUb1wdryhDPvZOS8 lNO1cjHLh0km4sMIdkonIVtt7xUNTg8KuKoxRR5Y+kQ5JyIUZB67jjvw+Hg7Z7szSsgA aSVw== X-Received: by 10.204.148.134 with SMTP id p6mr12825800bkv.75.1356522369456; Wed, 26 Dec 2012 03:46:09 -0800 (PST) Received: from [192.168.178.45] (port-92-203-106-88.dynamic.qsc.de. [92.203.106.88]) by mx.google.com with ESMTPS id o7sm18384217bkv.13.2012.12.26.03.46.07 (version=SSLv3 cipher=OTHER); Wed, 26 Dec 2012 03:46:08 -0800 (PST) Message-ID: <50DAE37E.6000709@gmail.com> Date: Wed, 26 Dec 2012 12:46:06 +0100 From: Eduard Skaley User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:17.0) Gecko/17.0 Thunderbird/17.0 MIME-Version: 1.0 To: user@hadoop.apache.org CC: Harsh J Subject: Re: Map Shuffle Bytes References: <50D99D23.9030902@gmail.com> In-Reply-To: Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit X-Virus-Checked: Checked by ClamAV on apache.org Hi, I mean TO the mappers. I'm using the CompositeInputFormat for my application to compute map-side joins. I want to join two datasets A and B one is stored on node 1 and the other one on node 2. For example if the join will be computed on node 2 then the inputsplit of the dataset which is stored on node 1 has to be transferred to node 2. I want to count the bytes which are shuffled (transferred) TO the mapper of node 2. > Hi, > > What do you mean by "shuffled bytes [to] the mappers"? If you mean > "from", it is "Reduce shuffle bytes" you look for; otherwise, you may > be looking for the per-map counter of "Map output bytes". > > Per-partition counters can be constructed on the user side if needed, > by pre-computing the partition before emit (using the same > partitioner) and counting up the bytes of your objects for its > counter. > > On Tue, Dec 25, 2012 at 6:03 PM, Eduard Skaley wrote: >> Hello guys, >> >> I need a counter for shuffled bytes to the mappers. >> Is there existing one or should I define one myself ? >> How can I implement such a counter? >> >> Thank you and happy Christmas time, >> Eduard > >