Date: Fri, 10 Jun 2011 13:28:54 +0800
From: "hailong.yang1115"
To: "general"
Subject: Problems about the job counters

Dear all,

I am trying to run the built-in wordcount example with nearly 15GB of input. When the Hadoop job finished, I got the following counters.

Counter                          Map              Reduce            Total

Job Counters
  Launched reduce tasks            0                   0                1
  Rack-local map tasks             0                   0               35
  Launched map tasks               0                   0            2,318
  Data-local map tasks             0                   0            2,283

FileSystemCounters
  FILE_BYTES_READ     22,863,580,656      17,654,943,341   40,518,523,997
  HDFS_BYTES_READ    154,400,997,459                   0  154,400,997,459
  FILE_BYTES_WRITTEN  33,490,829,403      17,654,943,341   51,145,772,744
  HDFS_BYTES_WRITTEN               0       2,747,356,704    2,747,356,704

My question is: what does the FILE_BYTES_READ counter mean? And what is the difference between FILE_BYTES_READ and HDFS_BYTES_READ? As I understand it, all the input is located in HDFS, so where does FILE_BYTES_READ come from during the map phase?

Any help will be appreciated!

Hailong
2011-06-10

***********************************************
* Hailong Yang, PhD. Candidate
* Sino-German Joint Software Institute,
* School of Computer Science&Engineering, Beihang University
* Phone: (86-010)82315908
* Email: hailong.yang1115@gmail.com
* Address: G413, New Main Building in Beihang University,
*          No.37 XueYuan Road,HaiDian District,
*          Beijing,P.R.China,100191
***********************************************