Return-Path: X-Original-To: apmail-hadoop-mapreduce-user-archive@minotaur.apache.org Delivered-To: apmail-hadoop-mapreduce-user-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id E6158102EC for ; Tue, 24 Feb 2015 00:46:14 +0000 (UTC) Received: (qmail 57287 invoked by uid 500); 24 Feb 2015 00:46:09 -0000 Delivered-To: apmail-hadoop-mapreduce-user-archive@hadoop.apache.org Received: (qmail 57174 invoked by uid 500); 24 Feb 2015 00:46:09 -0000 Mailing-List: contact user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hadoop.apache.org Delivered-To: mailing list user@hadoop.apache.org Received: (qmail 57152 invoked by uid 99); 24 Feb 2015 00:46:09 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 24 Feb 2015 00:46:09 +0000 X-ASF-Spam-Status: No, hits=1.5 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_LOW X-Spam-Check-By: apache.org Received-SPF: error (athena.apache.org: local policy) Received: from [209.85.218.50] (HELO mail-oi0-f50.google.com) (209.85.218.50) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 24 Feb 2015 00:46:03 +0000 Received: by mail-oi0-f50.google.com with SMTP id v1so16740252oia.9 for ; Mon, 23 Feb 2015 16:45:22 -0800 (PST) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20130820; h=x-gm-message-state:mime-version:in-reply-to:references:date :message-id:subject:from:to:content-type; bh=GbfFMLmb/LfzVuSpx3l5cYWtuHT80fOcJF7uxkblWAk=; b=evetbqPA7hCACqwQW3+kKrew5izaZBsIfZh0zPIA8kkmnb63KokflXkWyLW5FFOsBU KMEJMGGV7jsr1VRdVaw92saq/lvYy3GUk0WEWlUROYESm1aNL4TcUKcngJr0lim77RBQ jujxmZVlRgXzwDcFJDdGpq/DXgZkRqy2m3qxf5llOggVs6g6/wOUr0wXuXfd6Ba4HDkw LmpGj53nCIEF8VLHtAXpwbkBFUc2sAiFNFqr2Yl6CdMGz01gVtIk4H3dSaOQK/nvDcGt P5/CAujThylJEoYa3UYlSnIZQ1BGTdJG+khnfjkoVrXTWXyjQVKRfXea3OWFB+xuGun1 AqeA== X-Gm-Message-State: ALoCoQkugDMUWNB/DGag7iALkBeA8C+cSLQaC8VMqVAuiojIq6JGf/FLKoq7fCbP0WByYA66B2Mp MIME-Version: 1.0 X-Received: by 10.182.76.69 with SMTP id i5mr9607055obw.19.1424738722332; Mon, 23 Feb 2015 16:45:22 -0800 (PST) Received: by 10.182.38.132 with HTTP; Mon, 23 Feb 2015 16:45:22 -0800 (PST) In-Reply-To: References: Date: Tue, 24 Feb 2015 09:45:22 +0900 Message-ID: Subject: Re: tracking remote reads in datanode logs From: =?UTF-8?B?RHJha2Xrr7zsmIHqt7w=?= To: user Content-Type: multipart/alternative; boundary=047d7b6760c87153eb050fcad572 X-Virus-Checked: Checked by ClamAV on apache.org --047d7b6760c87153eb050fcad572 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable Hi, Igor Did you look at the mapreduce application master log? I think the local or rack local map tasks are logged in the MapReduce AM log. Good luck. Drake =EB=AF=BC=EC=98=81=EA=B7=BC Ph.D kt NexR On Tue, Feb 24, 2015 at 3:30 AM, Igor Bogomolov wrote: > Hi all, > > In a small cluster of 5 nodes that run CDH 5.3.0 (Hadoop 2.5.0) I want to > know how many remote map tasks (ones that read input data from remote > nodes) there are in a mapreduce job. For this purpose I took logs of each > datanode an looked for lines with "op: HDFS_READ" and cliID field that > contains map task id. > > Surprisingly, 4 datanode logs does not contain lines with "op: HDFS_READ"= . > Another 1 has many lines with "op: HDFS_READ" but all cliID look like > DFSClient_NONMAPREDUCE_* and does not contain any map task id. > > I concluded there are no remote map tasks but that does not look correct. > Also even local reads are not logged (because there is no line where cliI= D > field contains some map task id). Could anyone please explain what's wron= g? > Why logging is not working? (I use default settings). > > Chris, > > Found HADOOP-3062 > that you have implemented. Thought you might have an explanation. > > Best, > Igor > > --047d7b6760c87153eb050fcad572 Content-Type: text/html; charset=UTF-8 Content-Transfer-Encoding: quoted-printable
Hi, Igor

Did you look at the mapreduce = application master log? I think the local or rack local map tasks are logge= d in the MapReduce AM log.

Good luck.
<= div class=3D"gmail_extra">
Drake =EB=AF=BC=EC=98=81=EA=B7= =BC Ph.D
kt NexR

On Tue, Feb 24, 2015 at 3:30 AM, Igor Bogomo= lov <igor.bogomolov@gmail.com> wrote:
Hi all,
In a small cluster of 5 nodes that run CDH 5.3.0 (= Hadoop 2.5.0) I want to know how many remote map tasks (ones t= hat read input data from remote nodes) there are in a mapreduce job. For th= is purpose I took logs of each datanode an looked for lines w= ith "op: HDFS_READ" and cliID field tha= t contains map task id.

Surprisingly, 4 datanode= logs does not contain lines with "op: HDFS_READ". A= nother 1 has many lines with "op: HDFS_READ" but al= l cliID look like DFSClient_NONMAPREDUCE_* and does not contain any map task id.

I concluded there are = no remote map tasks but that does not look correct. Also even local reads a= re not logged (because there is no line where cliID field cont= ains some map task id). Could anyone please explain what's wrong? Why l= ogging is not working? (I use default settings).

Chris,
Found HADOOP-3062 that you h= ave implemented. Thought you might have an explanation.

Best,<= br>
Igor


--047d7b6760c87153eb050fcad572--