Return-Path: X-Original-To: apmail-hadoop-yarn-dev-archive@minotaur.apache.org Delivered-To: apmail-hadoop-yarn-dev-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 354B818A12 for ; Tue, 15 Dec 2015 05:18:24 +0000 (UTC) Received: (qmail 50666 invoked by uid 500); 15 Dec 2015 05:18:23 -0000 Delivered-To: apmail-hadoop-yarn-dev-archive@hadoop.apache.org Received: (qmail 50444 invoked by uid 500); 15 Dec 2015 05:18:23 -0000 Mailing-List: contact yarn-dev-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: yarn-dev@hadoop.apache.org Delivered-To: mailing list yarn-dev@hadoop.apache.org Received: (qmail 50432 invoked by uid 99); 15 Dec 2015 05:18:23 -0000 Received: from Unknown (HELO spamd2-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 15 Dec 2015 05:18:23 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd2-us-west.apache.org (ASF Mail Server at spamd2-us-west.apache.org) with ESMTP id DAB721A0483 for ; Tue, 15 Dec 2015 05:18:22 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd2-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 2.899 X-Spam-Level: ** X-Spam-Status: No, score=2.899 tagged_above=-999 required=6.31 tests=[DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, HTML_MESSAGE=3, RCVD_IN_MSPIKE_H2=-0.001, SPF_PASS=-0.001, URIBL_BLOCKED=0.001] autolearn=disabled Authentication-Results: spamd2-us-west.apache.org (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com Received: from mx1-eu-west.apache.org ([10.40.0.8]) by localhost (spamd2-us-west.apache.org [10.40.0.9]) (amavisd-new, port 10024) with ESMTP id 5ixrE-J16gDl for ; Tue, 15 Dec 2015 05:18:17 +0000 (UTC) Received: from mail-io0-f175.google.com (mail-io0-f175.google.com [209.85.223.175]) by mx1-eu-west.apache.org (ASF Mail Server at mx1-eu-west.apache.org) with ESMTPS id 5F660206E3 for ; Tue, 15 Dec 2015 05:18:16 +0000 (UTC) Received: by mail-io0-f175.google.com with SMTP id e126so7894473ioa.1 for ; Mon, 14 Dec 2015 21:18:16 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :cc:content-type; bh=4D+6lNWlBp6f6KL8QQxgE1HOBUnXNSFdulEVMt120EM=; b=PcFQHW+QNUh2yii3LHqxrRCJDZImUg2J7+wtDjcphPBIGO1QDvWj7qY9CJooZLR9NP oREv2m69WJsyyiyXMIluks25KlSxD8j0VbazzhviYN5WAtmaH2TYabod0RmcWz+C7J/C k5CyZ34a/DQPFKqyDahTka195rybZnpS/HreEXV2HHvlfPk28jRKj5Vvjh+cdtIjmu30 L8Le+b3w7m4tlNDILiogheUhrZFt/nJlmwyyDvBU4Ao7h30CWs0hZOBX04Oer7QkstdK UPTtuFozkVk56QXMx7lEh41Nf0jN6uv8Sg68gsJH3wLTDdFokf7GZbU4Lz+wdBD+vJ5T zWiQ== MIME-Version: 1.0 X-Received: by 10.107.10.199 with SMTP id 68mr32381478iok.75.1450156689599; Mon, 14 Dec 2015 21:18:09 -0800 (PST) Received: by 10.64.1.133 with HTTP; Mon, 14 Dec 2015 21:18:09 -0800 (PST) In-Reply-To: <0FE709BE-FC67-43B4-964D-F6F8AF7D4248@hortonworks.com> References: <4335BFA2-D106-4AC5-8C21-9D8AB2F39516@hortonworks.com> <0FE709BE-FC67-43B4-964D-F6F8AF7D4248@hortonworks.com> Date: Tue, 15 Dec 2015 10:48:09 +0530 Message-ID: Subject: Re: [Timeline V2 branch] Latest timeline v2 and SMP problem From: Varun Saxena To: Li Lu Cc: "yarn-dev@hadoop.apache.org" , Sangjin Lee , Junping Du , Vrushali Channapattan , Joep Rottinghuis , "Naganarasimha G R (Naga)" Content-Type: multipart/alternative; boundary=001a113edd705a33d40526e8ea93 --001a113edd705a33d40526e8ea93 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable Hi Li, This is because we are registering the event in serviceStart() instead of serviceInit(). As SMP is the last service in the list, its started right in the end i.e. even after all the RPCs', UI related stuff. This can cause an app flow to start before the SMP/V2Publisher service has even started. This is what causes the issue. You want to raise JIRA for this issue or should I ? I can handle it. Regards, Varun Saxena. On Tue, Dec 15, 2015 at 8:35 AM, Li Lu wrote: > Thanks Sangjin. I=E2=80=99ll keep tracing this. Meanwhile, if anybody has > reproduced the problem, please feel free to let me know. Thanks! > > Li Lu > > On Dec 14, 2015, at 18:16, Sangjin Lee wrote: > > Can you bisect the commits to see if you can isolate which commit > introduced the issue? > > On Mon, Dec 14, 2015 at 5:39 PM, Li Lu wrote: > > Hi YARN developers working on Timeline v2 (YARN-2928) branch, > > I just realized I=E2=80=99ve accidentally turned off SMP for my local Tim= eline v2 > build. After I turned yarn.system-metrics-publisher.enabled back on, the = RM > fails to start with the following FATAL message: > > 2015-12-14 17:27:54,125 INFO ipc.Server (Server.java:run(797)) - IPC > Server listener on 8033: starting > 2015-12-14 17:27:54,127 FATAL event.AsyncDispatcher > (AsyncDispatcher.java:dispatch(189)) - Error in dispatcher thread true > java.lang.Exception: No handler for registered for class > > org.apache.hadoop.yarn.server.resourcemanager.metrics.AbstractSystemMetri= csPubli > sher$SystemMetricsEventType > at > > org.apache.hadoop.yarn.event.AsyncDispatcher.dispatch(AsyncDispatcher.jav= a:185) > at > > org.apache.hadoop.yarn.event.AsyncDispatcher$1.run(AsyncDispatcher.java:1= 09) > at java.lang.Thread.run(Thread.java:745) > 2015-12-14 17:27:54,127 INFO event.AsyncDispatcher > (AsyncDispatcher.java:register(208)) - Registering class > org.apache.hadoop.yarn.serve > > r.resourcemanager.metrics.AbstractSystemMetricsPublisher$SystemMetricsEve= ntType > for class org.apache.hadoop.yarn.server.resourcemanager.m > etrics.TimelineServiceV2Publisher$TimelineV2EventHandler > > Interestingly, we=E2=80=99re registering this class to timeline v2 handle= r in the > next line of log. I=E2=80=99m wondering if this is caused by some of my m= issing > configs, or a newly introduced issue? Has anybody on feature-YARN-2928 > branch noticed this issue? Thanks! > > Li Lu > > > > --001a113edd705a33d40526e8ea93--