Return-Path: X-Original-To: apmail-hadoop-mapreduce-user-archive@minotaur.apache.org Delivered-To: apmail-hadoop-mapreduce-user-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 5D420113F4 for ; Tue, 2 Sep 2014 21:29:25 +0000 (UTC) Received: (qmail 35123 invoked by uid 500); 2 Sep 2014 21:29:21 -0000 Delivered-To: apmail-hadoop-mapreduce-user-archive@hadoop.apache.org Received: (qmail 35004 invoked by uid 500); 2 Sep 2014 21:29:21 -0000 Mailing-List: contact user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hadoop.apache.org Delivered-To: mailing list user@hadoop.apache.org Received: (qmail 34988 invoked by uid 99); 2 Sep 2014 21:29:20 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 02 Sep 2014 21:29:20 +0000 X-ASF-Spam-Status: No, hits=2.2 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_NONE,SPF_HELO_PASS,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of john.lilley@redpoint.net designates 207.46.163.140 as permitted sender) Received: from [207.46.163.140] (HELO na01-bn1-obe.outbound.protection.outlook.com) (207.46.163.140) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 02 Sep 2014 21:28:52 +0000 Received: from DM2PR0701MB729.namprd07.prod.outlook.com (10.242.126.152) by DM2PR0701MB730.namprd07.prod.outlook.com (10.242.126.153) with Microsoft SMTP Server (TLS) id 15.0.1015.19; Tue, 2 Sep 2014 21:28:48 +0000 Received: from DM2PR0701MB729.namprd07.prod.outlook.com ([10.242.126.152]) by DM2PR0701MB729.namprd07.prod.outlook.com ([10.242.126.152]) with mapi id 15.00.1015.018; Tue, 2 Sep 2014 21:28:48 +0000 From: John Lilley To: "user@hadoop.apache.org" Subject: RE: YARN userapp cache lifetime: can't find core dump Thread-Topic: YARN userapp cache lifetime: can't find core dump Thread-Index: Ac/G6G/jAjridrXERRWB8nxPjnTP2gADGZ6w Date: Tue, 2 Sep 2014 21:28:48 +0000 Message-ID: <0c1ec5cd598340eab64b65377463f4f5@DM2PR0701MB729.namprd07.prod.outlook.com> References: <8ae1cc7692b64cc0a8946e1072023b7d@DM2PR0701MB729.namprd07.prod.outlook.com> In-Reply-To: <8ae1cc7692b64cc0a8946e1072023b7d@DM2PR0701MB729.namprd07.prod.outlook.com> Accept-Language: en-US Content-Language: en-US X-MS-Has-Attach: X-MS-TNEF-Correlator: x-ms-exchange-transport-fromentityheader: Hosted x-originating-ip: [173.160.43.60] x-microsoft-antispam: BCL:0;PCL:0;RULEID:;UriScan:; x-forefront-prvs: 0322B4EDE1 x-forefront-antispam-report: SFV:NSPM;SFS:(6009001)(199003)(164054003)(377454003)(189002)(81342001)(76576001)(19580405001)(2656002)(19300405004)(74316001)(15975445006)(80022001)(66066001)(81542001)(101416001)(21056001)(46102001)(19580395003)(86362001)(83322001)(92566001)(87936001)(77982001)(76482001)(83072002)(19625215002)(33646002)(85852003)(4396001)(31966008)(74662001)(74502001)(106356001)(99396002)(108616004)(107886001)(2351001)(107046002)(20776003)(79102001)(64706001)(105586002)(90102001)(54356999)(99286002)(85306004)(110136001)(15202345003)(16236675004)(50986999)(95666004)(2501002)(76176999)(24736002);DIR:OUT;SFP:;SCL:1;SRVR:DM2PR0701MB730;H:DM2PR0701MB729.namprd07.prod.outlook.com;FPR:;MLV:sfv;PTR:InfoNoRecords;A:1;MX:1;LANG:en; Content-Type: multipart/alternative; boundary="_000_0c1ec5cd598340eab64b65377463f4f5DM2PR0701MB729namprd07p_" MIME-Version: 1.0 X-OriginatorOrg: redpoint.net X-Virus-Checked: Checked by ClamAV on apache.org --_000_0c1ec5cd598340eab64b65377463f4f5DM2PR0701MB729namprd07p_ Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: quoted-printable I think I found it: yarn.nodemanager.delete.debug-delay-sec From: John Lilley [mailto:john.lilley@redpoint.net] Sent: Tuesday, September 02, 2014 2:02 PM To: 'user@hadoop.apache.org' Subject: YARN userapp cache lifetime: can't find core dump We have a YARN task that is core-dumping, and the JVM error log says: # Core dump written. Default location: /data2/hadoop/yarn/local/usercache/j= lilley/appcache/application_1405724043176_2453/container_1405724043176_2453= _01_000002/core or core.14801 However when I look at the node, everything below here is empty /data2/hadoop/yarn/local/usercache/jlilley/appcache I seem to recall there is a YARN setting to control the time these files ar= e kept around after application exit, but I can't figure out what it is. Thanks, john --_000_0c1ec5cd598340eab64b65377463f4f5DM2PR0701MB729namprd07p_ Content-Type: text/html; charset="us-ascii" Content-Transfer-Encoding: quoted-printable

I think I found it:

= yarn.nodemanager.delete.debug-= delay-sec

 

 

From: John Lilley [mailto:john.lilley@redpoin= t.net]
Sent: Tuesday, September 02, 2014 2:02 PM
To: 'user@hadoop.apache.org'
Subject: YARN userapp cache lifetime: can't find core dump

 

We have a YARN task that is core-dumping, and the JV= M error log says:

# Core dump written. Default location: /data2/hadoop= /yarn/local/usercache/jlilley/appcache/application_1405724043176_2453/conta= iner_1405724043176_2453_01_000002/core or core.14801

 

However when I look at the node, everything below he= re is empty

/data2/hadoop/yarn/local/usercache/jlilley/appcache<= o:p>

 

I seem to recall there is a YARN setting to control = the time these files are kept around after application exit, but I can̵= 7;t figure out what it is.

 

Thanks,

john

 

--_000_0c1ec5cd598340eab64b65377463f4f5DM2PR0701MB729namprd07p_--