Return-Path: X-Original-To: apmail-hadoop-hdfs-user-archive@minotaur.apache.org Delivered-To: apmail-hadoop-hdfs-user-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 1B07A10B4A for ; Mon, 1 Jul 2013 20:38:23 +0000 (UTC) Received: (qmail 87808 invoked by uid 500); 1 Jul 2013 20:38:18 -0000 Delivered-To: apmail-hadoop-hdfs-user-archive@hadoop.apache.org Received: (qmail 87689 invoked by uid 500); 1 Jul 2013 20:38:17 -0000 Mailing-List: contact user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hadoop.apache.org Delivered-To: mailing list user@hadoop.apache.org Received: (qmail 87682 invoked by uid 99); 1 Jul 2013 20:38:17 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 01 Jul 2013 20:38:17 +0000 X-ASF-Spam-Status: No, hits=2.2 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_NONE,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of john.lilley@redpoint.net designates 206.225.164.218 as permitted sender) Received: from [206.225.164.218] (HELO hub021-nj-3.exch021.serverdata.net) (206.225.164.218) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 01 Jul 2013 20:38:11 +0000 Received: from MBX021-E3-NJ-2.exch021.domain.local ([10.240.4.78]) by HUB021-NJ-3.exch021.domain.local ([10.240.4.36]) with mapi id 14.03.0123.003; Mon, 1 Jul 2013 13:37:50 -0700 From: John Lilley To: "user@hadoop.apache.org" Subject: temporary folders for YARN tasks Thread-Topic: temporary folders for YARN tasks Thread-Index: Ac52mkiuCVeU3YoERPO886Nqy4nwbA== Date: Mon, 1 Jul 2013 20:37:48 +0000 Message-ID: <869970D71E26D7498BDAC4E1CA92226B658D578D@MBX021-E3-NJ-2.exch021.domain.local> Accept-Language: en-US Content-Language: en-US X-MS-Has-Attach: X-MS-TNEF-Correlator: x-originating-ip: [173.160.43.61] Content-Type: multipart/alternative; boundary="_000_869970D71E26D7498BDAC4E1CA92226B658D578DMBX021E3NJ2exch_" MIME-Version: 1.0 X-Virus-Checked: Checked by ClamAV on apache.org --_000_869970D71E26D7498BDAC4E1CA92226B658D578DMBX021E3NJ2exch_ Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: quoted-printable When a YARN app and its tasks wants to write temporary files, how does it k= now where to write the files? I am assuming that each task has some temporary space available, and I hope= it is available across multiple disk volumes for parallel performance. Are those files cleaned up automatically after task exit? If I want to give lifetime control of the files to an auxiliary service (al= ong the lines of MR shuffle passing files to the aux service), how would I = do that, and would that entail different file locations? Thanks John --_000_869970D71E26D7498BDAC4E1CA92226B658D578DMBX021E3NJ2exch_ Content-Type: text/html; charset="us-ascii" Content-Transfer-Encoding: quoted-printable

When a YARN app and its tasks wants to write tempora= ry files, how does it know where to write the files? 

I am assuming that each task has some temporary spac= e available, and I hope it is available across multiple disk volumes for pa= rallel performance.

Are those files cleaned up automatically after task = exit?

If I want to give lifetime control of the files to a= n auxiliary service (along the lines of MR shuffle passing files to the aux= service), how would I do that, and would that entail different file locati= ons?

Thanks

John

 

 

--_000_869970D71E26D7498BDAC4E1CA92226B658D578DMBX021E3NJ2exch_--