Return-Path: X-Original-To: apmail-hadoop-hdfs-user-archive@minotaur.apache.org Delivered-To: apmail-hadoop-hdfs-user-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 3069EF3DC for ; Tue, 26 Mar 2013 20:44:12 +0000 (UTC) Received: (qmail 59995 invoked by uid 500); 26 Mar 2013 20:44:07 -0000 Delivered-To: apmail-hadoop-hdfs-user-archive@hadoop.apache.org Received: (qmail 59894 invoked by uid 500); 26 Mar 2013 20:44:07 -0000 Mailing-List: contact user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hadoop.apache.org Delivered-To: mailing list user@hadoop.apache.org Received: (qmail 59887 invoked by uid 99); 26 Mar 2013 20:44:07 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 26 Mar 2013 20:44:07 +0000 X-ASF-Spam-Status: No, hits=1.5 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of vinodkv@hortonworks.com designates 209.85.220.46 as permitted sender) Received: from [209.85.220.46] (HELO mail-pa0-f46.google.com) (209.85.220.46) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 26 Mar 2013 20:44:01 +0000 Received: by mail-pa0-f46.google.com with SMTP id rl6so12497pac.19 for ; Tue, 26 Mar 2013 13:43:41 -0700 (PDT) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=20120113; h=x-received:from:mime-version:content-type:subject:date:in-reply-to :to:references:message-id:x-mailer:x-gm-message-state; bh=b8cM8c6yf8TRoWIlKBMgO4FPu0CS74rRCxoA0uzj7jM=; b=BA3QG4qWXP9EHIHmNfj+/1eqvofto02xZN721GmaDL7ZYiMVXopXPkTUyjDHeyN7ak z5dPsAoSUgcnzJU3eZ+GsbhkbmnZXSLRFwlvJGhIUyBqnsg9xpB7OHWMXxsKhz8N3ljD 03VxZro8VlM2qZHXWDvJfL8nmN4HgrMrF+bXW5uSSh9W9BBjq2oPJ9N3TbUAFmgrjTzl vkT5PdTKdovyMhZXm2GmHqr9WYOwl6rOf6WWmPYinWbw8xIEAaxSryHtsu2IwtLRsRKL Ke91VUaMHZf7DjzB1klnGUa8WSgEeC5+r5twkbdKyws1J8b5AFlvdR0kz37IhMJRDpcu eNNQ== X-Received: by 10.68.12.103 with SMTP id x7mr9285422pbb.37.1364330621592; Tue, 26 Mar 2013 13:43:41 -0700 (PDT) Received: from [10.11.3.71] (host1.hortonworks.com. [70.35.59.2]) by mx.google.com with ESMTPS id tm1sm18684169pbc.11.2013.03.26.13.43.40 (version=TLSv1 cipher=ECDHE-RSA-RC4-SHA bits=128/128); Tue, 26 Mar 2013 13:43:40 -0700 (PDT) From: Vinod Kumar Vavilapalli Mime-Version: 1.0 (Apple Message framework v1283) Content-Type: multipart/alternative; boundary="Apple-Mail=_4E36DD4D-5907-4E29-BA08-7E215665717D" Subject: Re: Auto clean DistCache? Date: Tue, 26 Mar 2013 13:43:38 -0700 In-Reply-To: To: user@hadoop.apache.org References: Message-Id: <39971159-ACE1-4D74-96E0-21B5B0B56E48@apache.org> X-Mailer: Apple Mail (2.1283) X-Gm-Message-State: ALoCoQl86m9WTWmpeeM7fJn324FSI3Kj3g4oavO4A8h/ENXVgQpA9HfRy9kNU0EWaEBXMb0yT2MK X-Virus-Checked: Checked by ClamAV on apache.org --Apple-Mail=_4E36DD4D-5907-4E29-BA08-7E215665717D Content-Transfer-Encoding: quoted-printable Content-Type: text/plain; charset=us-ascii You can control the limit of these cache files, the default is 10GB = (value of 10737418240L): Try changing local.cache.size or = mapreduce.tasktracker.cache.local.size in mapred-site.xml Thanks, +Vinod Kumar Vavilapalli Hortonworks Inc. http://hortonworks.com/ On Mar 25, 2013, at 5:16 PM, Jean-Marc Spaggiari wrote: > Hi, >=20 > Each time my MR job is run, a directory is created on the TaskTracker > under mapred/local/taskTracker/hadoop/distcache (based on my > configuration). >=20 > I looked at the directory today, and it's hosting thousands of > directories and more than 8GB of data there. >=20 > Is there a way to automatically delete this directory when the job is = done? >=20 > Thanks, >=20 > JM --Apple-Mail=_4E36DD4D-5907-4E29-BA08-7E215665717D Content-Transfer-Encoding: 7bit Content-Type: text/html; charset=us-ascii

You can control the limit of these cache files, the default is 10GB (value of 10737418240L): Try changing local.cache.size or mapreduce.tasktracker.cache.local.size in mapred-site.xml

Thanks,
+Vinod Kumar Vavilapalli
Hortonworks Inc.
http://hortonworks.com/

On Mar 25, 2013, at 5:16 PM, Jean-Marc Spaggiari wrote:

Hi,

Each time my MR job is run, a directory is created on the TaskTracker
under mapred/local/taskTracker/hadoop/distcache (based on my
configuration).

I looked at the directory today, and it's hosting thousands of
directories and more than 8GB of data there.

Is there a way to automatically delete this directory when the job is done?

Thanks,

JM

--Apple-Mail=_4E36DD4D-5907-4E29-BA08-7E215665717D--