Message-ID: <19139050.147161277954332422.JavaMail.jira@thor>
Date: Wed, 30 Jun 2010 23:18:52 -0400 (EDT)
From: "Amareshwari Sriramadasu (JIRA)"
To: common-issues@hadoop.apache.org
Reply-To: common-issues@hadoop.apache.org
Subject: [jira] Commented: (HADOOP-5436) job history directory grows without bound, locks up job tracker on new job submission

    [ https://issues.apache.org/jira/browse/HADOOP-5436?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12884145#action_12884145 ]

Amareshwari Sriramadasu commented on HADOOP-5436:
-------------------------------------------------

The job history directory structure will be fixed by MAPREDUCE-323. That should solve the issue.

> job history directory grows without bound, locks up job tracker on new job submission
> -------------------------------------------------------------------------------------
>
>                 Key: HADOOP-5436
>                 URL: https://issues.apache.org/jira/browse/HADOOP-5436
>             Project: Hadoop Common
>          Issue Type: Bug
>    Affects Versions: 0.19.0, 0.20.0, 0.20.1, 0.20.2
>            Reporter: Tim Williamson
>             Fix For: 0.22.0
>
>         Attachments: HADOOP-5436.patch
>
>
> An unpleasant surprise upgrading to 0.19: requests to jobtracker.jsp would take a long time or even time out whenever new jobs were submitted. Investigation showed the call to JobInProgress.initTasks() was calling JobHistory.JobInfo.logSubmitted(), which in turn was calling JobHistory.getJobHistoryFileName(), which was pegging the CPU for a couple of minutes. Further investigation showed there were 200,000+ files in the job history folder -- and every submission was creating a FileStatus for each of them, then applying a regular expression to just the name. All this just on the off chance the job tracker had been restarted (see HADOOP-3245). To make matters worse, these files cannot be safely deleted while the job tracker is running, as the disappearance of a history file at the wrong time causes a FileNotFoundException.
>
> So to summarize the issues:
> - having Hadoop default to storing all the history files in a single directory is a Bad Idea
> - doing expensive processing of every history file on every job submission is a Worse Idea
> - doing expensive processing of every history file on every job submission while holding a lock on the JobInProgress object and thereby blocking the jobtracker.jsp from rendering is a Terrible Idea (note: haven't confirmed this, but a cursory glance suggests that's what's going on)
> - not being able to clean up the mess without taking down the job tracker is just Unfortunate

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
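
For illustration only, here is a minimal sketch of the general idea behind restructuring the history directory: if history files are bucketed into per-day subdirectories, a lookup for one job lists a single small directory instead of scanning 200,000+ entries in one flat folder. This is not Hadoop code and not necessarily the layout MAPREDUCE-323 adopts. The class HistoryBuckets, its method names, the year/month/day layout, and the ".hist" suffix are all hypothetical, and the sketch uses the local filesystem via java.nio rather than Hadoop's FileSystem API.

```java
import java.io.IOException;
import java.nio.file.DirectoryStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.time.LocalDate;
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not part of Hadoop: buckets history files by
// submission date so that a lookup only lists one day's directory.
final class HistoryBuckets {

    private HistoryBuckets() {}

    // Directory holding history files for jobs submitted on the given day,
    // e.g. <historyRoot>/2010/07/01 (assumed layout).
    static Path bucketFor(Path historyRoot, LocalDate submitted) {
        return historyRoot
                .resolve(String.format("%04d", submitted.getYear()))
                .resolve(String.format("%02d", submitted.getMonthValue()))
                .resolve(String.format("%02d", submitted.getDayOfMonth()));
    }

    // Creates the day's bucket if needed and an empty history file inside it.
    // The ".hist" suffix is an assumption made for this sketch.
    static Path createHistoryFile(Path historyRoot, LocalDate submitted, String jobId)
            throws IOException {
        Path bucket = Files.createDirectories(bucketFor(historyRoot, submitted));
        return Files.createFile(bucket.resolve(jobId + ".hist"));
    }

    // Finds history files for a job by listing only that day's bucket,
    // matching on a plain name prefix instead of a regular expression.
    static List<Path> findHistoryFiles(Path historyRoot, LocalDate submitted, String jobId)
            throws IOException {
        List<Path> matches = new ArrayList<>();
        Path bucket = bucketFor(historyRoot, submitted);
        if (!Files.isDirectory(bucket)) {
            return matches; // no jobs were recorded for that day
        }
        try (DirectoryStream<Path> stream = Files.newDirectoryStream(
                bucket, p -> p.getFileName().toString().startsWith(jobId))) {
            for (Path p : stream) {
                matches.add(p);
            }
        }
        return matches;
    }
}
```

With a layout like this, the cost of a lookup scales with the number of jobs submitted on one day rather than with the total history size, and old buckets could in principle be archived or removed without touching the directory that new submissions read.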