Message-ID: <19671789.94801280527294946.JavaMail.jira@thor>
Date: Fri, 30 Jul 2010 18:01:34 -0400 (EDT)
From: "Dick King (JIRA)"
To: mapreduce-issues@hadoop.apache.org
Subject: [jira] Commented: (MAPREDUCE-323) Improve the way job history files are managed

    [ https://issues.apache.org/jira/browse/MAPREDUCE-323?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12894173#action_12894173 ]

Dick King commented on MAPREDUCE-323:
-------------------------------------

I need to
modify {{getMatchingJob(String, String, String[])}} in my comment of 28/Jul/10 03:09 PM as follows:

{noformat}
class PathCow implements Iterator {
    // Iterator methods

    int numberMatches();
        // returns the number of matches you would get if you drove the
        // Iterator to the end.  Might be an approximation.
}

PathCow getMatchingJob
    (String user, String jobnameSubstring, String[] dateStrings, boolean backwards)
        throws IOException
    // the returned iterator has no remove() method
    // any criterion can be null
    // filtering is conjunctive
    // dates are MM/DD/YYYY
    // results are delivered approximately oldest first [or newest first,
    //   if backwards is true]
    // a new file that gets added after the iterator is created may or may
    //   not be delivered by the result
    // dates are approximations of completion time
{noformat}

> Improve the way job history files are managed
> ---------------------------------------------
>
>                 Key: MAPREDUCE-323
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-323
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>          Components: jobtracker
>    Affects Versions: 0.21.0, 0.22.0
>            Reporter: Amar Kamat
>            Assignee: Dick King
>            Priority: Critical
>
> Today all the jobhistory files are dumped in one _job-history_ folder. This can cause problems when there is a need to search the history folder (job-recovery etc). It would be nice if we grouped all the jobs under a _user_ folder. So all the jobs for user _amar_ will go in _history-folder/amar/_. Jobs can be categorized using various features like _jobid, date, jobname_ etc but using _username_ will make the search much more efficient and also will not result in a namespace explosion.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
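The {{getMatchingJob}} contract described in the comment above (null criteria match everything, filters are ANDed, ordering is oldest first unless {{backwards}} is set, and the iterator has no {{remove()}}) can be illustrated with a minimal in-memory sketch. This is not the patch's implementation: {{JobRecord}}, {{MatchIterator}} and {{JobHistoryFilterSketch}} are hypothetical names, the extra {{all}} parameter stands in for the history directory, treating {{dateStrings}} as "any of these dates" is an assumption, and the count here is exact rather than approximate.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Comparator;
import java.util.Iterator;
import java.util.List;

// Hypothetical stand-in for one job history file; not a Hadoop class.
final class JobRecord {
    final String user;
    final String jobName;
    final String date;          // MM/DD/YYYY, approximate completion date
    final long completionTime;  // used only to order results in this sketch

    JobRecord(String user, String jobName, String date, long completionTime) {
        this.user = user;
        this.jobName = jobName;
        this.date = date;
        this.completionTime = completionTime;
    }
}

// Plays the role of PathCow: an Iterator plus a match count.
final class MatchIterator implements Iterator<JobRecord> {
    private final Iterator<JobRecord> delegate;
    private final int total;

    MatchIterator(List<JobRecord> matches) {
        this.delegate = matches.iterator();
        this.total = matches.size();
    }

    @Override public boolean hasNext() { return delegate.hasNext(); }
    @Override public JobRecord next() { return delegate.next(); }
    // remove() is not overridden, so Iterator's default implementation
    // (Java 8+) throws UnsupportedOperationException.

    int numberMatches() { return total; }
}

final class JobHistoryFilterSketch {
    // A null criterion matches every record; non-null criteria are ANDed.
    // ASSUMPTION: a record matches dateStrings if its date equals any entry.
    static MatchIterator getMatchingJob(List<JobRecord> all, String user,
            String jobnameSubstring, String[] dateStrings, boolean backwards) {
        List<JobRecord> matches = new ArrayList<>();
        for (JobRecord r : all) {
            boolean userOk = user == null || user.equals(r.user);
            boolean nameOk = jobnameSubstring == null
                    || r.jobName.contains(jobnameSubstring);
            boolean dateOk = dateStrings == null
                    || Arrays.asList(dateStrings).contains(r.date);
            if (userOk && nameOk && dateOk) {
                matches.add(r);
            }
        }
        Comparator<JobRecord> oldestFirst =
                Comparator.comparingLong(r -> r.completionTime);
        matches.sort(backwards ? oldestFirst.reversed() : oldestFirst);
        return new MatchIterator(matches);
    }

    public static void main(String[] args) {
        List<JobRecord> all = Arrays.asList(
                new JobRecord("amar", "wordcount", "07/28/2010", 1L),
                new JobRecord("amar", "sort", "07/29/2010", 2L),
                new JobRecord("dick", "wordcount-big", "07/30/2010", 3L));
        // Newest first, restricted to user "amar"; other criteria left null.
        MatchIterator it = getMatchingJob(all, "amar", null, null, true);
        System.out.println(it.numberMatches() + " matches");
        while (it.hasNext()) {
            System.out.println(it.next().jobName);
        }
    }
}
```

Because the sketch snapshots its input before sorting, a record added to the list afterwards is never delivered, which is one of the two behaviours the contract allows for files that appear after the iterator is created.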