Return-Path: Delivered-To: apmail-hadoop-core-dev-archive@www.apache.org Received: (qmail 62091 invoked from network); 16 Sep 2008 11:19:38 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 16 Sep 2008 11:19:38 -0000 Received: (qmail 93513 invoked by uid 500); 16 Sep 2008 11:19:32 -0000 Delivered-To: apmail-hadoop-core-dev-archive@hadoop.apache.org Received: (qmail 93483 invoked by uid 500); 16 Sep 2008 11:19:32 -0000 Mailing-List: contact core-dev-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: core-dev@hadoop.apache.org Delivered-To: mailing list core-dev@hadoop.apache.org Received: (qmail 93472 invoked by uid 99); 16 Sep 2008 11:19:32 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 16 Sep 2008 04:19:32 -0700 X-ASF-Spam-Status: No, hits=-2000.0 required=10.0 tests=ALL_TRUSTED X-Spam-Check-By: apache.org Received: from [140.211.11.140] (HELO brutus.apache.org) (140.211.11.140) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 16 Sep 2008 11:18:42 +0000 Received: from brutus (localhost [127.0.0.1]) by brutus.apache.org (Postfix) with ESMTP id 38DE2234C1E2 for ; Tue, 16 Sep 2008 04:18:45 -0700 (PDT) Message-ID: <1490359129.1221563925231.JavaMail.jira@brutus> Date: Tue, 16 Sep 2008 04:18:45 -0700 (PDT) From: "Hemanth Yamijala (JIRA)" To: core-dev@hadoop.apache.org Subject: [jira] Commented: (HADOOP-3930) Decide how to integrate scheduler info into CLI and job tracker web page In-Reply-To: <675325039.1218240644514.JavaMail.jira@brutus> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-Virus-Checked: Checked by ClamAV on apache.org [ https://issues.apache.org/jira/browse/HADOOP-3930?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12631340#action_12631340 ] Hemanth Yamijala commented on HADOOP-3930: ------------------------------------------ JobTracker: - getAllJobs: if the scheduler returns null, it should return an empty JobStatus array. - There's code being repeated in getAllJobs(), getAllJobs(String queue) and jobsToComplete. I think it should be factored out so changes to one of the methods (for e.g. to return a new field) need not be duplicated. JobQueueInfo: - schedulingInfo stored here is a stringified version. I think it should be declared a String and get/set should deal with strings. The caller should basically call with actualObject.toString(). This makes it similar to JobStatus. - In JobStatus, we are using Text.readString whereas in JobQueueInfo, we are using readUTF. I think in similar cases elsewhere we use the UTF versions. Similar comments for the write APIs. JspUtil: - This is including JspHelper which is a class from the NameNode package. I don't think it is a good idea for a MapRed class to depend on this, however I understand this has always been this way. Maybe we should file a new JIRA to fix it. JobSubmissionProtocol: - Include HADOOP JIRA number in the comment related to version field. JobClient: - Usage prints: [-queueinfo [-showJobs] - this is missing a closing ']' - Return code should be set to 0 when the command syntax is found to be correct. - Since scheduler information is set to empty, it can never be null. I think in any case, it should print something like: {code} Queue Name: default Scheduling Information: N/A {code} - The line "Job List for the queue ::" needs a newline. Also, I think it can just read "Job list:" jobqueue_details.jsp: - Needs a backlink to the main jobtracker page - Needs a link to Hadoop web page - like in other pages. jobtracker.jsp: - The scheduling info column is not being split into rows. The HTML code generated does look fine. But still it is not showing up. Can you please check ? CapacityTaskScheduler: - Does not need supportsPriority as a separate field in the SchedulingInfo class. You can pick it up from one of the QueueSchedulingInfo objects. - guaranteedCapacity actual must be split between reduce and map slots. Currently, only the value for the map is being displayed. - Number of reclaimed resources is an internal variable and does not need to be displayed. - Rename getQSI to getQueueSchedulingInfo TestJobQueueInformation: - I think you can use JobClient, instead of directly dealing with JobSubmissionProtocol and having to duplicate the methods for createRPCProxy etc. > Decide how to integrate scheduler info into CLI and job tracker web page > ------------------------------------------------------------------------ > > Key: HADOOP-3930 > URL: https://issues.apache.org/jira/browse/HADOOP-3930 > Project: Hadoop Core > Issue Type: Improvement > Components: mapred > Affects Versions: 0.19.0 > Reporter: Matei Zaharia > Assignee: Sreekanth Ramakrishnan > Attachments: 3930-1.patch, HADOOP-3930-2.patch, HADOOP-3930-3.patch, HADOOP-3930-4.patch, mockup.JPG > > > We need a way for job schedulers such as HADOOP-3445 and HADOOP-3476 to provide info to display on the JobTracker web interface and in the CLI. The main things needed seem to be: > * A way for schedulers to provide info to show in a column on the web UI and in the CLI - something as simple as a single string, or a map for multiple parameters. > * Some sorting order for jobs - maybe a method to sort a list of jobs. > Let's figure out what the best way to do this is and implement it in the existing schedulers. > My first-order proposal at an API: Augment the TaskScheduler with > * public Map getSchedulingInfo(JobInProgress job) -- returns key-value pairs which are displayed in columns on the web UI or the CLI for the list of jobs. > * public Map getSchedulingInfo(String queue) -- returns key-value pairs which are displayed in columns on the web UI or the CLI for the list of queues. > * public Collection getJobs(String queueName) -- returns the list of jobs in a given queue, sorted by a scheduler-specific order (the order it wants to run them in / schedule the next task in / etc). > * public List getQueues(); -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.