Return-Path: Delivered-To: apmail-hadoop-core-dev-archive@www.apache.org Received: (qmail 94569 invoked from network); 10 Oct 2008 05:34:11 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 10 Oct 2008 05:34:11 -0000 Received: (qmail 89842 invoked by uid 500); 10 Oct 2008 05:34:04 -0000 Delivered-To: apmail-hadoop-core-dev-archive@hadoop.apache.org Received: (qmail 89814 invoked by uid 500); 10 Oct 2008 05:34:04 -0000 Mailing-List: contact core-dev-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: core-dev@hadoop.apache.org Delivered-To: mailing list core-dev@hadoop.apache.org Received: (qmail 89798 invoked by uid 99); 10 Oct 2008 05:34:04 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 09 Oct 2008 22:34:04 -0700 X-ASF-Spam-Status: No, hits=-1999.9 required=10.0 tests=ALL_TRUSTED,DNS_FROM_SECURITYSAGE X-Spam-Check-By: apache.org Received: from [140.211.11.140] (HELO brutus.apache.org) (140.211.11.140) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 10 Oct 2008 05:33:08 +0000 Received: from brutus (localhost [127.0.0.1]) by brutus.apache.org (Postfix) with ESMTP id 48CFF234C21C for ; Thu, 9 Oct 2008 22:33:44 -0700 (PDT) Message-ID: <1730457643.1223616824297.JavaMail.jira@brutus> Date: Thu, 9 Oct 2008 22:33:44 -0700 (PDT) From: "Vinod K V (JIRA)" To: core-dev@hadoop.apache.org Subject: [jira] Commented: (HADOOP-4287) [mapred] jobqueue_details.jsp shows negative count of running and waiting reduces with CapacityTaskScheduler. In-Reply-To: <96460383.1222419944215.JavaMail.jira@brutus> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-Virus-Checked: Checked by ClamAV on apache.org [ https://issues.apache.org/jira/browse/HADOOP-4287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12638484#action_12638484 ] Vinod K V commented on HADOOP-4287: ----------------------------------- The fix is fine. I have a few test-case related comments. - You only have a test for failing maps, you should also have a test for failing reduces. - You should test for positive counts not just once, but through out the life time of the job; so you need to check it in a loop till job completion. Otherwise test-case success/failure would just be a matter of timing. - I think you can rename the test-case to TestJobInProgess, because part of which is what we are really testing here. - Minor : You shouldn't catch and ignore any exception thrown by RunningJob.runJob(). If something abnormal happens, let the test-case fail. > [mapred] jobqueue_details.jsp shows negative count of running and waiting reduces with CapacityTaskScheduler. > ------------------------------------------------------------------------------------------------------------- > > Key: HADOOP-4287 > URL: https://issues.apache.org/jira/browse/HADOOP-4287 > Project: Hadoop Core > Issue Type: Bug > Components: contrib/capacity-sched > Reporter: Vinod K V > Assignee: Sreekanth Ramakrishnan > Priority: Blocker > Attachments: HADOOP-4287-1.patch, HADOOP-4287-2.patch > > > This I observed while running a job that always fails because of reduce failures. Need to investigate this. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.