From: "Hadoop QA (JIRA)"
To: hadoop-dev@lucene.apache.org
Date: Tue, 12 Dec 2006 17:51:23 -0800 (PST)
Subject: [jira] Commented: (HADOOP-794) JobTracker crashes with ArithmeticException
Message-ID: <1783840.1165974683092.JavaMail.jira@brutus>
In-Reply-To: <20588755.1165516221125.JavaMail.jira@brutus>

    [ http://issues.apache.org/jira/browse/HADOOP-794?page=comments#action_12457992 ]

Hadoop QA commented on HADOOP-794:
----------------------------------

+1, http://issues.apache.org/jira/secure/attachment/12347050/div-zero.patch applied and
successfully tested against trunk revision r486399

> JobTracker crashes with ArithmeticException
> -------------------------------------------
>
>                 Key: HADOOP-794
>                 URL: http://issues.apache.org/jira/browse/HADOOP-794
>             Project: Hadoop
>          Issue Type: Bug
>          Components: mapred
>    Affects Versions: 0.9.1
>            Reporter: Nigel Daley
>         Assigned To: Owen O'Malley
>             Fix For: 0.10.0
>
>         Attachments: div-zero.patch
>
>
> The sort benchmark on 20 nodes failed for me with an ArithmeticException. Sort.java calculates the number of maps and reduces dynamically based on the cluster size. I'm guessing that at the time it did the calculation, the jobtracker had heard from 0 task trackers (which seems odd, since I wait 1 minute after starting the jobtracker before starting Sort).
> From the job output:
> Running on 0 nodes to sort from /sortBenchmark20/input into /sortBenchmark20/output with 0 reduces.
> From the JobTracker log:
> 2006-12-07 07:10:06,709 ERROR org.apache.hadoop.mapred.JobTracker: Job initialization failed:
> java.lang.ArithmeticException: / by zero
>     at org.apache.hadoop.mapred.InputFormatBase.getSplits(InputFormatBase.java:130)
>     at org.apache.hadoop.mapred.JobInProgress.initTasks(JobInProgress.java:125)
>     at org.apache.hadoop.mapred.JobTracker$JobInitThread.run(JobTracker.java:326)
>     at java.lang.Thread.run(Thread.java:595)
> The jobtracker should check for illegal inputs like this and not fall over.
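
For readers following the report: the stack trace points at an integer division in getSplits, and the job output shows the job was submitted with counts derived from 0 nodes. The sketch below only illustrates the kind of guard the report asks for; it is not the contents of div-zero.patch. The class SplitGuard and the method goalSize are hypothetical names, and the choice to treat a requested count of 0 as 1 is an assumption made for this example.

```java
public class SplitGuard {

    /**
     * Hypothetical helper: target number of bytes per split.
     *
     * A requested split count of 0 (for example, one derived from a
     * cluster that has reported 0 task trackers) is treated as 1, so
     * the integer division below cannot throw ArithmeticException.
     * Negative counts are rejected as illegal input.
     */
    public static long goalSize(long totalSize, int numSplits) {
        if (numSplits < 0) {
            throw new IllegalArgumentException(
                "numSplits must be >= 0, got " + numSplits);
        }
        int safeSplits = (numSplits == 0) ? 1 : numSplits;
        return totalSize / safeSplits;
    }

    public static void main(String[] args) {
        // Zero requested splits falls back to a single split.
        System.out.println(goalSize(1000L, 0)); // prints 1000
        // A normal request divides the input evenly.
        System.out.println(goalSize(1000L, 4)); // prints 250
    }
}
```

Falling back to a single split keeps job initialization alive. Rejecting the job with a clear error message would be an equally reasonable design; which one the project prefers is not stated in this message.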