From: "Devaraj Das (JIRA)"
To: hadoop-dev@lucene.apache.org
Date: Sat, 15 Dec 2007 08:56:43 -0800 (PST)
Subject: [jira] Updated: (HADOOP-2228) Jobs fail because job.xml exists
Message-ID: <16784079.1197737803313.JavaMail.jira@brutus>
In-Reply-To: <27220538.1195494343142.JavaMail.jira@brutus>

     [ https://issues.apache.org/jira/browse/HADOOP-2228?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Devaraj Das updated HADOOP-2228:
--------------------------------

    Resolution: Fixed
        Status: Resolved  (was: Patch Available)

I just committed this. Thanks, Johan!

> Jobs fail because job.xml exists
> --------------------------------
>
>                 Key: HADOOP-2228
>                 URL: https://issues.apache.org/jira/browse/HADOOP-2228
>             Project: Hadoop
>          Issue Type: Bug
>          Components: mapred
>    Affects Versions: 0.14.3
>         Environment: 35 node cluster, linux
>            Reporter: Johan Oskarsson
>            Assignee: Johan Oskarsson
>             Fix For: 0.15.2, 0.16.0
>
>         Attachments: HADOOP-2228-v1.patch
>
>
> org.apache.hadoop.ipc.RemoteException: java.io.IOException: Target /var/storage/4/mapred/local/jobTracker/job_200711081903_3976.xml already exists
>         at org.apache.hadoop.fs.FileUtil.checkDest(FileUtil.java:271)
>         at org.apache.hadoop.fs.FileUtil.copy(FileUtil.java:117)
>         at org.apache.hadoop.fs.FileSystem.copyToLocalFile(FileSystem.java:803)
>         at org.apache.hadoop.fs.FileSystem.copyToLocalFile(FileSystem.java:784)
>         at org.apache.hadoop.mapred.JobInProgress.<init>(JobInProgress.java:134)
>         at org.apache.hadoop.mapred.JobTracker.submitJob(JobTracker.java:1479)
>         at sun.reflect.GeneratedMethodAccessor25.invoke(Unknown Source)
>         at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
>         at java.lang.reflect.Method.invoke(Method.java:597)
>         at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:340)
>         at org.apache.hadoop.ipc.Server$Handler.run(Server.java:566)
>         at org.apache.hadoop.ipc.Client.call(Client.java:470)
>         at org.apache.hadoop.ipc.RPC$Invoker.invoke(RPC.java:165)
>         at $Proxy1.submitJob(Unknown Source)
>         at sun.reflect.GeneratedMethodAccessor26.invoke(Unknown Source)
>         at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
>         at java.lang.reflect.Method.invoke(Method.java:597)
>         at org.apache.hadoop.io.retry.RetryInvocationHandler.invokeMethod(RetryInvocationHandler.java:82)
>         at org.apache.hadoop.io.retry.RetryInvocationHandler.invoke(RetryInvocationHandler.java:59)
>         at $Proxy1.submitJob(Unknown Source)
>         at org.apache.hadoop.mapred.JobClient.submitJob(JobClient.java:397)
>         at org.apache.hadoop.mapred.jobcontrol.Job.submit(Job.java:345)
>         at org.apache.hadoop.mapred.jobcontrol.JobControl.startReadyJobs(JobControl.java:250)
>         at org.apache.hadoop.mapred.jobcontrol.JobControl.run(JobControl.java:282)
>         at java.lang.Thread.run(Thread.java:619)
>
> Perhaps related to HADOOP-1057, HADOOP-891 or to the rpc retry. It seems my job was submitted and actually finished despite the exception. Could it be that the job went in and the rpc retry decided to submit it again anyway?

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
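
For readers following the duplicate-submission hypothesis in the quoted report: if a retried RPC reaches the server after the first call has already registered the job, a second initialization would try to copy job.xml to a path that already exists. One common way to guard against that is to make the submit call idempotent on the job ID. The sketch below is only an illustration of that idea and is not taken from HADOOP-2228-v1.patch; the class IdempotentSubmitSketch, its Status type, and the initCount counter are all hypothetical stand-ins, not Hadoop APIs.

```java
import java.util.HashMap;
import java.util.Map;

/** Hypothetical sketch of an idempotent submit; not Hadoop's JobTracker. */
public class IdempotentSubmitSketch {

    /** Hypothetical stand-in for a job's status object. */
    static final class Status {
        final String jobId;

        Status(String jobId) {
            this.jobId = jobId;
        }
    }

    // Jobs already accepted by this tracker, keyed by job ID.
    private final Map<String, Status> jobs = new HashMap<>();

    // Counts one-time initializations; stands in for copying job.xml locally.
    private int initCount = 0;

    /**
     * Accepts a job once. A repeated call with the same ID (for example,
     * a client-side RPC retry) returns the existing status instead of
     * initializing the job a second time.
     */
    public synchronized Status submitJob(String jobId) {
        Status existing = jobs.get(jobId);
        if (existing != null) {
            return existing; // already submitted: skip re-initialization
        }
        initCount++;
        Status created = new Status(jobId);
        jobs.put(jobId, created);
        return created;
    }

    public synchronized int getInitCount() {
        return initCount;
    }

    public static void main(String[] args) {
        IdempotentSubmitSketch tracker = new IdempotentSubmitSketch();
        Status first = tracker.submitJob("job_200711081903_3976");
        Status retried = tracker.submitJob("job_200711081903_3976");
        System.out.println(first == retried);       // prints "true"
        System.out.println(tracker.getInitCount()); // prints "1"
    }
}
```

With a guard like this, the second call never reaches the copy step, so the "already exists" check would not fire for a retried submission. Whether the committed fix works this way is not stated in this message.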