Return-Path: Delivered-To: apmail-hadoop-general-archive@minotaur.apache.org Received: (qmail 6685 invoked from network); 7 Dec 2009 03:37:56 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 7 Dec 2009 03:37:56 -0000 Received: (qmail 25131 invoked by uid 500); 7 Dec 2009 03:37:55 -0000 Delivered-To: apmail-hadoop-general-archive@hadoop.apache.org Received: (qmail 24948 invoked by uid 500); 7 Dec 2009 03:37:53 -0000 Mailing-List: contact general-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: general@hadoop.apache.org Delivered-To: mailing list general@hadoop.apache.org Received: (qmail 24938 invoked by uid 99); 7 Dec 2009 03:37:53 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 07 Dec 2009 03:37:53 +0000 X-ASF-Spam-Status: No, hits=4.2 required=10.0 tests=HTML_MESSAGE,NO_RDNS_DOTCOM_HELO,SPF_NEUTRAL X-Spam-Check-By: apache.org Received-SPF: neutral (nike.apache.org: local policy) Received: from [69.147.107.20] (HELO mrout1-b.corp.re1.yahoo.com) (69.147.107.20) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 07 Dec 2009 03:37:41 +0000 Received: from sp1-ex07cas03.ds.corp.yahoo.com (sp1-ex07cas03.ds.corp.yahoo.com [216.252.116.151]) by mrout1-b.corp.re1.yahoo.com (8.13.8/8.13.8/y.out) with ESMTP id nB73b5pv022358 for ; Sun, 6 Dec 2009 19:37:05 -0800 (PST) DomainKey-Signature: a=rsa-sha1; s=serpent; d=yahoo-inc.com; c=nofws; q=dns; h=received:from:to:date:subject:thread-topic:thread-index: message-id:in-reply-to:accept-language:content-language: x-ms-has-attach:x-ms-tnef-correlator:acceptlanguage:content-type:mime-version; b=tjjyzcNoPwFUMd1tsUS1WpKjJ7N3vrmkVONvwgdC7macGb203KxUYW9o3Rr/jJrj Received: from SP1-EX07VS01.ds.corp.yahoo.com ([216.252.116.139]) by sp1-ex07cas03.ds.corp.yahoo.com ([216.252.116.151]) with mapi; Sun, 6 Dec 2009 19:37:04 -0800 From: Rekha Joshi To: "general@hadoop.apache.org" Date: Sun, 6 Dec 2009 19:36:54 -0800 Subject: Re: How to pause a job? Thread-Topic: How to pause a job? Thread-Index: Acp2Z+nE1DecgjAsTBaVOz06zpmPQQAhptKt Message-ID: In-Reply-To: <5a921af20912060332k9ddacf1u48d14d0449a68444@mail.gmail.com> Accept-Language: en-US Content-Language: en X-MS-Has-Attach: X-MS-TNEF-Correlator: acceptlanguage: en-US Content-Type: multipart/alternative; boundary="_000_C742762E52E1rekhajosyahooinccom_" MIME-Version: 1.0 X-Virus-Checked: Checked by ClamAV on apache.org --_000_C742762E52E1rekhajosyahooinccom_ Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: quoted-printable If you have workflow engine around your job, that can take care of restarti= ng the jobs. For direct hadoop execution, please refer MAPREDUCE-227/ MAPR= EDUCE-828 Not sure if it would work, you might try (effectively) blocking this job ru= n by -set-priority on this job to very low and have other VHP jobs running.= . Thanks! On 12/6/09 5:02 PM, "xiao yang" wrote: Hi, all I'm running a job on a 10-nodes cluster. Now I want to add another node. I have to reconfigure the dfs, and restart it, but I don't want to stop the running job. It run for 1 week already. What should I do? Is there a way to pause a job, and resume it after dfs restart. Thanks! Xiao --_000_C742762E52E1rekhajosyahooinccom_--