Return-Path: X-Original-To: apmail-hadoop-common-user-archive@www.apache.org Delivered-To: apmail-hadoop-common-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 2618AC080 for ; Fri, 11 May 2012 11:09:25 +0000 (UTC) Received: (qmail 36215 invoked by uid 500); 11 May 2012 11:09:21 -0000 Delivered-To: apmail-hadoop-common-user-archive@hadoop.apache.org Received: (qmail 35988 invoked by uid 500); 11 May 2012 11:09:21 -0000 Mailing-List: contact common-user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: common-user@hadoop.apache.org Delivered-To: mailing list common-user@hadoop.apache.org Received: (qmail 35963 invoked by uid 99); 11 May 2012 11:09:20 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 11 May 2012 11:09:20 +0000 X-ASF-Spam-Status: No, hits=-0.7 required=5.0 tests=RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of harsh@cloudera.com designates 209.85.216.176 as permitted sender) Received: from [209.85.216.176] (HELO mail-qc0-f176.google.com) (209.85.216.176) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 11 May 2012 11:09:13 +0000 Received: by qcsc21 with SMTP id c21so2486096qcs.35 for ; Fri, 11 May 2012 04:08:52 -0700 (PDT) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=20120113; h=mime-version:in-reply-to:references:from:date:message-id:subject:to :content-type:x-gm-message-state; bh=PsJd0zAX1+4mQ7qLFdq72+JvMxFrD7Dq4l2E2pwJ0q0=; b=d6guug6BcvHwMHfi4rhTOwQaROVdKQAkoG4Kjy6Sf1B9Twhz8z/sHCtm3RNaiDRD1I Rl3VUcaQBq6aC8dgpxjJmqGDyWPfppbOH6jIo7wIJFqek92ndgnV7niXurcOBwqJvrJ0 qLzttzW9PBfdiw3ek5/s1w6hITnhMwL3a1KkhSASwcUB10ntifOSvCAQYrPdYUwiwT9J hWkGFy9JJ9K3ZgfooBR0VtJcT3pJi4EVd6RiBjWAujqo9eAKDWTvUBf/KmCAvIzEb0PF s1Lvxewo+O4UwpCS9PYK7wdUbpvLdxW9Xkm4j6OHxEdKkM9d3Xg9pcL2+DUCltVPpkJA k72g== Received: by 10.224.117.16 with SMTP id o16mr17586194qaq.8.1336734532209; Fri, 11 May 2012 04:08:52 -0700 (PDT) MIME-Version: 1.0 Received: by 10.229.13.6 with HTTP; Fri, 11 May 2012 04:08:31 -0700 (PDT) In-Reply-To: References: From: Harsh J Date: Fri, 11 May 2012 16:38:31 +0530 Message-ID: Subject: Re: freeze a mapreduce job To: common-user@hadoop.apache.org Content-Type: text/plain; charset=ISO-8859-1 X-Gm-Message-State: ALoCoQkthTkOCXbKrkPyrq26ovaeaNA7SoTn2UJtCkx6f+uQ5yNJsDSOilkB7v7us+FH/NBn5O9f I do not know about the per-host slot control (that is most likely not supported, or not yet anyway - and perhaps feels wrong to do), but the rest of the needs can be doable if you use schedulers and queues/pools. If you use FairScheduler (FS), ensure that this job always goes to a special pool and when you want to freeze the pool simply set the pool's maxMaps and maxReduces to 0. Likewise, control max simultaneous tasks as you wish, to constrict instead of freeze. When you make changes to the FairScheduler configs, you do not need to restart the JT, and you may simply wait a few seconds for FairScheduler to refresh its own configs. More on FS at http://hadoop.apache.org/common/docs/current/fair_scheduler.html If you use CapacityScheduler (CS), then I believe you can do this by again making sure the job goes to a specific queue, and when needed to freeze it, simply set the queue's maximum-capacity to 0 (percentage) or to constrict it, choose a lower, positive percentage value as you need. You can also refresh CS to pick up config changes by refreshing queues via mradmin. More on CS at http://hadoop.apache.org/common/docs/current/capacity_scheduler.html Either approach will not freeze/constrict the job immediately, but should certainly prevent it from progressing. Meaning, their existing running tasks during the time of changes made to scheduler config will continue to run till completion but further tasks scheduling from those jobs shall begin seeing effect of the changes made. P.s. A better solution would be to make your job not take as many days, somehow? :-) On Fri, May 11, 2012 at 4:13 PM, Rita wrote: > I have a rather large map reduce job which takes few days. I was wondering > if its possible for me to freeze the job or make the job less intensive. Is > it possible to reduce the number of slots per host and then I can increase > them overnight? > > > tia > > -- > --- Get your facts first, then you can distort them as you please.-- -- Harsh J