Return-Path: X-Original-To: apmail-hbase-user-archive@www.apache.org Delivered-To: apmail-hbase-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 56FB8D2ED for ; Thu, 11 Oct 2012 20:51:14 +0000 (UTC) Received: (qmail 29104 invoked by uid 500); 11 Oct 2012 20:51:12 -0000 Delivered-To: apmail-hbase-user-archive@hbase.apache.org Received: (qmail 29059 invoked by uid 500); 11 Oct 2012 20:51:12 -0000 Mailing-List: contact user-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hbase.apache.org Delivered-To: mailing list user@hbase.apache.org Received: (qmail 29051 invoked by uid 99); 11 Oct 2012 20:51:12 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 11 Oct 2012 20:51:12 +0000 X-ASF-Spam-Status: No, hits=1.5 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of jbishop.rwc@gmail.com designates 209.85.220.169 as permitted sender) Received: from [209.85.220.169] (HELO mail-vc0-f169.google.com) (209.85.220.169) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 11 Oct 2012 20:51:05 +0000 Received: by mail-vc0-f169.google.com with SMTP id fl17so3207876vcb.14 for ; Thu, 11 Oct 2012 13:50:44 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; bh=WRvuzXzhE/qUUGkLuv+mExBLXILizJkifV8AM+wUYmE=; b=k0oc01j2sPHT1discXoe5YQPhf/O0wvcpwgzVhzduCZxqdgDa3Xe4v1uOjfmAtLqo0 r0dSZNnQ2zOyPIGw/DZWQYWCYg8ISrNizX5ryY2PbG4aPH2KLw2WOBgDOwlmixvv4MQG p4fGfzPm0vAGrr+/ZnSw3c74kSA15Uxl3HiWkAKAIwYAv9CVWi749Cw22vjjsfFTr1YG TmT/APYdH9ZghZd1yeOA/uJ+ba3rrMV81ZRfIFApvASgpURIuCvAwEardVsJMNPNVHKO KfDBBkKxoSCqV/sF+RkHA6LD4MmqCv+FzapYYAXJVvjO3qcLP6TwmFwc2vz7T2yKu+YE /zlA== MIME-Version: 1.0 Received: by 10.52.155.199 with SMTP id vy7mr1027367vdb.54.1349988644391; Thu, 11 Oct 2012 13:50:44 -0700 (PDT) Received: by 10.58.231.2 with HTTP; Thu, 11 Oct 2012 13:50:44 -0700 (PDT) In-Reply-To: References: Date: Thu, 11 Oct 2012 13:50:44 -0700 Message-ID: Subject: Re: Force the number of map tasks in MR? From: Jonathan Bishop To: user@hbase.apache.org Content-Type: multipart/alternative; boundary=bcaec53aef50998b3704cbceb962 --bcaec53aef50998b3704cbceb962 Content-Type: text/plain; charset=ISO-8859-1 JM, The number of map tasks will be limited by the number of input splits available. Assuming you are reading files, that is. Also, you need to reboot your cluster for those setting to take effect. Hope this helps, Jon Bishop On Thu, Oct 11, 2012 at 1:44 PM, Jean-Marc Spaggiari < jean-marc@spaggiari.org> wrote: > But this is the limit per tasktracker, right? > > And I have 6 nodes, so 6 tasktrackers, which mean it should go up to 12 > tasks? > > Take a look at 2.7 here: http://wiki.apache.org/hadoop/FAQ > > I just tried with the setting below (changing 2 by 6) but I'm getting > the same result. > > JM > > 2012/10/11 Kevin O'dell : > > J-M, > > > > It should be in the mapred-site.xml the values > > are mapred.tasktracker.map.tasks.maximum and > > mapred.tasktracker.reduce.tasks.maximum. This is the default in CDH4 > > > > > > mapreduce.tasktracker.map.tasks.maximum > > 2 > > The maximum number of map tasks that will be run > > simultaneously by a task tracker. > > > > > > > > > > mapreduce.tasktracker.reduce.tasks.maximum > > 2 > > The maximum number of reduce tasks that will be run > > simultaneously by a task tracker. > > > > > > > > This would explain why they are going 2 by 2. Does this help? > > > > On Thu, Oct 11, 2012 at 4:25 PM, Jean-Marc Spaggiari < > > jean-marc@spaggiari.org> wrote: > > > >> I don't know. I did not touched that. Where can I found this > information? > >> > >> 2012/10/11 Kevin O'dell : > >> > What are you max tasks set to? > >> > > >> > On Thu, Oct 11, 2012 at 3:59 PM, Jean-Marc Spaggiari < > >> > jean-marc@spaggiari.org> wrote: > >> > > >> >> Hi, > >> >> > >> >> Is there a way to force the number of map tasks in a MR? > >> >> > >> >> I have a 25 regions table splitted over 6 nodes. But the MR is > running > >> >> the tasks only 2 by 2. > >> >> > >> >> Is there a way to force it to run one task on each regionserver > >> >> serving at least one region? Why is the MR waiting for 2 taskes to > >> >> complete before sending to the other tasks? > >> >> > >> >> I'm starting the MR with a caching of 100. > >> >> > >> >> I tried mapred.map.tasks and speculative=false with no success. > >> >> > >> >> Any idea how I can increase it this number of tasks? > >> >> > >> >> Thanks, > >> >> > >> >> JM > >> >> > >> > > >> > > >> > > >> > -- > >> > Kevin O'Dell > >> > Customer Operations Engineer, Cloudera > >> > > > > > > > > -- > > Kevin O'Dell > > Customer Operations Engineer, Cloudera > --bcaec53aef50998b3704cbceb962--