Return-Path: X-Original-To: apmail-hadoop-user-archive@minotaur.apache.org Delivered-To: apmail-hadoop-user-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 620BDD5C5 for ; Wed, 29 Aug 2012 16:02:17 +0000 (UTC) Received: (qmail 99477 invoked by uid 500); 29 Aug 2012 16:02:12 -0000 Delivered-To: apmail-hadoop-user-archive@hadoop.apache.org Received: (qmail 99373 invoked by uid 500); 29 Aug 2012 16:02:12 -0000 Mailing-List: contact user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hadoop.apache.org Delivered-To: mailing list user@hadoop.apache.org Received: (qmail 99365 invoked by uid 99); 29 Aug 2012 16:02:12 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 29 Aug 2012 16:02:12 +0000 X-ASF-Spam-Status: No, hits=-0.7 required=5.0 tests=FSL_RCVD_USER,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of harsh@cloudera.com designates 209.85.214.176 as permitted sender) Received: from [209.85.214.176] (HELO mail-ob0-f176.google.com) (209.85.214.176) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 29 Aug 2012 16:02:07 +0000 Received: by obbtb18 with SMTP id tb18so1654407obb.35 for ; Wed, 29 Aug 2012 09:01:46 -0700 (PDT) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=20120113; h=mime-version:in-reply-to:references:from:date:message-id:subject:to :content-type:x-gm-message-state; bh=DAbegEFY7LnNJaa7VoJ+5MNb5bXpWzi9lTt1qTR+Yo0=; b=PUJPCsxPIxKi6uByi+9aqZMBxmVNw3XrxQlZAFFEiwarql+g3YWA+MtrTgNQHaCoro /O905RH732mPJH0xgkRdshbspFowX0JO3IhnSw8mttC/Alfbtp98ywn8A9x447diE06Z ePFg6f5bRCPXwztPaI/ltGpeAu8ncTIqMbDhDxTQUc/ylTHjCaKTNCttI/JruehMBNqS e3JRNgpRlF1lz8lgb+kUprB9bho2CaKd0Xfi3VJU12N7tUs6kyFUKD5XBGQogH91+OHt 7hFmNjqLsNxt65QlP3J6kNrBLQ5pXZvOF5+YQ1gMmEyPa163qyJAKocH4cNjv9e/5Qho vsIw== Received: by 10.60.11.136 with SMTP id q8mr1660636oeb.132.1346256106759; Wed, 29 Aug 2012 09:01:46 -0700 (PDT) MIME-Version: 1.0 Received: by 10.76.11.168 with HTTP; Wed, 29 Aug 2012 09:01:26 -0700 (PDT) In-Reply-To: <503E1BEB.7060003@bnl.gov> References: <53903ED9-146F-40FF-BD3B-183E1A95A3AB@yahoo.com> <503E1BEB.7060003@bnl.gov> From: Harsh J Date: Wed, 29 Aug 2012 21:31:26 +0530 Message-ID: Subject: Re: Delays in worker node jobs To: user@hadoop.apache.org Content-Type: text/plain; charset=ISO-8859-1 X-Gm-Message-State: ALoCoQnKhMbXU8qDNy+7T0wpTGXP6MQALq7LIgBKlxehFiYjlD659HM8ghAueNkP9JhOoi8RzHvn X-Virus-Checked: Checked by ClamAV on apache.org Hey Terry, Can you look at your JobTracker logs, grep it for this worker node's hostname and see the task assignment timestamps vs. when the task began in real (from the TaskTracker log, grepping for the same attempt ID)? On Wed, Aug 29, 2012 at 7:10 PM, Terry Healy wrote: > Running 1.0.2, in this case on Linux. > > I was watching the processes / loads on one TaskTracker instance and > noticed that it completed it's first 8 map tasks and reported 8 free > slots (the max for this system). It then waited doing nothing for more > than 30 seconds before the next "batch" of work came in and started running. > > Likewise it also has relatively long periods with all 8 cores running at > or near idle. There are no jobs failing or obvious errors in the > TaskTracker log. > > What could be causing this? > > Should I increase the number of map jobs to greater than number of cores > to try and keep it busier? > > -Terry -- Harsh J