Return-Path: X-Original-To: apmail-hadoop-mapreduce-user-archive@minotaur.apache.org Delivered-To: apmail-hadoop-mapreduce-user-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 0A049D875 for ; Fri, 9 Nov 2012 20:25:05 +0000 (UTC) Received: (qmail 36426 invoked by uid 500); 9 Nov 2012 20:25:00 -0000 Delivered-To: apmail-hadoop-mapreduce-user-archive@hadoop.apache.org Received: (qmail 36319 invoked by uid 500); 9 Nov 2012 20:25:00 -0000 Mailing-List: contact user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hadoop.apache.org Delivered-To: mailing list user@hadoop.apache.org Received: (qmail 36312 invoked by uid 99); 9 Nov 2012 20:25:00 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 09 Nov 2012 20:25:00 +0000 X-ASF-Spam-Status: No, hits=-0.7 required=5.0 tests=RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of harsh@cloudera.com designates 209.85.223.176 as permitted sender) Received: from [209.85.223.176] (HELO mail-ie0-f176.google.com) (209.85.223.176) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 09 Nov 2012 20:24:54 +0000 Received: by mail-ie0-f176.google.com with SMTP id k11so7405944iea.35 for ; Fri, 09 Nov 2012 12:24:34 -0800 (PST) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=20120113; h=mime-version:in-reply-to:references:from:date:message-id:subject:to :content-type:content-transfer-encoding:x-gm-message-state; bh=Dth9gI6aUb/LF67YcijO2Jcjdwj45RAsor1QXLuWFWU=; b=M3HQUi6i2BaRMn3gGiWehk3s2aZ6ak775Xg8mp7xrhOysMZznK8WVQRtuTuxCqpMwS id7ZiCAfTKghLrHR+4HMw+4Nzm9mFNueBylXdBrwmnJdwEYUsG/4Ix9Sr4kpH6ceF/Dl Un4RVYvL/M36XUGEW648SWNGNLXMNVwSXiIevbwyUsZEgORvwfUMgtmH0DpT7GGN9c+8 oKzABXgwetudvqB/32asxOFdvj6bH68ohJjlP08d3glXCiFwk52QLPpcb5mnlsc8LvTl ROtiMdg/oGS7iDjz4Of/szezldWIjA+jBXvPSDo72DqIIDBuB55xaxxn0ZD7Y+Z5ACxN xgbA== Received: by 10.50.182.166 with SMTP id ef6mr2440700igc.65.1352492674434; Fri, 09 Nov 2012 12:24:34 -0800 (PST) MIME-Version: 1.0 Received: by 10.64.27.8 with HTTP; Fri, 9 Nov 2012 12:24:14 -0800 (PST) In-Reply-To: <19AD42E3F64F0F468A305399D0DF39D9335DAC6F75@winops07.win.compete.com> References: <19AD42E3F64F0F468A305399D0DF39D9335DAC6F75@winops07.win.compete.com> From: Harsh J Date: Sat, 10 Nov 2012 01:54:14 +0530 Message-ID: Subject: Re: mapreduce.job.end-notification settings To: user@hadoop.apache.org Content-Type: text/plain; charset=windows-1252 Content-Transfer-Encoding: quoted-printable X-Gm-Message-State: ALoCoQk5+94NfFlzikSKaB23zaWZH2LmzJpyY9bNXxRfGL21BfqXyFRIYiGHH/bjnErmEMe6Iise X-Virus-Checked: Checked by ClamAV on apache.org Hey Steven, Nice catch - this may be a bug we should fix. There are 4 relevant properties in MR2: Attempts: "mapreduce.job.end-notification.retry.attempts" and "mapreduce.job.end-notification.max.attempts". Retries: "mapreduce.job.end-notification.retry.interval" and "mapreduce.job.end-notification.max.retry.interval". I believe the configs of the latter of both of the above classifications were meant to be added in as replacement names, but the property names got added in wrong (as the former/older named ones) in the XML. Could you file a JIRA for this as well as there being an issue with the word "seconds" in the description of retries? The code in MR2's JobEndNotifier seems to expect seconds but uses it directly in Thread.sleep(=85) without making it milliseconds, which may be a bug we need to fix as well, perhaps in a same issue as the configs ones. On Fri, Nov 9, 2012 at 11:21 PM, Steven Willis wrote: > I was looking at: > > http://hadoop.apache.org/docs/current/hadoop-mapreduce-client/hadoop-mapr= educe-client-core/mapred-default.xml > > And I noticed that there are some duplicate properties with different val= ues and different descriptions: > > mapreduce.job.end-notification.retry.attempts 0 Indicates how man= y times hadoop should attempt to contact the notification URL > mapreduce.job.end-notification.retry.interval 30000 Indicates time in= milliseconds between notification URL retry calls > ... > mapreduce.job.end-notification.retry.attempts 5 The number of tim= es the submitter of the job wants to retry job end notification if it fails= . This is capped by mapreduce.job.end-notification.max.attempts > mapreduce.job.end-notification.retry.interval 1 The number of sec= onds the submitter of the job wants to wait before job end notification is = retried if it fails. This is capped by mapreduce.job.end-notification.max.r= etry.interval > > Any idea which values and descriptions are correct? > > -Steve --=20 Harsh J