Return-Path: X-Original-To: apmail-hadoop-mapreduce-user-archive@minotaur.apache.org Delivered-To: apmail-hadoop-mapreduce-user-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id B59697F43 for ; Sat, 22 Oct 2011 03:57:38 +0000 (UTC) Received: (qmail 46773 invoked by uid 500); 22 Oct 2011 03:57:37 -0000 Delivered-To: apmail-hadoop-mapreduce-user-archive@hadoop.apache.org Received: (qmail 46727 invoked by uid 500); 22 Oct 2011 03:57:37 -0000 Mailing-List: contact mapreduce-user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: mapreduce-user@hadoop.apache.org Delivered-To: mailing list mapreduce-user@hadoop.apache.org Received: (qmail 46713 invoked by uid 99); 22 Oct 2011 03:57:35 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 22 Oct 2011 03:57:35 +0000 X-ASF-Spam-Status: No, hits=-0.7 required=5.0 tests=RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of harsh@cloudera.com designates 209.85.160.176 as permitted sender) Received: from [209.85.160.176] (HELO mail-gy0-f176.google.com) (209.85.160.176) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 22 Oct 2011 03:57:28 +0000 Received: by gyh3 with SMTP id 3so6057610gyh.35 for ; Fri, 21 Oct 2011 20:57:07 -0700 (PDT) Received: by 10.68.75.170 with SMTP id d10mr4726542pbw.49.1319255826987; Fri, 21 Oct 2011 20:57:06 -0700 (PDT) Received: from [117.231.26.100] ([117.231.26.100]) by mx.google.com with ESMTPS id d8sm5513918pbb.6.2011.10.21.20.57.04 (version=TLSv1/SSLv3 cipher=OTHER); Fri, 21 Oct 2011 20:57:06 -0700 (PDT) Content-Type: text/plain; charset=us-ascii Mime-Version: 1.0 (Apple Message framework v1084) Subject: Re: Streaming jar creates only 1 reducer From: Harsh J In-Reply-To: <13F54E3A-149F-4FED-91EF-6DA1ED1A8F40@gmail.com> Date: Sat, 22 Oct 2011 09:27:04 +0530 Content-Transfer-Encoding: quoted-printable Message-Id: <08D2EC68-F4F1-4E75-8BA4-5E281ACC14EA@cloudera.com> References: <173778479550729024@unknownmsgid> <13F54E3A-149F-4FED-91EF-6DA1ED1A8F40@gmail.com> To: mapreduce-user@hadoop.apache.org X-Mailer: Apple Mail (2.1084) Mapred, You need to pass -Dmapred.reduce.tasks=3DN along. Reducers are a per-job = configurable number, unlike mappers whose numbers can be determined = based on inputs. P.s. Please do not cross post questions to multiple lists. On 22-Oct-2011, at 4:05 AM, Mapred Learn wrote: > Do you know what parameters from conf files ? >=20 > Thanks, >=20 > Sent from my iPhone >=20 > On Oct 21, 2011, at 3:32 PM, Nick Jones wrote: >=20 >> FWIW, I usually specify the number of reducers in both streaming and >> against the Java API. The "default" is what's read from your config >> files on the submitting node. >>=20 >> Nick Jones >>=20 >> On Oct 21, 2011, at 5:00 PM, Mapred Learn = wrote: >>=20 >>> Hi, >>> Does streaming jar create 1 reducer by default ? We have reduce = tasks per task tracker configured to be more than 1 but my job has about = 150 mappers and only 1 reducer: >>>=20 >>> reducer.py basically just reads the line and prints it. >>>=20 >>> Why doesn't streaming.jar invokes multiple reducers for this case ? >>>=20 >>> Thanks, >>> -JJ >>>=20 >>>=20