From: adeelmahmood
To: core-user@hadoop.apache.org
Subject: do all mappers finish before reducer starts
Date: Fri, 29 Jan 2010 13:10:50 -0800 (PST)
Message-ID: <27330927.post@talk.nabble.com>

I just have a conceptual question.
My understanding is that all the mappers have to complete before the reducers can start working. Mappers don't know about each other, so a reducer needs the values for a given key from all the different mappers, and we have to wait until the mappers have collectively given the system all possible values for that key before they can be passed on to the reducer.

But when I ran these jobs, almost every time the reducers started working before the mappers were all done, so it would say something like map 60% reduce 30%. How does this work? Does it find all possible values for a single key from all mappers, pass those on to the reducer, and then work on other keys?

Any help is appreciated.

--
View this message in context: http://old.nabble.com/do-all-mappers-finish-before-reducer-starts-tp27330927p27330927.html
Sent from the Hadoop core-user mailing list archive at Nabble.com.
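
One possible way to picture the question above is a small Python sketch (a toy simulation, not Hadoop code). It assumes, as far as I understand Hadoop's progress reporting, that the reduce percentage counts copying map output as its first third, so "reduce" progress can move while maps are still running even though the reduce function itself waits for every mapper. The names `run_job`, `shuffle_buffer`, and `progress_log` are made up for illustration.

```python
from collections import defaultdict


def run_job(mapper_outputs, reduce_fn):
    """Simulate one reducer: copy each finished mapper's output early,
    but call reduce_fn only after every mapper has finished."""
    shuffle_buffer = defaultdict(list)  # key -> values copied so far
    progress_log = []                   # (map %, reduce %) after each mapper
    total = len(mapper_outputs)

    for done, pairs in enumerate(mapper_outputs, start=1):
        # Copy phase: fetch this mapper's output as soon as it finishes.
        for key, value in pairs:
            shuffle_buffer[key].append(value)
        map_pct = 100 * done // total
        # Assumption: copying accounts for the first third of reduce progress.
        reduce_pct = map_pct // 3
        progress_log.append((map_pct, reduce_pct))

    # All mappers are done, so every key now has its complete list of values.
    results = {key: reduce_fn(key, values)
               for key, values in sorted(shuffle_buffer.items())}
    return results, progress_log


# Three hypothetical mappers emitting word-count style (key, 1) pairs.
mapper_outputs = [
    [("a", 1), ("b", 1)],
    [("a", 1)],
    [("b", 1), ("c", 1)],
]
results, log = run_job(mapper_outputs, lambda key, values: sum(values))
print(results)
print(log)
```

In this toy model the reduce percentage is already above zero after the first mapper finishes, yet no key is reduced until the loop over mappers has ended, which would be consistent with seeing something like map 60% reduce 30% on a real job.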