Return-Path: Delivered-To: apmail-lucene-hadoop-dev-archive@locus.apache.org Received: (qmail 37622 invoked from network); 2 Oct 2006 19:14:32 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (209.237.227.199) by minotaur.apache.org with SMTP; 2 Oct 2006 19:14:32 -0000 Received: (qmail 70807 invoked by uid 500); 2 Oct 2006 19:14:31 -0000 Delivered-To: apmail-lucene-hadoop-dev-archive@lucene.apache.org Received: (qmail 70635 invoked by uid 500); 2 Oct 2006 19:14:31 -0000 Mailing-List: contact hadoop-dev-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: hadoop-dev@lucene.apache.org Delivered-To: mailing list hadoop-dev@lucene.apache.org Received: (qmail 70625 invoked by uid 99); 2 Oct 2006 19:14:31 -0000 Received: from idunn.apache.osuosl.org (HELO idunn.apache.osuosl.org) (140.211.166.84) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 02 Oct 2006 12:14:30 -0700 X-ASF-Spam-Status: No, hits=0.0 required=5.0 tests= Received: from [209.237.227.198] ([209.237.227.198:46113] helo=brutus.apache.org) by idunn.apache.osuosl.org (ecelerity 2.1.1.8 r(12930)) with ESMTP id C9/B5-24395-41561254 for ; Mon, 02 Oct 2006 12:14:29 -0700 Received: from brutus (localhost [127.0.0.1]) by brutus.apache.org (Postfix) with ESMTP id C4F72714312 for ; Mon, 2 Oct 2006 12:14:20 -0700 (PDT) Message-ID: <16049312.1159816460804.JavaMail.root@brutus> Date: Mon, 2 Oct 2006 12:14:20 -0700 (PDT) From: "Doug Cutting (JIRA)" To: hadoop-dev@lucene.apache.org Subject: [jira] Resolved: (HADOOP-570) Map tasks may fail due to out of memory, if the number of reducers are moderately big In-Reply-To: <21081187.1159810699740.JavaMail.root@brutus> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-Spam-Rating: minotaur.apache.org 1.6.2 0/1000/N [ http://issues.apache.org/jira/browse/HADOOP-570?page=all ] Doug Cutting resolved HADOOP-570. --------------------------------- Resolution: Duplicate This is a duplicate of HADOOP-331. > Map tasks may fail due to out of memory, if the number of reducers are moderately big > ------------------------------------------------------------------------------------- > > Key: HADOOP-570 > URL: http://issues.apache.org/jira/browse/HADOOP-570 > Project: Hadoop > Issue Type: Bug > Components: mapred > Reporter: Runping Qi > > Map tasks may fail due to out of memory, if the number of reducers are moderately big. > In my case, I set child task heap size to 1GB, turned on compression for the mapoutput files. > The average size of input records is about 30K (I don't know the variation though). > A lot of map tasks failed due to out of memory when the number of reducers was at 400 and higher. > The number of reducers can be somewhat higher (as high as 800) if the compression for the mapoutput files was off). > This problem will impose a hard limit on the scalability of map/reduce clusters. > One possible solution to this problem is to let the mapper to write out single map output file, > and then to perform sort/partition as a separate phrase. > his will also make it unnecessary for the reducers to perform sort on individual portions from mappers. > Rather, the reducers should just perform merge operations on the map output files directly. > This may even allow the possibility of dynamically collect some statistics of the map outputs and > use the stats to drive the partition on the mapper side, and obtain the optimal merge plan on the reducer side! > -- This message is automatically generated by JIRA. - If you think it was sent incorrectly contact one of the administrators: http://issues.apache.org/jira/secure/Administrators.jspa - For more information on JIRA, see: http://www.atlassian.com/software/jira