Return-Path: Delivered-To: apmail-lucene-hadoop-dev-archive@locus.apache.org Received: (qmail 95705 invoked from network); 11 Jan 2007 15:14:49 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 11 Jan 2007 15:14:49 -0000 Received: (qmail 80738 invoked by uid 500); 11 Jan 2007 15:14:55 -0000 Delivered-To: apmail-lucene-hadoop-dev-archive@lucene.apache.org Received: (qmail 80486 invoked by uid 500); 11 Jan 2007 15:14:55 -0000 Mailing-List: contact hadoop-dev-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: hadoop-dev@lucene.apache.org Delivered-To: mailing list hadoop-dev@lucene.apache.org Received: (qmail 80476 invoked by uid 99); 11 Jan 2007 15:14:54 -0000 Received: from herse.apache.org (HELO herse.apache.org) (140.211.11.133) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 11 Jan 2007 07:14:54 -0800 X-ASF-Spam-Status: No, hits=0.0 required=10.0 tests= X-Spam-Check-By: apache.org Received: from [140.211.11.4] (HELO brutus.apache.org) (140.211.11.4) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 11 Jan 2007 07:14:47 -0800 Received: from brutus (localhost [127.0.0.1]) by brutus.apache.org (Postfix) with ESMTP id 84FA67142CF for ; Thu, 11 Jan 2007 07:14:27 -0800 (PST) Message-ID: <27953431.1168528467514.JavaMail.jira@brutus> Date: Thu, 11 Jan 2007 07:14:27 -0800 (PST) From: "Devaraj Das (JIRA)" To: hadoop-dev@lucene.apache.org Subject: [jira] Commented: (HADOOP-875) memory model is not accurate enough for map side sorts In-Reply-To: <18649543.1168447167858.JavaMail.jira@brutus> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-Virus-Checked: Checked by ClamAV on apache.org [ https://issues.apache.org/jira/browse/HADOOP-875?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12463912 ] Devaraj Das commented on HADOOP-875: ------------------------------------ Could you please let me know whether there were multiple spills-to-disk of map outputs for those failed maps? > memory model is not accurate enough for map side sorts > ------------------------------------------------------ > > Key: HADOOP-875 > URL: https://issues.apache.org/jira/browse/HADOOP-875 > Project: Hadoop > Issue Type: Bug > Components: mapred > Affects Versions: 0.10.0 > Reporter: Owen O'Malley > Assigned To: Devaraj Das > > I configured a sort (IdentityMapper) with large compressed values with the map output buffer (io.sort.mb) set to 300mb and the child jvm heap size set to 900mb and some of the maps were running out of memory deterministically. I suspect that some part of the data path is not handling large values well and is consuming large amounts of ram. -- This message is automatically generated by JIRA. - If you think it was sent incorrectly contact one of the administrators: https://issues.apache.org/jira/secure/Administrators.jspa - For more information on JIRA, see: http://www.atlassian.com/software/jira