From: Andrew Purtell
Reply-To: apurtell@apache.org
Subject: Re: Ec2 instability
To: hbase-user@hadoop.apache.org, core-user@hadoop.apache.org
Date: Sat, 18 Apr 2009 06:39:41 -0700 (PDT)
Message-ID: <539205.6484.qm@web65507.mail.ac4.yahoo.com>
In-Reply-To: <384813770904170944l1b3843c8v6b32970b8391831b@mail.gmail.com>

Hi,

This is an OS-level exception. Your node is out of memory even to fork a process.

How many instances do you currently have allocated? Have you increased the number of instances over time to try and spread the load of your application around? How many concurrent mapper and/or reducer processes do you execute on a node? Can you characterize the memory usage of your mappers and reducers? Are you running other processes external to hadoop/hbase which consume a lot of memory? Are you running Ganglia or similar to track and characterize resource usage over time?

You may find you are trying to solve a 100 node problem with 10.

   - Andy

> From: Rakhi Khatwani
> Subject: Re: Ec2 instability
> To: hbase-user@hadoop.apache.org, core-user@hadoop.apache.org
> Date: Friday, April 17, 2009, 9:44 AM
>
> Hi,
> this is the exception i have been getting @ the mapreduce
>
> java.io.IOException: Cannot run program "bash": java.io.IOException: error=12, Cannot allocate memory
>     at java.lang.ProcessBuilder.start(ProcessBuilder.java:459)
>     at org.apache.hadoop.util.Shell.runCommand(Shell.java:149)
>     at org.apache.hadoop.util.Shell.run(Shell.java:134)
>     at org.apache.hadoop.fs.DF.getAvailable(DF.java:73)
>     at org.apache.hadoop.fs.LocalDirAllocator$AllocatorPerContext.getLocalPathForWrite(LocalDirAllocator.java:321)
>     at org.apache.hadoop.fs.LocalDirAllocator.getLocalPathForWrite(LocalDirAllocator.java:124)
>     at org.apache.hadoop.mapred.MapOutputFile.getOutputFileForWrite(MapOutputFile.java:61)
>     at org.apache.hadoop.mapred.MapTask$MapOutputBuffer.mergeParts(MapTask.java:1199)
>     at org.apache.hadoop.mapred.MapTask$MapOutputBuffer.flush(MapTask.java:857)
>     at org.apache.hadoop.mapred.MapTask.run(MapTask.java:333)
>     at org.apache.hadoop.mapred.Child.main(Child.java:155)
> Caused by: java.io.IOException: java.io.IOException: error=12, Cannot allocate memory
>     at java.lang.UNIXProcess.<init>(UNIXProcess.java:148)
>     at java.lang.ProcessImpl.start(ProcessImpl.java:65)
>     at java.lang.ProcessBuilder.start(ProcessBuilder.java:452)
>     ... 10 more
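
For anyone reading this thread in the archive: the questions above about concurrent task counts and per-task memory can be turned into a rough capacity check. The following is only a minimal sketch, not anything from Hadoop itself. The class name ForkMemoryCheck, both helper methods, and every number in main are hypothetical and chosen for illustration. It first recognizes the error=12 (ENOMEM on Linux) failure from the quoted trace by walking the exception's cause chain, then compares a worst-case memory demand (task slots times child heap, plus everything else on the node) against the node's RAM.

```java
import java.io.IOException;

// Hypothetical helper, not part of Hadoop or HBase.
final class ForkMemoryCheck {

    /**
     * Returns true if any exception in the cause chain mentions
     * "error=12", the text that appears in the quoted stack trace
     * when the OS refuses to allocate memory for a new process.
     */
    static boolean isForkOutOfMemory(Throwable t) {
        for (Throwable cur = t; cur != null; cur = cur.getCause()) {
            String msg = cur.getMessage();
            if (msg != null && msg.contains("error=12")) {
                return true;
            }
        }
        return false;
    }

    /**
     * Rough worst-case memory demand in MB: every map and reduce slot
     * busy at full child heap, plus a lump sum for daemons and any
     * other processes on the node. It ignores JVM overhead beyond the
     * heap, so real demand will be somewhat higher.
     */
    static long worstCaseDemandMb(int mapSlots, int reduceSlots,
                                  long childHeapMb, long otherProcessesMb) {
        return (long) (mapSlots + reduceSlots) * childHeapMb + otherProcessesMb;
    }

    public static void main(String[] args) {
        // Rebuild the shape of the quoted exception: outer wrapper, inner cause.
        IOException quoted = new IOException(
                "Cannot run program \"bash\"",
                new IOException("error=12, Cannot allocate memory"));
        System.out.println("fork failed for lack of memory: "
                + isForkOutOfMemory(quoted));

        // All values below are assumptions for illustration only.
        long nodeRamMb = 1700;        // assumed physical RAM on one node
        int mapSlots = 2;             // assumed concurrent mappers per node
        int reduceSlots = 2;          // assumed concurrent reducers per node
        long childHeapMb = 512;       // assumed max heap per task JVM
        long otherProcessesMb = 1000; // assumed datanode, tasktracker, etc.

        long demand = worstCaseDemandMb(mapSlots, reduceSlots,
                                        childHeapMb, otherProcessesMb);
        System.out.println("worst-case demand (MB): " + demand);
        System.out.println("node RAM (MB): " + nodeRamMb);
        System.out.println("oversubscribed: " + (demand > nodeRamMb));
    }
}
```

If the demand figure exceeds node RAM with your real settings, the options are the ones implied above: fewer concurrent tasks per node, smaller task heaps, fewer co-located processes, or more instances. Measured usage from Ganglia or a similar tool is a better input than these worst-case guesses.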