Return-Path: Delivered-To: apmail-hadoop-common-user-archive@www.apache.org Received: (qmail 80583 invoked from network); 23 Feb 2010 02:37:07 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 23 Feb 2010 02:37:07 -0000 Received: (qmail 96791 invoked by uid 500); 23 Feb 2010 02:37:05 -0000 Delivered-To: apmail-hadoop-common-user-archive@hadoop.apache.org Received: (qmail 96673 invoked by uid 500); 23 Feb 2010 02:37:05 -0000 Mailing-List: contact common-user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: common-user@hadoop.apache.org Delivered-To: mailing list common-user@hadoop.apache.org Received: (qmail 96663 invoked by uid 99); 23 Feb 2010 02:37:05 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 23 Feb 2010 02:37:05 +0000 X-ASF-Spam-Status: No, hits=2.2 required=10.0 tests=HTML_MESSAGE,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: local policy) Received: from [98.136.44.50] (HELO n66.bullet.mail.sp1.yahoo.com) (98.136.44.50) by apache.org (qpsmtpd/0.29) with SMTP; Tue, 23 Feb 2010 02:36:54 +0000 Received: from [216.252.122.219] by n66.bullet.mail.sp1.yahoo.com with NNFMP; 23 Feb 2010 02:36:32 -0000 Received: from [98.136.44.168] by t4.bullet.sp1.yahoo.com with NNFMP; 23 Feb 2010 02:36:32 -0000 Received: from [127.0.0.1] by omp609.mail.sp1.yahoo.com with NNFMP; 23 Feb 2010 02:36:32 -0000 X-Yahoo-Newman-Property: ymail-3 X-Yahoo-Newman-Id: 746395.65154.bm@omp609.mail.sp1.yahoo.com Received: (qmail 2343 invoked by uid 60001); 23 Feb 2010 02:36:32 -0000 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=yahoo.com; s=s1024; t=1266892592; bh=mnymI6p9BZeArqZeZWKU2WFhGFeR1fJ8dBQsJhSYiDc=; h=Message-ID:X-YMail-OSG:Received:X-Mailer:Date:From:Subject:To:In-Reply-To:MIME-Version:Content-Type; b=gCS9IrwogaKXtPBW69cV+VyOfweYArV9SZOwkrmjuof0kcjZLlsFwHVGcn2QAflkRC0aHoTdfz1Qt5KzEgDCd33/3G6zMVDxmRCOwgA93gz/F1iTB5fLwqf4bDtjUxRI55vlbwrCcORTDZ5YTJYa2U7yi5AMg343brLff8OCEOQ= DomainKey-Signature: a=rsa-sha1; q=dns; c=nofws; s=s1024; d=yahoo.com; h=Message-ID:X-YMail-OSG:Received:X-Mailer:Date:From:Subject:To:In-Reply-To:MIME-Version:Content-Type; b=DaCfclCmZYBIGQndrA9JzlEdpi7g92MTOuiYexQvrdtShNVcYcwOoteWf5fsn3a56F+a7WHwqUC5Uxq3zBMyvVJ9ZIFeKwTEAkJ6yI96R9kjxqN5FV/3eOahwb+06YAoIL3zgVyP4nqbYLZFGVB80zM+TBlxIAamVnIZDBznXMQ=; Message-ID: <257246.2073.qm@web45108.mail.sp1.yahoo.com> X-YMail-OSG: M4IlqrUVM1mYddL1NXQu6117GfZ9UTK51LJz57eUaXlD0LBS07Nh2iHukAXa4HAoWLR7k7IGkYHgdh_n_3SmD7Sv6xgGkSGq6xj17PyhhTz0mo61u_mozcChJWHrSn5mgEaWRU18_sXb34ANgQcK7hq4l3LJqS82hSzDIvvN5YRTVf2I9CXXDsdFWu75FrJ8cvGJyB.D5zHiWpYvu8ZpGj7S9Pc4lbQFr_82vatuJ39shI.3QCDzaC3VN4oIhbmwW5c_r4kIcYTElZsivOME5HJ8O38ZA1RHDxwHqsos1r4IeuZziBuW.LEjsW5biGt4 Received: from [71.6.110.222] by web45108.mail.sp1.yahoo.com via HTTP; Mon, 22 Feb 2010 18:36:31 PST X-Mailer: YahooMailClassic/9.2.12 YahooMailWebService/0.8.100.260964 Date: Mon, 22 Feb 2010 18:36:31 -0800 (PST) From: jiang licht Subject: Re: OutOfMemoryError of PIG job (UDF loads big file) To: common-user@hadoop.apache.org In-Reply-To: <8211a1321002221813g6d2be728wfd33058e0b623ee3@mail.gmail.com> MIME-Version: 1.0 Content-Type: multipart/alternative; boundary="0-219683985-1266892591=:2073" X-Virus-Checked: Checked by ClamAV on apache.org --0-219683985-1266892591=:2073 Content-Type: text/plain; charset=iso-8859-1 Content-Transfer-Encoding: quoted-printable Thanks Jeff. I also just found this one and solved my problem. BTW, so many= settings to play with :) =0AMichael --- On Mon, 2/22/10, Jeff Zhang wrote: From: Jeff Zhang Subject: Re: OutOfMemoryError of PIG job (UDF loads big file) To: common-user@hadoop.apache.org Date: Monday, February 22, 2010, 8:13 PM Hi Jiang, you should set property *mapred.child.java.opts* in mapred-site.xml to increase the memeory as following: =A0 =A0 =A0 =A0=A0=A0mapred.child.java.opts =A0 =A0 =A0 =A0=A0=A0-Xmx1024m and then restart your hadoop cluster On Tue, Feb 23, 2010 at 9:43 AM, jiang licht wrote: > I am running a hadoop job written in PIG. It fails from out of memory > because a UDF function consumes a lot of memory, it loads a big file. Wha= t > are the settings to avoid the following OutOfMemoryError? I guess by simp= ly > giving PIG big memory (java -XmxBIGmemory org.apache.pig.Main ...) won't > work. > > Error message ---> > > java.lang.OutOfMemoryError: Java heap space >=A0 =A0 =A0 =A0 at java.util.regex.Pattern.compile(Pattern.java:1451) >=A0 =A0 =A0 =A0 at java.util.regex.Pattern.(Pattern.java:1133) >=A0 =A0 =A0 =A0 at java.util.regex.Pattern.compile(Pattern.java:823) >=A0 =A0 =A0 =A0 at java.lang.String.split(String.java:2293) >=A0 =A0 =A0 =A0 at java.lang.String.split(String.java:2335) >=A0 =A0 =A0 =A0 at UDF.load(Unknown Source) >=A0 =A0 =A0 =A0 at UDF.load(Unknown Source) >=A0 =A0 =A0 =A0 at UDF.exec(Unknown Source) >=A0 =A0 =A0 =A0 at UDF.exec(Unknown Source) >=A0 =A0 =A0 =A0 at > org.apache.pig.backend.hadoop.executionengine.physicalLayer.expressionOpe= rators.POUserFunc.getNext(POUserFunc.java:201) >=A0 =A0 =A0 =A0 at > org.apache.pig.backend.hadoop.executionengine.physicalLayer.expressionOpe= rators.POUserFunc.getNext(POUserFunc.java:287) >=A0 =A0 =A0 =A0 at > org.apache.pig.backend.hadoop.executionengine.physicalLayer.relationalOpe= rators.POForEach.processPlan(POForEach.java:278) >=A0 =A0 =A0 =A0 at > org.apache.pig.backend.hadoop.executionengine.physicalLayer.relationalOpe= rators.POForEach.getNext(POForEach.java:204) >=A0 =A0 =A0 =A0 at > org.apache.pig.backend.hadoop.executionengine.physicalLayer.PhysicalOpera= tor.processInput(PhysicalOperator.java:231) >=A0 =A0 =A0 =A0 at > org.apache.pig.backend.hadoop.executionengine.physicalLayer.relationalOpe= rators.POLocalRearrange.getNext(POLocalRearrange.java:240) >=A0 =A0 =A0 =A0 at > org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.PigMapBase.r= unPipeline(PigMapBase.java:249) >=A0 =A0 =A0 =A0 at > org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.PigMapBase.m= ap(PigMapBase.java:240) >=A0 =A0 =A0 =A0 at > org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.PigMapReduce= $Map.map(PigMapReduce.java:93) >=A0 =A0 =A0 =A0 at org.apache.hadoop.mapred.MapRunner.run(MapRunner.java:5= 0) >=A0 =A0 =A0 =A0 at org.apache.hadoop.mapred.MapTask.run(MapTask.java:332) >=A0 =A0 =A0 =A0 at org.apache.hadoop.mapred.Child.main(Child.java:155) > > Thanks! > Michael > > > > --=20 Best Regards Jeff Zhang =0A=0A=0A --0-219683985-1266892591=:2073--