From: "Rottinghuis, Joep"
To: "general@hadoop.apache.org", CDH Users
CC: "wlangiewicz@gmail.com"
Date: Thu, 17 Mar 2011 12:01:34 -0700
Subject: RE: java.io.IOException: Split metadata size exceeded 10000000

I doubt this is a CDH3 issue. We saw the same error with a large job using the 0.20-security branch.

There is a property (mapreduce.jobtracker.split.metainfo.maxsize) that can be used to override the default of 10000000.

We found that passing this property along with the job has no effect; it worked only when we set it on the jobtracker node. Not sure if this is a feature or a bug.
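For reference, a minimal sketch of what the jobtracker-side setting might look like. It assumes the property goes in mapred-site.xml on the jobtracker node; the value shown is only an illustration (10x the default), not a recommendation, and the jobtracker presumably has to be restarted before it picks up the change.

```xml
<!-- mapred-site.xml on the jobtracker node (assumed location) -->
<property>
  <name>mapreduce.jobtracker.split.metainfo.maxsize</name>
  <!-- Illustrative value only: 10x the default of 10000000 -->
  <value>100000000</value>
</property>
```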
Cheers,

Joep

-----Original Message-----
From: Harsh J [mailto:qwertymaniac@gmail.com]
Sent: Tuesday, March 15, 2011 3:33 AM
To: CDH Users
Cc: wlangiewicz@gmail.com
Subject: Re: java.io.IOException: Split metadata size exceeded 10000000

Moving this discussion to the CDH users list at cdh-user [at] cloudera.org since it could be a CDH-specific issue.

[Bcc: general]

On Tue, Mar 15, 2011 at 3:25 PM, Wojciech Langiewicz wrote:
> Hello,
> I'm having this problem running mapreduce jobs over about 10TB of data
> (smaller jobs are ok):
> 2011-03-15 07:48:22,031 ERROR org.apache.hadoop.mapred.JobTracker: Job
> initialization failed:
> java.io.IOException: Split metadata size exceeded 10000000. Aborting job
> job_201103141436_0058
>        at org.apache.hadoop.mapreduce.split.SplitMetaInfoReader.readSplitMetaInfo(SplitMetaInfoReader.java:48)
>        at org.apache.hadoop.mapred.JobInProgress.createSplits(JobInProgress.java:732)
>        at org.apache.hadoop.mapred.JobInProgress.initTasks(JobInProgress.java:633)
>        at org.apache.hadoop.mapred.JobTracker.initJob(JobTracker.java:3965)
>        at org.apache.hadoop.mapred.EagerTaskInitializationListener$InitJob.run(EagerTaskInitializationListener.java:79)
>        at java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
>        at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
>        at java.lang.Thread.run(Thread.java:619)
>
> 2011-03-15 07:48:22,031 INFO org.apache.hadoop.mapred.JobTracker: Failing
> job job_201103141436_0058
>
> What settings should I change to run this job?
> I'm using CDH3b3.
> Thanks for all answers.
>
> --
> Wojciech Langiewicz
>

--
Harsh J
http://harshj.com