Return-Path: X-Original-To: apmail-hadoop-mapreduce-user-archive@minotaur.apache.org Delivered-To: apmail-hadoop-mapreduce-user-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id E7ED59D4E for ; Wed, 13 Mar 2013 12:11:23 +0000 (UTC) Received: (qmail 89482 invoked by uid 500); 13 Mar 2013 12:11:18 -0000 Delivered-To: apmail-hadoop-mapreduce-user-archive@hadoop.apache.org Received: (qmail 89221 invoked by uid 500); 13 Mar 2013 12:11:18 -0000 Mailing-List: contact user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hadoop.apache.org Delivered-To: mailing list user@hadoop.apache.org Received: (qmail 89200 invoked by uid 99); 13 Mar 2013 12:11:17 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 13 Mar 2013 12:11:17 +0000 X-ASF-Spam-Status: No, hits=-0.1 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_MED,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of amits@infolinks.com designates 207.126.144.113 as permitted sender) Received: from [207.126.144.113] (HELO eu1sys200aog102.obsmtp.com) (207.126.144.113) by apache.org (qpsmtpd/0.29) with SMTP; Wed, 13 Mar 2013 12:11:11 +0000 Received: from mail-ia0-f197.google.com ([209.85.210.197]) (using TLSv1) by eu1sys200aob102.postini.com ([207.126.147.11]) with SMTP ID DSNKUUBsyM7lv/M6xJz2SnxJy7soWCdz79ud@postini.com; Wed, 13 Mar 2013 12:10:49 UTC Received: by mail-ia0-f197.google.com with SMTP id u20so3155335iag.4 for ; Wed, 13 Mar 2013 05:10:44 -0700 (PDT) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=20120113; h=x-received:mime-version:x-received:in-reply-to:references:date :message-id:subject:from:to:content-type:x-gm-message-state; bh=2HYi7p4SYZf3vDFv+zQwBc3/UIBLvmU9MwHGjBeCRtA=; b=Jgwaab+i1XIwk12+2ZaztqttP9lRRcVyMPT2zy9+jmVq56MeUgrHaLgnU6/QLc7uWL gxLYAnRJYTzwmLH6ZdTxsBJ4fzwkFmgRAeqUVOVGjeA9VdXpEiY/YGU1E9t8f+DpkV0S xSS+JwpSPT6mQTRW9UwFWRPXVCAwsiac0aD2d1n5h2ANg/Pk5iFzj4Yx98+LKpeg8I8n CGREHXxmP2r1ihd4WaH10UjP4TK02XqGkVs3k5op2sjDlnkj7WiE5nf6lGGlALDlH9Sf SAnBCmCnff+uUlvhpwh4OC86q2PmvjovJqSv68+oiHmY9yu7JyY0qGXgwylawq9uzIFn t2RA== X-Received: by 10.50.135.74 with SMTP id pq10mr15208562igb.46.1363176644280; Wed, 13 Mar 2013 05:10:44 -0700 (PDT) MIME-Version: 1.0 X-Received: by 10.50.135.74 with SMTP id pq10mr15208552igb.46.1363176644188; Wed, 13 Mar 2013 05:10:44 -0700 (PDT) Received: by 10.64.71.33 with HTTP; Wed, 13 Mar 2013 05:10:44 -0700 (PDT) In-Reply-To: References: <1C40D33AEC9DCA40A06F4637B6E58ABF46A80E14@LAX-EX-MB2.datadirect.datadirectnet.com> <513FEB34.7040201@jp.fujitsu.com> Date: Wed, 13 Mar 2013 14:10:44 +0200 Message-ID: Subject: Re: Child error From: Amit Sela To: user@hadoop.apache.org Content-Type: multipart/alternative; boundary=e89a8f838b35a4c43204d7cd4b4a X-Gm-Message-State: ALoCoQmyh3W14gPcLvEoLmSuKIxOjCSbi+GXBoxT2i1lvRhrH5Ss/MD/XDWJPOkrcb4HaTO+EeRr10L8SFUYDLqOCEnanSjXONpNcewRGlu14dxJTVpFzIBklQIeuIUjgyc3i8uoSse3wQipW/uc1UAWy4OudjshwNYCHYK4HkV9v7w1BSceQm8= X-Virus-Checked: Checked by ClamAV on apache.org --e89a8f838b35a4c43204d7cd4b4a Content-Type: text/plain; charset=ISO-8859-1 10x On Wed, Mar 13, 2013 at 1:56 PM, Azuryy Yu wrote: > dont wait patch, its a very simple fix. just do it. > On Mar 13, 2013 5:04 PM, "Amit Sela" wrote: > >> But the patch will work on 1.0.4 correct ? >> >> On Wed, Mar 13, 2013 at 4:57 AM, George Datskos < >> george.datskos@jp.fujitsu.com> wrote: >> >>> Leo >>> >>> That JIRA says "fix version=1.0.4" but it is not correct. The real JIRA >>> is MAPREDUCE-2374. >>> >>> The actual fix version for this bug 1.1.2 >>> >>> >>> George >>> >>> >>> or https://issues.apache.org/jira/browse/MAPREDUCE-4857**** >>> >>> Which is fixed in 1.0.4**** >>> >>> ** ** >>> >>> ** ** >>> >>> *From:* Amit Sela [mailto:amits@infolinks.com ] >>> *Sent:* Tuesday, March 12, 2013 5:08 AM >>> *To:* user@hadoop.apache.org >>> *Subject:* Re: Child error**** >>> >>> ** ** >>> >>> Hi Jean-Marc, **** >>> >>> I am running Hadoop 1.0.3, and I did see this issue you've mentioned but >>> the exit status in the issue is 126 and sometimes I get 255.**** >>> >>> Any ideas what do theses status codes mean ? **** >>> >>> Did you suffer this issue and upgraded to 1.0.4 ? If so, How "smooth" is >>> such upgrade (shouldn't differ from 1.0.3 that much no ?)**** >>> >>> ** ** >>> >>> Thanks!**** >>> >>> ** ** >>> >>> ** ** >>> >>> On Tue, Mar 12, 2013 at 1:40 PM, Jean-Marc Spaggiari < >>> jean-marc@spaggiari.org> wrote:**** >>> >>> Hi Amit, >>> >>> Which Hadoop version are you using? >>> >>> I have been told it's because of >>> https://issues.apache.org/jira/browse/MAPREDUCE-2374 >>> >>> JM >>> >>> 2013/3/12 Amit Sela :**** >>> >>> > Hi all, >>> > >>> > I have a weird failure occurring every now and then during a MapReduce >>> job. >>> > >>> > This is the error: >>> > >>> > java.lang.Throwable: Child Error >>> > at org.apache.hadoop.mapred.TaskRunner.run(TaskRunner.java:271) >>> > Caused by: java.io.IOException: Task process exit with nonzero status >>> of >>> > 255. >>> > at org.apache.hadoop.mapred.TaskRunner.run(TaskRunner.java:258) >>> > >>> > And sometimes it's the same but with status of 126. >>> > >>> > Any ideas ? >>> > >>> > Thanks.**** >>> >>> ** ** >>> >>> >>> >> --e89a8f838b35a4c43204d7cd4b4a Content-Type: text/html; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable
10x

On Wed, Mar 13, 2013= at 1:56 PM, Azuryy Yu <azuryyyu@gmail.com> wrote:

dont wait patch, its a very simple fix. just do it.

On Mar 13, 2013 5:04 PM, "= ;Amit Sela" <amits@infolinks.com> wrote:
But the patch will work on 1.0.4 correct ?

On Wed, Mar 13, 2013 at 4:57 AM, George Datskos <george.datskos@jp.fujitsu.com> wrote:
=20 =20 =20
Leo

That JIRA says "fix version=3D1.0.4" but it is not correct.= =A0 The real JIRA is MAPREDUCE-2374.

The actual fix version for this bug 1.1.2


George


=20 =20 =20

or https://issues.apache.org/jira/browse/MAPREDUCE-4857

Which is fixed in 1.0.4

=A0=

=A0=

From: Amit Sela [mailto:amits@infolinks.com]
Sent: Tuesday, March 12, 2013 5:08 AM
To: user@hadoop.apache.org
Subject: Re: Child error

=A0

Hi=A0Jean-Marc,=A0

I am running=A0Hadoop 1.0.3, and I did see this issue you've mentioned but the exit status in th= e issue is 126 and sometimes I get 255.

Any ideas what do theses status codes mean ?=A0

Did you suffer this issue and upgraded to 1.0.4 ? If so, How "smooth" is such upgrade (sho= uldn't differ from 1.0.3 that much no ?)

=A0

Thanks!

=A0

= =A0

On Tue, Mar 12, 2013 at 1:40 PM, Jean-Marc Spaggiari <jean-marc@spaggiari.org> wrote:

Hi Amit,

Which Hadoop version are you using?

I have been told it's because of
https://issues.apache.org/jira/browse/MAPREDUCE-237= 4

JM

2013/3/12 Amit Sela <amits@infolinks.com>:

> Hi all,
>
> I have a weird failure occurring every now and then during a MapReduce job.
>
> This is the error:
>
> java.lang.Throwable: Child Error
> at org.apache.hadoop.mapred.TaskRunner.run(TaskRunner.java= :271)
> Caused by: java.io.IOException: Task process exit with nonzero status of
> 255.
> at org.apache.hadoop.mapred.TaskRunner.run(TaskRunner.java= :258)
>
> And sometimes it's the same but with status of 126.
>
> Any ideas ?
>
> Thanks.

=A0




--e89a8f838b35a4c43204d7cd4b4a--