From: Siddharth Ubale
To: "user@spark.apache.org", "user@hadoop.apache.org"
Subject: Container exited with a non-zero exit code 1-SparkJOb on YARN
Date: Wed, 20 Jan 2016 12:29:47 +0000

Hi,

 

I am running a Spark job on a YARN cluster.

The job is a Spark Streaming application that reads JSON from a Kafka topic, inserts the JSON values into HBase tables via Phoenix, and then sends certain messages to a WebSocket if the JSON satisfies certain criteria.

 

My cluster is a 3-node cluster with 24 GB of RAM and 24 cores in total.

 

Now:

1. When I submit the job with 10 GB of memory, the application fails, saying that memory is insufficient to run the job.

2. When I submit the job with 6 GB of RAM, it does not always run successfully. Common issues:

    a. Container exited with a non-zero exit code 1; after multiple such warnings the job finishes.

    b. The failed job reports that it was unable to find a file in HDFS named something like _hadoop_conf_xxxxxx.zip

 

Can someone please let me know why I am seeing the above two issues?

 

Thanks,

Siddharth Ubale

 
