Subject: Yarn doesn't start mappers fast enough
From: gmail
Date: Tue, 6 Oct 2015 12:13:39 +0200
To: user@hadoop.apache.org

Hello everyone,

I have a problem with my YARN setup and hope you can help me. I already searched for this issue but didn't find anything.

My problem is that YARN doesn't start new mappers fast enough. This results in poor cluster utilization.

Setup:
 - 8 nodes, each with 64 cores and 128 GB
 - Hadoop version: 2.6.0
 - Standard Terasort of 100 GB, input data generated by teragen with two mappers

What I see: At most ~40 mappers run at the same time. At that point, the rate at which new mappers start appears to be about the same as the rate at which running mappers finish. The average processing time of each mapper is about 34-40 s. If I start a second Terasort at the same time, it also only runs up to ~40 mappers. It seems that 1) YARN correctly detects that it can run more but 2) doesn't start new mappers fast enough (1 at a time?).

What I expect: better utilization of all nodes, since there are 300+ map tasks.

Are there parameters to change this behavior? How can I tell YARN to start more instances at the same time?

For completeness:
 - The behavior doesn't change if I use more mappers during teragen.
 - The behavior doesn't change if I modify the number of nodes.
 - I recompiled Hadoop for 64-bit according to https://hadoop.apache.org/docs/current/hadoop-project-dist/hadoop-common/NativeLibraries.html
 - I use GPFS as the backend with the IBM gpfs-connector.

Thanks in advance,
Jürgen
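A rough sanity check of the "1 at a time?" guess above: if the job is in steady state (an assumption, not something measured here), Little's law relates the number of concurrently running mappers to the start rate and the per-mapper run time. The sketch below only uses the observed figures (~40 mappers, 34-40 s each); the function names and the target of 300 concurrent mappers are illustrative assumptions.

```python
# Back-of-the-envelope check with Little's law: L = rate * W
#   L    = mappers running at the same time
#   rate = mappers started per second (equal to the finish rate in steady state)
#   W    = run time of one mapper in seconds

def start_rate(concurrent_mappers: float, mapper_seconds: float) -> float:
    """Start rate (mappers/s) implied by a steady-state concurrency level."""
    return concurrent_mappers / mapper_seconds


def rate_needed(target_concurrency: float, mapper_seconds: float) -> float:
    """Start rate (mappers/s) needed to sustain a target concurrency level."""
    return target_concurrency / mapper_seconds


OBSERVED_CONCURRENCY = 40        # from the observation: at most ~40 mappers
MAPPER_SECONDS = (34.0, 40.0)    # from the observation: 34-40 s per mapper
TARGET_CONCURRENCY = 300         # assumption: keep roughly all 300+ maps running

for seconds in MAPPER_SECONDS:
    observed = start_rate(OBSERVED_CONCURRENCY, seconds)
    needed = rate_needed(TARGET_CONCURRENCY, seconds)
    print(f"{seconds:.0f} s mappers: observed ~{observed:.2f} starts/s, "
          f"~{needed:.1f} starts/s needed for {TARGET_CONCURRENCY} concurrent")
```

Under these assumptions the observed plateau corresponds to roughly one mapper start per second, which is consistent with the impression that containers are handed out about one at a time.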