Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id 6491A200D5D for ; Wed, 6 Dec 2017 03:20:03 +0100 (CET) Received: by cust-asf.ponee.io (Postfix) id 63281160C1C; Wed, 6 Dec 2017 02:20:03 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id 81D35160C1B for ; Wed, 6 Dec 2017 03:20:02 +0100 (CET) Received: (qmail 63404 invoked by uid 500); 6 Dec 2017 02:20:01 -0000 Mailing-List: contact user-help@ignite.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@ignite.apache.org Delivered-To: mailing list user@ignite.apache.org Received: (qmail 63382 invoked by uid 99); 6 Dec 2017 02:20:01 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd1-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 06 Dec 2017 02:20:01 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd1-us-west.apache.org (ASF Mail Server at spamd1-us-west.apache.org) with ESMTP id A1CC4C2505 for ; Wed, 6 Dec 2017 02:20:00 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd1-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 1.879 X-Spam-Level: * X-Spam-Status: No, score=1.879 tagged_above=-999 required=6.31 tests=[DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, HTML_MESSAGE=2, RCVD_IN_DNSWL_NONE=-0.0001, RCVD_IN_MSPIKE_H3=-0.01, RCVD_IN_MSPIKE_WL=-0.01, SPF_PASS=-0.001] autolearn=disabled Authentication-Results: spamd1-us-west.apache.org (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com Received: from mx1-lw-us.apache.org ([10.40.0.8]) by localhost (spamd1-us-west.apache.org [10.40.0.7]) (amavisd-new, port 10024) with ESMTP id VfzGl6TrwHSW for ; Wed, 6 Dec 2017 02:19:59 +0000 (UTC) Received: from mail-it0-f46.google.com (mail-it0-f46.google.com [209.85.214.46]) by mx1-lw-us.apache.org (ASF Mail Server at mx1-lw-us.apache.org) with ESMTPS id 4ABF25F2A9 for ; Wed, 6 Dec 2017 02:19:59 +0000 (UTC) Received: by mail-it0-f46.google.com with SMTP id b5so5432640itc.3 for ; Tue, 05 Dec 2017 18:19:59 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=mime-version:in-reply-to:references:from:date:message-id:subject:to; bh=yzb+qtb7v3834kPjwbrEMqoW2ZRuQzhcPS2KHTqqgH8=; b=bDs2bVwLfLBuCyjOeWG3SZGtsgbj983/xEyxBH1NQgfRrO8P/GSVo8xPgku8nbDf8T qLRaV2+RSrYKnv2AKkdn6FaknVJHTzyf6U0aem0r1EC7ogI49RpUhMUKs7FejdC66Rsg fs+9D6KI+Aho1dhf9MGQO/JmA5xBAf8L7dhxyJRLfGOkjtkrKu36fPckfUrtY/mIr8gd xabBvflPf3ydhVQtXaMoMD2eNucMu/5PDkJh5O8eTS7o4GgrbIvvPF5i+SOTlY1y2ZnZ PqDiZfpyiXlE4THSXv+j0t0ReSRJZY9v54frCpt3ZmrjVNnRWUcn9JkcKJC3CTpicPtE Rt+g== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:in-reply-to:references:from:date :message-id:subject:to; bh=yzb+qtb7v3834kPjwbrEMqoW2ZRuQzhcPS2KHTqqgH8=; b=XPjmGc/ZFRrqtkXr92WU8zh+C/doi9moqf40J1Q29ukwuV7Wk/2xMqlhs/ydA59UXl h/jimQM4kulWTGr5LEkn9XKs7IK6in5/giuQk0malR3Uz5pdLbm4BIPNOCjl/U5ZWYt6 CW6xHv/m2mimLTWt3uf/SnNp5jQaAUC39k5jZ0uDMrr0tdzlikwaiW66hrhtLU9T5WwL mHAIis4d29UIbLmm6k6t3HKHuqhX04+pG7vkT/U8KGtpEeX27LLpyrNAfAuLa/JQ8ZwC 5eKA1oCasuBndzIjEqarUwEoi+sOUO65jras2+knMbp1flCPFXf0hvPoIC6qGNncfkNo mejA== X-Gm-Message-State: AKGB3mJ444szhhTBiqW7phfpXtQudU5TVqLKvD2OaTf3XQZ3+MvHFcQ9 sob4PYLrfqvQ94yAn2V1mOOSDr1sjaY3C2Jk5lt9 X-Google-Smtp-Source: AGs4zMZhD4yFLnW3Dmc263bSBx6WufuNikGEKuY60aN1t5MQiaff9pCtAdbfm0gEXqnGDlbmLuWFUKM57KaQroocxG8= X-Received: by 10.36.162.76 with SMTP id o12mr11934249iti.76.1512526792515; Tue, 05 Dec 2017 18:19:52 -0800 (PST) MIME-Version: 1.0 Received: by 10.107.242.23 with HTTP; Tue, 5 Dec 2017 18:19:51 -0800 (PST) In-Reply-To: <1512139390968-0.post@n6.nabble.com> References: <1512139390968-0.post@n6.nabble.com> From: =?UTF-8?B?SnVhbiBSb2Ryw61ndWV6IEhvcnRhbMOh?= Date: Tue, 5 Dec 2017 18:19:51 -0800 Message-ID: Subject: Re: Error running ignite in YARN To: user@ignite.apache.org Content-Type: multipart/alternative; boundary="f403045fad682e6338055fa2963d" archived-at: Wed, 06 Dec 2017 02:20:03 -0000 --f403045fad682e6338055fa2963d Content-Type: text/plain; charset="UTF-8" Hi Ilya, Thanks a lot for your help, I'll try and the test the patch. Regarding how I run this, I wait for the cluster to enter in WAITING state and then I ssh to the master node as described in https://docs.aws.amazon.com/emr/latest/ManagementGuide/emr-connect-master-node-ssh.html, and then I just run the commands there. Before running the command I put the files in HDFS. We cannot use a bootstrap action here because they run before HDFS starts, but another option is using a EMR step for this. For the node manager logs, I follow https://docs.aws.amazon.com/emr/latest/ManagementGuide/emr-web-interfaces.html to have access to the resource manager UI, and then I see the nodes that are involved with the YARN application. Then I ssh to some of the slave nodes involved with the job from the EMR master, and check the node manager logs in /var/log/hadoop-yarn. Also we could configure yarn like suggested in https://slider.incubator.apache.org/docs/getting_started.html to have 1 hour of timeout before cleaning up containers, using https://docs.aws.amazon.com/emr/latest/ReleaseGuide/emr-configure-apps.html to configure that on EMR. The container files are then available in the EMR slaves at /mnt/yarn/ I'll tell you how it goes when I have tested the patch. Thanks a lot! Juan On Fri, Dec 1, 2017 at 6:43 AM, ilya.kasnacheev wrote: > Hello once more! > > I have prepared a patch. I think it won't be hard for you to try it out > since you are accustomized to building Ignite. Please try it, tell if it > solves the current issue. ignite-7080.patch > t1312/ignite-7080.patch> > > I was trying to set Ignite up on AWS EMR today, without much success. I > wonder if you could provide me with some directions. > > > IGNITE_YARN_JAR=/mnt/ignite/apache-ignite-2.3.0-src/ > modules/yarn/target/ignite-yarn-2.3.0.jar > > yarn jar ${IGNITE_YARN_JAR} ${IGNITE_YARN_JAR} > > /mnt/ignite/ignite_yarn.properties > > Can you tell me what were you using to launch this command? I have tried > executing "Custom JAR" step on EMR cluster, after copying Ignite properties > and JAR to /mnt on master manually. So I was informed that an application > was started, and is now running, but no indication about execution results. > I suspect it hung with the same problem that you outlined (or earlier), but > I haven't found a way to view application logs, neither "node manager" logs > that you are referring to. Can you please point me where I can watch > intermediate execution results? > > Also there's two data access questions. First, Ignite YARN jar expects two > files on local FS, the jar itself, and properties file. Is there a way to > make sure that these files are on file system, at predictable locations, > when starting EMR cluster? To be able to add "Custom JAR" step to cluster > initialization? E.g. specify an s3 bucket to be mounted to master node? > > Another question is that you are specifying ignite zip and ignite config > are > on hdfs. How would you put these files to hdfs when cluster is created? Or, > for that matter, after it is created? I haven't found any access to HDFS in > EMR console, neither any instructions on how to access it. > > Maybe we should change this process in order to use Ignite YARN with EMR > without extensive boilerplate? > > Regards, > > > > -- > Sent from: http://apache-ignite-users.70518.x6.nabble.com/ > --f403045fad682e6338055fa2963d Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable
Hi Ilya,=C2=A0

Thanks a lot for your he= lp, I'll try and the test the patch.=C2=A0

Reg= arding how I run this, I wait for the cluster to enter in WAITING state and= then I ssh to the master node as described in htt= ps://docs.aws.amazon.com/emr/latest/ManagementGuide/emr-connect-master-node= -ssh.html, and then I just run the commands there. Before running the c= ommand I put the files in HDFS. We cannot use a bootstrap action here becau= se they run before HDFS starts, but another option is using a EMR step for = this.=C2=A0

For the node manager logs, I follow=C2= =A0https://docs.aws.amazon.com/emr/latest/ManagementGuide/e= mr-web-interfaces.html to have access to the resource manager UI, and t= hen I see the nodes that are involved with the YARN application. Then I ssh= to some of the slave nodes involved with the job from the EMR master, and = check the node manager logs in=C2=A0/var/log/hadoop-yarn. Also we could con= figure yarn like suggested in=C2=A0https://slider.incubator.apache.org/docs/= getting_started.html to have 1 hour of timeout before cleaning up conta= iners, using=C2=A0https://docs.aws.amazon.com/emr/latest/Relea= seGuide/emr-configure-apps.html to configure that on EMR. The container= files are then available in the EMR slaves at=C2=A0/mnt/yarn/


I'll tell you how it goes when I have t= ested the patch.=C2=A0

Thanks a lot!
Juan=C2=A0


On Fri, Dec 1, 2017 at 6:43 AM, ilya.kasnac= heev <ilya.kasnacheev@gmail.com> wrote:
Hello once more!

I have prepared a patch. I think it won't be hard for you to try it out=
since you are accustomized to building Ignite. Please try it, tell if it solves the current issue.=C2=A0 ignite-7080.patch
<http://apache-ignite-= users.70518.x6.nabble.com/file/t1312/ignite-7080.patch>
I was trying to set Ignite up on AWS EMR today, without much success. I
wonder if you could provide me with some directions.

> IGNITE_YARN_JAR=3D/mnt/ignite/apache-ignite-2.3.0-src/module= s/yarn/target/ignite-yarn-2.3.0.jar
>=C2=A0 yarn jar ${IGNITE_YARN_JAR} ${IGNITE_YARN_JAR}
> /mnt/ignite/ignite_yarn.properties

Can you tell me what were you using to launch this command? I have t= ried
executing "Custom JAR" step on EMR cluster, after copying Ignite = properties
and JAR to /mnt on master manually. So I was informed that an application was started, and is now running, but no indication about execution results.=
I suspect it hung with the same problem that you outlined (or earlier), but=
I haven't found a way to view application logs, neither "node mana= ger" logs
that you are referring to. Can you please point me where I can watch
intermediate execution results?

Also there's two data access questions. First, Ignite YARN jar expects = two
files on local FS, the jar itself, and properties file. Is there a way to make sure that these files are on file system, at predictable locations, when starting EMR cluster? To be able to add "Custom JAR" step to= cluster
initialization? E.g. specify an s3 bucket to be mounted to master node?

Another question is that you are specifying ignite zip and ignite config ar= e
on hdfs. How would you put these files to hdfs when cluster is created? Or,=
for that matter, after it is created? I haven't found any access to HDF= S in
EMR console, neither any instructions on how to access it.

Maybe we should change this process in order to use Ignite YARN with EMR without extensive boilerplate?

--f403045fad682e6338055fa2963d--