From: Mangirish Wagle <vaglomangirish@gmail.com>
Reply-To: dev@airavata.apache.org
To: dev@airavata.apache.org
Date: Tue, 18 Oct 2016 01:54:03 -0400
Subject: Re: Running MPI jobs on Mesos based clusters

Hello Devs,

Here is an update on some new learnings and thoughts based on my interactions with the Mesos and Aurora devs.

The MPI implementations in the Mesos repositories (like MPI Hydra) rely on obsolete MPI platforms and are no longer supported by the developer community. Hence, it is not recommended that we use them for our purpose.

One of the known ways of running MPI jobs over Mesos is "gang scheduling", which essentially distributes the MPI run over multiple jobs on Mesos in place of multiple nodes. The challenge here is that the jobs need to be scheduled as one unit, and a failure in any one job should collectively fail the main program, including all the distributed jobs.

One of the Mesos developers (Niklas Nielsen) pointed me to his work on gang scheduling: https://github.com/nqn. This code may not be fully tested, but it is certainly a good starting point for exploring gang scheduling.

One of the Aurora developers (Stephen Erb) suggested implementing gang scheduling on top of Aurora. The Aurora scheduler assumes that every job is independent, so we would need to develop some external scaffolding to coordinate and schedule these jobs, which might not be trivial. One advantage of using Aurora as the backend for gang scheduling is that we would inherit Aurora's robustness, which would otherwise be a key challenge if targeting bare Mesos.
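To make the coordination requirement concrete, the scaffolding would essentially have to enforce all-or-nothing semantics across the gang: launch all members together, and if any member fails, tear the rest down and report the whole run as failed. Below is a purely illustrative sketch of that logic in C; it uses local child processes as stand-ins for the Mesos/Aurora jobs, and NUM_JOBS and the 'sleep' workload are placeholders rather than a real scheduler integration:

    #include <signal.h>
    #include <stdio.h>
    #include <stdlib.h>
    #include <sys/types.h>
    #include <sys/wait.h>
    #include <unistd.h>

    #define NUM_JOBS 4            /* placeholder: number of gang members */

    int main(void)
    {
        pid_t jobs[NUM_JOBS];
        int failed = 0;

        /* Launch the whole gang together; in reality each member would be a
         * Mesos/Aurora job rather than a local child process. */
        for (int i = 0; i < NUM_JOBS; i++) {
            pid_t pid = fork();
            if (pid < 0) {
                perror("fork");
                exit(1);
            }
            if (pid == 0) {
                execlp("sleep", "sleep", "10", (char *)NULL);  /* placeholder workload */
                _exit(127);
            }
            jobs[i] = pid;
        }

        /* Wait for every member; if any one fails, kill the others so the run
         * errors out collectively instead of leaving stragglers running. */
        for (int done = 0; done < NUM_JOBS; done++) {
            int status;
            pid_t pid = wait(&status);
            if (pid > 0 && (!WIFEXITED(status) || WEXITSTATUS(status) != 0) && !failed) {
                failed = 1;
                for (int i = 0; i < NUM_JOBS; i++)
                    kill(jobs[i], SIGTERM);  /* members that already exited just fail with ESRCH */
            }
        }

        fprintf(stderr, "gang %s\n", failed ? "failed collectively" : "completed");
        return failed;
    }

With a real scheduler, the same loop would instead poll the framework's task states and issue kill requests through its API, but the collective failure semantics would be the same.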
As an alternative to all the options above, I think we should be able to run a 1-node MPI job through Aurora. A resource offer with CPUs and memory from Mesos is abstracted as a single runtime but is mapped to multiple nodes underneath, which would eventually exploit distributed resource capabilities.

I intend to try out the 1-node MPI job submission approach first and simultaneously explore the gang scheduling approach.

Please let me know your thoughts/suggestions.

Best Regards,
Mangirish

On Thu, Oct 13, 2016 at 12:39 PM, Mangirish Wagle <vaglomangirish@gmail.com> wrote:

> Hi Marlon,
>
> Thanks for confirming and sharing the legal link.
>
> -Mangirish
>
> On Thu, Oct 13, 2016 at 12:13 PM, Pierce, Marlon <marpierc@iu.edu> wrote:
>
>> BSD is ok: https://www.apache.org/legal/resolved.
>>
>> From: Mangirish Wagle <vaglomangirish@gmail.com>
>> Reply-To: "dev@airavata.apache.org"
>> Date: Thursday, October 13, 2016 at 12:03 PM
>> To: "dev@airavata.apache.org"
>> Subject: Re: Running MPI jobs on Mesos based clusters
>>
>> Hello Devs,
>>
>> I needed some advice on the license of the MPI libraries. The MPICH library that I have been trying claims to have a "BSD-like" license (http://git.mpich.org/mpich.git/blob/HEAD:/COPYRIGHT).
>>
>> I am aware that OpenMPI, which uses a BSD license, is currently used in our application. I had chosen to start investigating MPICH because it claims to be a highly portable, high-quality implementation of the latest MPI standard, suitable for cloud-based clusters.
>>
>> If anyone could please advise on the acceptability of the MPICH library's BSD-like license for the ASF, that would help.
>>
>> Thank you.
>>
>> Best Regards,
>>
>> Mangirish Wagle
>>
>> On Thu, Oct 6, 2016 at 1:48 AM, Mangirish Wagle <vaglomangirish@gmail.com> wrote:
>>
>> Hello Devs,
>>
>> The network issue mentioned above now stands resolved. The problem was that iptables had some conflicting rules which blocked the traffic. It was resolved by a simple iptables flush.
>>
>> Here is the test MPI program running on multiple machines:
>>
>> [centos@mesos-slave-1 ~]$ mpiexec -f machinefile -n 2 ./mpitest
>> Hello world!  I am process number: 0 on host mesos-slave-1
>> Hello world!  I am process number: 1 on host mesos-slave-2
>>
>> The next step is to try invoking this through a framework like Marathon. However, the job submission still does not run through Marathon. It seems to get stuck in the 'waiting' state forever (for example, http://149.165.170.245:8080/ui/#/apps/%2Fmaw-try). Further, I notice that Marathon is listed under 'inactive frameworks' in the Mesos dashboard (http://149.165.171.33:5050/#/frameworks).
>>
>> I am trying to get this working, though any help/clues with this would be really helpful.
>>
>> Thanks and Regards,
>>
>> Mangirish Wagle
>>
>> On Fri, Sep 30, 2016 at 9:21 PM, Mangirish Wagle <vaglomangirish@gmail.com> wrote:
>>
>> Hello Devs,
>>
>> I am currently running a sample MPI C program using 'mpiexec' provided by MPICH. I followed their installation guide to install the libraries on the master and slave nodes of the Mesos cluster.
>>
>> The approach that I am trying out here is to equip the underlying nodes with MPI tooling and then use a Mesos framework like Marathon or Aurora to submit jobs that run MPI programs by invoking these tools.
>>
>> You can potentially run an MPI program using mpiexec in the following manner:
>>
>> # mpiexec -f machinefile -n 2 ./mpitest
>>
>> - machinefile -> file which contains an inventory of machines to run the program on and the number of processes on each machine.
>> - mpitest -> MPI program compiled in C using the mpicc compiler. The program returns the process number and the hostname of the machine running the process.
>> - The -n option indicates the number of processes to spawn.
>>
>> Example of machinefile contents:
>>
>> # Entries in the format <hostname/IP>:<number of processes>
>> mesos-slave-1:1
>> mesos-slave-2:1
>>
>> The reason for choosing slaves is that Mesos runs the jobs on the slaves, managed by 'agents' pertaining to the slaves.
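>> For reference, the test program ('mpitest') is essentially a minimal MPI hello-world along the following lines (a sketch of the idea; the exact source may differ slightly). It is compiled with 'mpicc -o mpitest mpitest.c':
>>
>>     #include <stdio.h>
>>     #include <mpi.h>
>>
>>     int main(int argc, char **argv)
>>     {
>>         int rank, name_len;
>>         char hostname[MPI_MAX_PROCESSOR_NAME];
>>
>>         MPI_Init(&argc, &argv);                       /* start the MPI runtime */
>>         MPI_Comm_rank(MPI_COMM_WORLD, &rank);         /* this process's number (rank) */
>>         MPI_Get_processor_name(hostname, &name_len);  /* hostname of the executing node */
>>
>>         printf("Hello world!  I am process number: %d on host %s\n", rank, hostname);
>>
>>         MPI_Finalize();
>>         return 0;
>>     }
>>
>> MPI_Comm_rank() gives the process number and MPI_Get_processor_name() the hostname, which is what shows up in the output below.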
>> Output of the program with '-n 1':
>>
>> # mpiexec -f machinefile -n 1 ./mpitest
>> Hello world!  I am process number: 0 on host mesos-slave-1
>>
>> But when I try '-n 2', I am hitting the following error:
>>
>> # mpiexec -f machinefile -n 2 ./mpitest
>> [proxy:0:1@mesos-slave-2] HYDU_sock_connect (/home/centos/mpich-3.2/src/pm/hydra/utils/sock/sock.c:172): unable to connect from "mesos-slave-2" to "mesos-slave-1" (No route to host)
>> [proxy:0:1@mesos-slave-2] main (/home/centos/mpich-3.2/src/pm/hydra/pm/pmiserv/pmip.c:189): unable to connect to server mesos-slave-1 at port 44788 (check for firewalls!)
>>
>> It seems the program execution is not allowed because network traffic is being blocked. I checked the security groups in the SciGaP OpenStack for the mesos-slave-1 and mesos-slave-2 nodes, and they are set to the 'wideopen' policy. Furthermore, I tried adding explicit rules to the policies to allow all TCP and UDP (currently I am not sure which protocol is used underneath), but it still continues throwing this error.
>>
>> Any clues, suggestions, or comments about the error or the approach as a whole would be helpful.
>>
>> Thanks and Regards,
>>
>> Mangirish Wagle
>>
>> On Tue, Sep 27, 2016 at 11:23 AM, Mangirish Wagle <vaglomangirish@gmail.com> wrote:
>>
>> Hello Devs,
>>
>> Thanks Gourav and Shameera for all the work w.r.t. setting up the Mesos-Marathon cluster on Jetstream.
>>
>> I am currently evaluating MPICH (http://www.mpich.org/about/overview/) to be used for launching MPI jobs on top of Mesos. MPICH version 1.2 supports Mesos-based MPI scheduling. I have also been trying to submit jobs to the cluster through Marathon. However, in either case I am currently facing issues which I am working to get resolved.
>>
>> I am compiling my notes into the following Google doc. Please review and let me know your comments and suggestions.
>>
>> https://docs.google.com/document/d/1p_Y4Zd4I4lgt264IHspXJli3la25y6bcPcmrTD6nR8g/edit?usp=sharing
>>
>> Thanks and Regards,
>>
>> Mangirish Wagle
>>
>> On Wed, Sep 21, 2016 at 3:20 PM, Shenoy, Gourav Ganesh <goshenoy@indiana.edu> wrote:
>>
>> Hi Mangirish,
>>
>> I have set up a Mesos-Marathon cluster for you on Jetstream. I will share the cluster details with you in a separate email. Kindly note that there are 3 masters & 2 slaves in this cluster.
>>
>> I am also working on automating this process for Jetstream (similar to Shameera's ansible script for EC2), and when that is ready, we can create clusters or add/remove slave machines from the cluster.
>>
>> Thanks and Regards,
>>
>> Gourav Shenoy
>>
>> From: Mangirish Wagle <vaglomangirish@gmail.com>
>> Reply-To: "dev@airavata.apache.org"
>> Date: Wednesday, September 21, 2016 at 2:36 PM
>> To: "dev@airavata.apache.org"
>> Subject: Running MPI jobs on Mesos based clusters
>>
>> Hello All,
>>
>> I would like to make everybody aware of the study I am undertaking this fall, i.e. evaluating different frameworks that would facilitate MPI jobs on Mesos-based clusters for Apache Airavata.
>>
>> Some of the options that I am looking at are:
>>
>> 1. MPI support framework bundled with Mesos
>> 2. Apache Aurora
>> 3. Marathon
>> 4. Chronos
>>
>> Some of the evaluation criteria that I am planning to base my investigation on are:
>>
>> - Ease of setup
>> - Documentation
>> - Reliability features like HA
>> - Scaling and fault recovery
>> - Performance
>> - Community support
>>
>> Gourav and Shameera are working on Ansible-based automation to spin up a Mesos-based cluster, and I am planning to use it to set up a cluster for experimentation.
>>
>> Any suggestions or information about prior work on this would be highly appreciated.
>>
>> Thank you.
>>
>> Best Regards,
>>
>> Mangirish Wagle