From: Gourav Rattihalli
Date: Mon, 21 Mar 2016 10:22:31 -0400
Subject: [GSoC Proposal] - Integrating Job and Cloud Health Information of Apache Aurora with Apache Airavata
To: dev@airavata.apache.org

Hi Dev Team,

Please review the following GSoC proposal that I plan to submit:

*Title*: Integrating Job and Cloud Health Information of Apache Aurora with Apache Airavata

*Abstract*:

This project will incorporate Apache Aurora to enable Airavata to launch jobs in large cloud environments and to collect information on the health of each job and of the cloud resources. The project will also analyze the current micro-services architecture of Airavata and develop code for an updated architecture for modules such as Logging. As a result, another outcome of this project would be a module that collects all the logging information from the various execution points in an Airavata job's lifecycle and provides search and mining capability.

*Introduction*:

Apache Aurora is a service scheduler that runs on top of Apache Mesos. This combination enables long-running services that take advantage of Mesos's scalability, fault tolerance, and resource isolation. Apache Mesos is a cluster manager that provides information about the state of the cluster, and Aurora uses that knowledge to make scheduling decisions. For example, when a machine fails, Aurora automatically reschedules the previously running services onto a healthy machine to keep them running. Aurora tracks each job through the following states: pending, assigned, starting, running, and finished.
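As a point of reference, the sketch below shows one way an Airavata-side component might represent these states. The AuroraJobState values follow the states listed above; the mapped Airavata-side status names are placeholders chosen for this example, not Airavata's actual state model.

from enum import Enum

class AuroraJobState(Enum):
    # Job states reported by Aurora, as listed in the paragraph above.
    PENDING = 'PENDING'
    ASSIGNED = 'ASSIGNED'
    STARTING = 'STARTING'
    RUNNING = 'RUNNING'
    FINISHED = 'FINISHED'

# Hypothetical mapping to a generic Airavata-side status; the target names
# are illustrative placeholders, not Airavata's real job-state values.
AURORA_TO_AIRAVATA = {
    AuroraJobState.PENDING: 'QUEUED',
    AuroraJobState.ASSIGNED: 'QUEUED',
    AuroraJobState.STARTING: 'ACTIVE',
    AuroraJobState.RUNNING: 'ACTIVE',
    AuroraJobState.FINISHED: 'COMPLETE',
}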


Apache Aurora requires a ".aurora" configuration file to launch jobs. The following is an example of an Aurora configuration file:


import os

hello_world_process = Process(name = 'hello_world', cmdline = 'echo hello world')

hello_world_task = Task(
  resources = Resources(cpu = 0.1, ram = 16 * MB, disk = 16 * MB),
  processes = [hello_world_process])

hello_world_job = Job(
  cluster = 'cluster1',
  role = os.getenv('USER'),
  task = hello_world_task)

jobs = [hello_world_job]


To launch the job with the above configuration, we use:


aurora job create cluster1/$USER/test/hello_world hello_world.aurora


This project will develop modules in Airavata to automatically generate the Aurora configuration file needed to launch a job on an Aurora-managed cluster in a cloud environment. The Aurora user interface provides detailed information on the job status, job name, start and finish times, location of the logs, and resource usage. This project will add a module to Apache Aurora to pull this detailed information using the Aurora HTTP API.
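To make the configuration-generation idea concrete, here is a minimal sketch of what such a module could look like. generate_aurora_config and its parameters are hypothetical names invented for this illustration, not existing Airavata or Aurora interfaces; the template simply reproduces the hello_world example shown earlier.

AURORA_TEMPLATE = """import os
{name}_process = Process(name = '{name}', cmdline = '{cmdline}')
{name}_task = Task(
  resources = Resources(cpu = {cpu}, ram = {ram_mb} * MB, disk = {disk_mb} * MB),
  processes = [{name}_process])
{name}_job = Job(
  cluster = '{cluster}',
  role = os.getenv('USER'),
  task = {name}_task)
jobs = [{name}_job]
"""

def generate_aurora_config(name, cmdline, cpu=0.1, ram_mb=16, disk_mb=16,
                           cluster='cluster1'):
    # Render the template and write it to a <name>.aurora file that can be
    # passed to "aurora job create", as in the command shown above.
    config = AURORA_TEMPLATE.format(name=name, cmdline=cmdline, cpu=cpu,
                                    ram_mb=ram_mb, disk_mb=disk_mb,
                                    cluster=cluster)
    path = name + '.aurora'
    with open(path, 'w') as f:
        f.write(config)
    return path

# Example: regenerates the hello_world configuration shown earlier.
generate_aurora_config('hello_world', 'echo hello world')

In the actual project, the job name, command line, and resource values would come from the Airavata job submission request rather than hard-coded arguments.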


*Goals*:
- This project will investigate how Apache Aurora collects information about the cluster environment for display on the Aurora web interface. We will study the Aurora HTTP API, retrieve all the information related to the target infrastructure and job health, and make it available to the Airavata job submission module.

- We will process the information retrieved from Aurora and convert it into a format that Airavata can use for further action (a rough sketch of this retrieval-and-conversion step follows the list below).

- We will use appropriate design patterns to integrate Aurora as one of the options for Big Data and Cloud resource frameworks within the Airavata framework.

- We will make the resource information from Aurora available for display on the Airavata dashboard.
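The sketch below illustrates the retrieve-and-convert step described in the first two goals. The scheduler address, the status endpoint path, and the field names on both the Aurora and Airavata sides are assumptions made for this example, not documented interfaces.

import requests  # assumed available; any HTTP client would do

# Placeholder values: the scheduler address and endpoint path are
# assumptions for illustration, not a documented Aurora URL.
SCHEDULER_URL = 'http://aurora-scheduler.example.org:8081'
STATUS_ENDPOINT = SCHEDULER_URL + '/status/jobs'  # hypothetical endpoint

def fetch_job_health(job_key):
    # Retrieve raw health/status data for one job from the scheduler.
    resp = requests.get(STATUS_ENDPOINT, params={'job': job_key}, timeout=10)
    resp.raise_for_status()
    return resp.json()

def to_airavata_record(raw):
    # Flatten the raw response into a record an Airavata-side component
    # could store or show on the dashboard; field names are placeholders.
    return {
        'jobName': raw.get('name'),
        'status': raw.get('state'),
        'startTime': raw.get('started'),
        'endTime': raw.get('finished'),
        'logsLocation': raw.get('logs'),
        'cpuUsage': raw.get('resources', {}).get('cpu'),
        'ramUsage': raw.get('resources', {}).get('ram'),
    }

# Example usage (requires a reachable scheduler):
# record = to_airavata_record(fetch_job_health('cluster1/gourav/test/hello_world'))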


Any comments and suggestions would be very helpful.

-Gourav Rattihalli