Mailing-List: contact dev-help@airavata.apache.org; run by ezmlm
Precedence: bulk
Reply-To: dev@airavata.apache.org
Received-SPF: pass (athena.apache.org: domain of spamidig@illinois.edu
 designates 192.17.82.70 as permitted sender)
From: "Pamidighantam, Sudhakar V" <spamidig@illinois.edu>
To: "dev@airavata.apache.org" <dev@airavata.apache.org>
CC: Srinath Perera <srinath@wso2.com>, Dilum Bandara <dilumb@cse.mrt.ac.lk>
Subject: Re: DataCat Project Progress
Thread-Topic: DataCat Project Progress
Thread-Index: AQHQFSsKsuHgqvzFqE6SSq7pSaX6RZyK00uA
Date: Thu, 11 Dec 2014 14:12:41 +0000
Message-ID: <DBE83381-C3C5-4E7E-AA87-63A723AFCBC3@illinois.edu>
References: 
 <CAFwzmVCHYJiixu0hbC353EF+-duM3QwBc=et5HSqwzFV6GQFQg@mail.gmail.com>
In-Reply-To: 
 <CAFwzmVCHYJiixu0hbC353EF+-duM3QwBc=et5HSqwzFV6GQFQg@mail.gmail.com>
Accept-Language: en-US
Content-Language: en-US
Content-Type: multipart/alternative;
	boundary="_000_DBE83381C3C54E7EAA8763A723AFCBC3illinoisedu_"
MIME-Version: 1.0

--_000_DBE83381C3C54E7EAA8763A723AFCBC3illinoisedu_
Content-Type: text/plain; charset="us-ascii"
Content-Transfer-Encoding: quoted-printable

Supun:
I support these goals. I am available for you to engage with you and anythi=
ng I can do to expedite the project please let me know. Even if you think I=
 can not,  do please ask anyway.
I have written the original parsers in perl myself and have directed others=
 when the Cup/JFLex system was put in place. I have generated and modified =
the CUP/JFlex code before, so I am familiar with how it works. I will look =
at the paper and may suggest  additions.

I can not test the system now by adding more data and see if this can parse=
 the new data. How can we get to that point. This is critical first step fo=
r me before I can ask friendly users to test this further. Current state is=
 a prototype only and is not interesting enough for any of our users. Unles=
s we can add parsing of more data and more salient data and create products=
 it is difficult engage end users in any meaningful way.

Perhaps if this is deployed somewhere in Indiana it may be easier to move f=
orward. If you need more data please let me know where I should locate it f=
or you to access.

Thanks,
Sudhakar.


On Dec 11, 2014, at 4:11 AM, Supun Nakandala <supun.nakandala@gmail.com<mai=
lto:supun.nakandala@gmail.com>> wrote:

Hi All,

We had the mid evaluation of the project last Tuesday and the following con=
cerns were raised.

  1.  The lack of visibility of the overall solution in the project demonst=
ration.
  2.  The ability to come up with a solution where, scientist who does not =
have a background in computer science can create new parsers (metadata extr=
action logic)

The project was demonstrated using the web interface that we developed. For=
 the final evaluation we expect to demonstrate the system using laravel PHP=
 Reference Gateway running in a production server and demonstrate how a new=
 data product that gets generated will be identified, indexed and will be a=
vailable for searching and hope this will handle the first issue.

We also had a meeting with Dr. Dilum our internal supervisor where we ident=
ified things that can be done from now to 15th January, the expected projec=
t completion date

  1.  Do a proper performance test and publish a paper before final marks f=
or the project is finalized (marks will be finalized by the end of March).
  2.  Getting to work more parsers, so that Sudhakar can ask more users to =
use the system. This will help to get more feedback on the system and have =
a real world usage.
  3.  Implement the support for provenance aware workflow execution in Aira=
vata using our system.

We have written a draft paper which I have attached here with. We showed th=
is to Dr. Srinath and Dr Dilum and they suggested that we do a proper perfo=
rmance testing (The one that already done is not up to the expected standar=
ds). Given the available time we need to prioritize our work and select a s=
et of tasks that is doable and has the most impact. What do you all think?

Draft Paper: https://docs.google.com/document/d/1PLfST6hLygQpsr4RlgiDoffmDE=
wMOWbmb1WZ0uKTtd8/edit#heading=3Dh.6fjqfavj2nov

Literature Review: https://drive.google.com/file/d/0B0cLF-CLa59oaXRBazF1aUR=
vQTg/view?usp=3Dsharing

Supun


--_000_DBE83381C3C54E7EAA8763A723AFCBC3illinoisedu_
Content-Type: text/html; charset="us-ascii"
Content-ID: <7E598C842B5CF44EB758C4D9AD1C5339@mx.uillinois.edu>
Content-Transfer-Encoding: quoted-printable

<html>
<head>
<meta http-equiv=3D"Content-Type" content=3D"text/html; charset=3Dus-ascii"=
>
</head>
<body style=3D"word-wrap: break-word; -webkit-nbsp-mode: space; -webkit-lin=
e-break: after-white-space;">
Supun:
<div>I support these goals. I am available for you to engage with you and a=
nything I can do to expedite the project please let me know. Even if you th=
ink I can not, &nbsp;do please ask anyway.&nbsp;</div>
<div>I have written the original parsers in perl myself and have directed o=
thers when the Cup/JFLex system was put in place. I have generated and modi=
fied the CUP/JFlex code before, so I am familiar with how it works. I will =
look at the paper and may suggest
 &nbsp;additions.&nbsp;</div>
<div><br>
</div>
<div>I can not test the system now by adding more data and see if this can =
parse the new data. How can we get to that point. This is critical first st=
ep for me before I can ask friendly users to test this further. Current sta=
te is a prototype only and is not
 interesting enough for any of our users. Unless we can add parsing of more=
 data and more salient data and create products it is difficult engage end =
users in any meaningful way.&nbsp;</div>
<div><br>
</div>
<div>Perhaps if this is deployed somewhere in Indiana it may be easier to m=
ove forward. If you need more data please let me know where I should locate=
 it for you to access.</div>
<div><br>
</div>
<div>Thanks,</div>
<div>Sudhakar.</div>
<div><br>
</div>
<div><br>
<div>
<div>On Dec 11, 2014, at 4:11 AM, Supun Nakandala &lt;<a href=3D"mailto:sup=
un.nakandala@gmail.com">supun.nakandala@gmail.com</a>&gt; wrote:</div>
<br class=3D"Apple-interchange-newline">
<blockquote type=3D"cite">
<div dir=3D"ltr">
<div class=3D"gmail_signature">Hi All,</div>
<div class=3D"gmail_signature"><br>
</div>
<div class=3D"gmail_signature">We had the mid evaluation of the project las=
t Tuesday and the following concerns were raised.</div>
<div class=3D"gmail_signature">
<ol>
<li>The lack of visibility of the overall solution in the project demonstra=
tion.</li><li>The ability to come up with a solution where, scientist who d=
oes not have a background in computer science can create new parsers (metad=
ata extraction logic)</li></ol>
<div>The project was demonstrated using the web interface that we developed=
. For the final evaluation we expect to demonstrate the system using larave=
l PHP Reference Gateway running in a production server and demonstrate how =
a new data product that gets generated
 will be identified, indexed and will be available for searching and hope t=
his will handle the first issue.</div>
<div><br>
</div>
<div>We also had a meeting with Dr. Dilum our internal supervisor where we =
identified things that can be done from now to 15th January, the expected p=
roject completion date</div>
<div>
<ol>
<li>Do a proper performance test and publish a paper before final marks for=
 the project is finalized (marks will be finalized by the end of March).</l=
i><li>Getting to work more parsers, so that Sudhakar can ask more users to =
use the system. This will help to get more feedback on the system and have =
a real world usage.</li><li>Implement the support for provenance aware work=
flow execution in Airavata using our system.</li></ol>
<div>We have written a draft paper which I have attached here with. We show=
ed this to Dr. Srinath and Dr Dilum and they suggested that we do a proper =
performance testing (The one that already done is not up to the expected st=
andards). Given the available time
 we need to prioritize our work and select a set of tasks that is doable an=
d has the most impact. What do you all think?</div>
</div>
<div><br>
</div>
<div>Draft Paper:&nbsp;<a href=3D"https://docs.google.com/document/d/1PLfST=
6hLygQpsr4RlgiDoffmDEwMOWbmb1WZ0uKTtd8/edit#heading=3Dh.6fjqfavj2nov">https=
://docs.google.com/document/d/1PLfST6hLygQpsr4RlgiDoffmDEwMOWbmb1WZ0uKTtd8/=
edit#heading=3Dh.6fjqfavj2nov</a></div>
<div><br>
</div>
<div>Literature Review:&nbsp;<a href=3D"https://drive.google.com/file/d/0B0=
cLF-CLa59oaXRBazF1aURvQTg/view?usp=3Dsharing">https://drive.google.com/file=
/d/0B0cLF-CLa59oaXRBazF1aURvQTg/view?usp=3Dsharing</a></div>
<div><br>
</div>
<div>Supun</div>
</div>
</div>
</blockquote>
</div>
<br>
</div>
</body>
</html>

--_000_DBE83381C3C54E7EAA8763A723AFCBC3illinoisedu_--