Return-Path: X-Original-To: apmail-oodt-dev-archive@www.apache.org Delivered-To: apmail-oodt-dev-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 17473114C4 for ; Fri, 19 Sep 2014 21:41:01 +0000 (UTC) Received: (qmail 55471 invoked by uid 500); 19 Sep 2014 21:41:01 -0000 Delivered-To: apmail-oodt-dev-archive@oodt.apache.org Received: (qmail 55441 invoked by uid 500); 19 Sep 2014 21:41:01 -0000 Mailing-List: contact dev-help@oodt.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@oodt.apache.org Delivered-To: mailing list dev@oodt.apache.org Received: (qmail 55418 invoked by uid 99); 19 Sep 2014 21:41:00 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 19 Sep 2014 21:41:00 +0000 X-ASF-Spam-Status: No, hits=1.5 required=5.0 tests=AC_DIV_BONANZA,HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of brbarkstrom@gmail.com designates 74.125.82.45 as permitted sender) Received: from [74.125.82.45] (HELO mail-wg0-f45.google.com) (74.125.82.45) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 19 Sep 2014 21:40:56 +0000 Received: by mail-wg0-f45.google.com with SMTP id l18so295954wgh.28 for ; Fri, 19 Sep 2014 14:40:35 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; bh=4qv89BXpbxcTboUOamHg1T2BVrVQlZRY0Dfh8X0GfnU=; b=0TiIVX0thy8j8IuSKdGjnVUoLk1NnEFclKj3yLgYLuiXZ9UE+dr1faXyZ+YlVmfy3X 7B4lzt9d+ZtVnGrjDi7mqANzt8EG2icQHnSY7h5VAevPD3ap5apDrliMYVJTXEnViWHJ YgTrl5X+PElbMtKwwhWe0p9BBAeoF1iEk3dPHJdALzXW24NaLBKF0O/WUX4dNMjsKJ8g SRu6K9jzCcE4fvA7jyRwrx5i8gmff5irx+Kx9ENrN/+6/1SW0/eF6K2xojPN2aMSE5ra 4WwOBFNwFvuvggz3pFH8VI2OmgDfGihTyHSwwFSmseeNjRaorwW6ac/xF3ruQBcugeP8 XruA== MIME-Version: 1.0 X-Received: by 10.194.58.108 with SMTP id p12mr3703504wjq.71.1411162835358; Fri, 19 Sep 2014 14:40:35 -0700 (PDT) Received: by 10.194.154.165 with HTTP; Fri, 19 Sep 2014 14:40:35 -0700 (PDT) In-Reply-To: References: <4B3A58EE-54EF-4FE9-A3DA-8EE876299430@apache.org> Date: Fri, 19 Sep 2014 17:40:35 -0400 Message-ID: Subject: Re: Evaluate Suitable Scientific Workflow Language for Airavata. From: Bruce Barkstrom To: dev@oodt.apache.org, alekny7@gmail.com Content-Type: multipart/alternative; boundary=047d7ba977fc859b45050371f35f X-Virus-Checked: Checked by ClamAV on apache.org --047d7ba977fc859b45050371f35f Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable Thanks very much for the documentation. Just glancing through it, I'm not sure I'd describe production by embedding a particular language in the basic description. In the large-scale production systems I architected for NASA's ERBE and CERES production, I started with a Data Flow Diagram. Now, I'd probably move to UML before getting close to a language- specific design. A key part of our work was two years getting an agreement on the naming convention of the files that tied in to the multiple instruments, multiple products, and multiple versions of each element in the provenance. I did a formal description of those structures in an ESI paper (I guess 2009) on configuration management in large-scale satellite data production. That paper contains some example figures of the provenance graphs - which are probably easiest to visualize if you get the pdf version and view the graphs with a zoom of 2800% or so. One point that was important is that our production flow was not completely automated - and humans had to invoke the proper connection between input files, various pieces of production executables, and output files - including variations from one month to the next a as well as match*ing *which satellite was involved (plus some other configuration issues). In the long run, we also needed to build production schedules so the archive could check quality control on the proper running of the process. That work required interaction between objects and staff on the archive and the science teams, which makes for the usual schedule complications that are an intrinsic part of production planning on big projects. Also, in some work I'm doing currently, I am quite concerned about user search states (or production states) that come from a Markov model that allows simulation of user loadings on an archive. I haven't gone into full detail on this yet - but it may mean that the interaction between the user and the archive needs to keep track of the state of both agents. The basic notion for this work is that the archive and the user are playing a two agent cooperative game where the archive knows what it contains and the user knows what his or her target is. I think that requires maintaining agent states across multiple sessions. I don't recall whether that violates the original source of the RESTFUL design, so I'll need to go back to the original to check. I'll try to look at what you've sent in more detail and provide some more comments. Regardless, I appreciate the information. Bruce B. On Fri, Sep 19, 2014 at 5:01 PM, Aleksander Slominski (NY) < alekny7@gmail.com> wrote: > Hi, > > it is not dataflow instead focused on orchestrating REST services but you > may find it useful datapoint - we created worfklow service that uses > natively JavaScript and JSON to describe what happens during workflow > execution: > https://www.ng.bluemix.net/docs/#services/workflow/index.html#coewf002 > > HTH, > > Alek > > On Thu, Sep 18, 2014 at 1:54 PM, Suresh Marru wrote: > > > Hi Chris, > > > > Great to hear OODT community will be interested in adopting a JSON base= d > > workflow language and potentially a web based composer as well. Airavat= a > > previously had BPEL support initially through a home grown implementati= on > > [1] by Alek Slominski and later through Apache ODE [2]. Also a white > paper > > [3] by Alek on this topic is an interesting read. > > > > I am of the same opinion that we should adopt something more modern as > the > > challenges from scientific workflows seems to be converging with the da= ta > > flow patterns in business workflows. > > > > It will be great if we can all compile a list of potential candidates a= nd > > hack them through. > > > > Suresh > > [1] - > > http://link.springer.com/chapter/10.1007%2F978-1-84628-757-2_14#page-1 > > [2] - > > > http://www.academia.edu/1485773/Experience_with_adapting_a_WS-BPEL_runtim= e_for_eScience_workflows > > [3] - > > > http://www.computer.org/csdl/proceedings/services/2010/4129/00/4129a326.p= df > > > > > > On Sep 18, 2014, at 1:15 PM, Mattmann, Chris A (3980) < > > chris.a.mattmann@jpl.nasa.gov> wrote: > > > > > Hi Guys, > > > > > > I've been interested in this too - we don't per have a specific > > > OODT workflow language, but we specific workflows using XML, and > > > other configuration (we are also thinking of moving to JSON for > > > this). > > > > > > In the past I've also looked at YAWL and BPEL - both seem complex > > > to me. > > > > > > I wonder at the end of the day if we should adopt something more > > > modern like PIG or some other data flow type of language (PIG > > > is really neat). > > > > > > Cheers, > > > Chris > > > > > > > > > ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ > > > Chris Mattmann, Ph.D. > > > Chief Architect > > > Instrument Software and Science Data Systems Section (398) > > > NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA > > > Office: 168-519, Mailstop: 168-527 > > > Email: chris.a.mattmann@nasa.gov > > > WWW: http://sunset.usc.edu/~mattmann/ > > > ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ > > > Adjunct Associate Professor, Computer Science Department > > > University of Southern California, Los Angeles, CA 90089 USA > > > ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ > > > > > > > > > > > > > > > > > > > > > -----Original Message----- > > > From: Shameera Rathnayaka > > > Reply-To: "architecture@airavata.apache.org" > > > > > > Date: Thursday, September 18, 2014 8:26 AM > > > To: "architecture@airavata.apache.org" < > architecture@airavata.apache.org > > >, > > > dev > > > Subject: Evaluate Suitable Scientific Workflow Language for Airavata. > > > > > >> Hi All, > > >> > > >> As we all know Airavata has its own workflow language call XWF. When > XWF > > >> was introduced, main focus points are interoperability and > > convertibility. > > >> But with years of experience it is convinced that above requirements > are > > >> not really useful when we come to real world use cases. And XWF is X= ML > > >> based bulky language where we attache WSDLs and Workflow image it > self. > > >> But > > >> with the recent changes WSDL part is being removed from XWF. > > >> > > >> It is worth to evaluate handy Scientific workflow languages in > industry > > >> and > > >> find out pros and cons, at the end of this evaluation we need to com= e > up > > >> with idea how we should improve Airavata workflow language, either w= e > > can > > >> improve existing XWF language, totally change to a new language > > available > > >> in industry or write a new light weight language. Basic requirements > > that > > >> we expect from new improvement are, high usability, flexible, light > > weight > > >> and real time monitoring support. As you can see above requirements > are > > >> not > > >> direct comes with workflow languages but we need workflow language > which > > >> help to support above requirements. > > >> > > >> After reading few papers and googling, initially i have come up with > > >> following three existing languages, > > >> 1. YAWL > > >> 2. WS-BPEL > > >> =E2=80=8B3. SIDL > > >> > > >> > > >> In my opinion SIDL is more familiar with scientific domain, > Radical-SAGA > > >> also uses slightly modified version of SIDL. Other than above three > > >> languages we can come up with simple workflow language base on json(= or > > >> yaml) which support all our requirements for some extends. > > >> > > >> It would be grate if I can get more input regarding the $Subject for= m > > the > > >> airavata community. You all are more than welcome to provide any typ= e > of > > >> suggestions. > > >> > > >> Thanks, > > >> Shameera. > > >> > > >> =E2=80=8B > > >> > > >> -- > > >> Best Regards, > > >> Shameera Rathnayaka. > > >> > > >> email: shameera AT apache.org , shameerainfo AT gmail.com > > >> Blog : http://shameerarathnayaka.blogspot.com/ > > > > > > > -- > The best way to predict the future is to invent it - Alan Kay > --047d7ba977fc859b45050371f35f--