Return-Path: X-Original-To: apmail-oodt-dev-archive@www.apache.org Delivered-To: apmail-oodt-dev-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 88D7F9ED6 for ; Wed, 9 May 2012 16:06:48 +0000 (UTC) Received: (qmail 69231 invoked by uid 500); 9 May 2012 16:06:48 -0000 Delivered-To: apmail-oodt-dev-archive@oodt.apache.org Received: (qmail 69195 invoked by uid 500); 9 May 2012 16:06:48 -0000 Mailing-List: contact dev-help@oodt.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@oodt.apache.org Delivered-To: mailing list dev@oodt.apache.org Received: (qmail 69183 invoked by uid 99); 9 May 2012 16:06:48 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 09 May 2012 16:06:48 +0000 X-ASF-Spam-Status: No, hits=-2.3 required=5.0 tests=RCVD_IN_DNSWL_MED,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: local policy) Received: from [128.149.139.109] (HELO mail.jpl.nasa.gov) (128.149.139.109) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 09 May 2012 16:06:42 +0000 Received: from mail.jpl.nasa.gov (ap-ehub-sp02.jpl.nasa.gov [128.149.137.149]) by smtp.jpl.nasa.gov (Sentrion-MTA-4.2.2/Sentrion-MTA-4.2.2) with ESMTP id q49G6Fpb011588 (using TLSv1/SSLv3 with cipher AES128-SHA (128 bits) verified NO) for ; Wed, 9 May 2012 09:06:21 -0700 Received: from AP-EMBX-SP40.RES.AD.JPL ([169.254.7.245]) by ap-ehub-sp02.RES.AD.JPL ([fe80::dd85:7b07:1e36:7e3c%15]) with mapi id 14.02.0298.004; Wed, 9 May 2012 09:06:20 -0700 From: "Mattmann, Chris A (388J)" To: "" Subject: Re: Resource Manager client question Thread-Topic: Resource Manager client question Thread-Index: AQHNLNlddGrXGOMyXUabvkDBZVeUdZa/2CqAgAHCyzeAAHwHgA== Date: Wed, 9 May 2012 16:06:19 +0000 Message-ID: <3E626F79-74CC-4B05-B011-4BE064EE3006@jpl.nasa.gov> References: In-Reply-To: Accept-Language: en-US Content-Language: en-US X-MS-Has-Attach: X-MS-TNEF-Correlator: x-originating-ip: [128.149.137.114] Content-Type: text/plain; charset="us-ascii" Content-ID: <3F8574A80AABCD4B99FD74374E839674@ad.jpl> Content-Transfer-Encoding: quoted-printable MIME-Version: 1.0 X-Source-Sender: chris.a.mattmann@jpl.nasa.gov X-AUTH: Authorized X-Virus-Checked: Checked by ClamAV on apache.org Hey Mike, Anytime! Would be happy to help more as you guys progress. Take care and keep rockin' on! Cheers, Chris On May 9, 2012, at 8:42 AM, Iwunze, Michael C (GSFC-4700)[NOAA-JPSS] wrote: > Thanks Chris, this was good information. >=20 > On 5/8/12 1:48 AM, "Mattmann, Chris A" > wrote: >=20 >> Hey Cam, >>=20 >> Thanks, some comments below: >>=20 >> On May 7, 2012, at 8:26 PM, Cameron Goodale wrote: >>=20 >>> Hey Mike and Sheryl, >>>=20 >>> Mike was asking me for some similar advice and I plain ran outta talent= on >>> this topic. From what I can tell Mike would like to run his python scr= ipts >>> on Resource Manager without the need for setting up Workflow or PGE. >>>=20 >>> At the time I hadn't really thought through all the configuration files >>> needed, but having stewed on it I thought I should reply. Now my curre= nt >>> SnowDS implementation is to have the Workflow Task reference a CAS-PGE >>> (which contains the execution block for my python program i want to run= ). >>> Then the Workflow is merely configured to farm the jobs out to the >>> Resource Manager. >>>=20 >>> Here is a list of questions that I have started to wonder about with Mi= ke's >>> help, any answers would be appreciated: >>>=20 >>> 1. Can Resource Manager + Batchstubs be used without any additional OO= DT >>> components? >>>=20 >>=20 >> Yep one way to see this in action is to run the >> org.apache.oodt.cas.resource.tools.JobSubmitter >> tool by cd'ing into a resource manager deployment (let's assume >> /usr/local/resmgr/bin) and then >> running: >>=20 >> java -Djava.ext.dirs=3D../lib org.apache.oodt.cas.resource.tools.JobSubm= itter >>=20 >> Which produces: >>=20 >> JobSubmitter --rUrl [options] >> --file >> [--dir ] >>=20 >> This will let you submit a resource manager XML "job file" looks like th= is: >>=20 >> http://svn.apache.org/repos/asf/oodt/trunk/resource/src/main/resources/e= xample >> s/jobs/exJob.xml >>=20 >> Key parameters there are: >>=20 >> Name - the human readable name of the job >> Id - the id of the job >> Instance Class - the JobInstance >> Input Class - specification for how to read/write input for the job, wi= th >> properties >>=20 >> That being said, interfacing with the resource manager at this level wou= ld be >> a lot harder >> than simply running workflows, which yes, is the more developer/user fri= endly >> interface >> for specifying tasks to run, which get turned into jobs in resource mana= ger >> ville. >>=20 >>=20 >>> 2. Is PGE required to run/wrap non-Java programs so they can run withi= n >>> Resource Manager? >>=20 >> Well, PGE doesn't directly run in Resource Manager. All workflow tasks a= re >> submitted >> to Resource Manager using the TaskJob, and TaskJobInput constructs: >>=20 >> http://s.apache.org/I6S >> http://s.apache.org/8F1 >>=20 >>=20 >>>=20 >>> Closing comments to Mike: >>>=20 >>> If you are planning to use OODT for data management, it >>> is initially very tempting to only setup and configure the minimal set = of >>> components because you will feel productive and it feels like progress = is >>> being made. Trust me I know since I was in your shoes about 6 months a= go >>> when trying to get some image processing IDL code to run and I bably ne= eded >>> to see progress (notice I didn't use the works "make progress"). Becau= se I >>> wanted to use (what I thought was) the "easier" solution I ended up >>> hardcoding paths to resources my python code needed in the code instead= of >>> passing the parameters into the code in the first place. This worked >>> reasonably well as long as everything stayed the same....but then it di= dn't >>> so I had to re-visit my "easier" setup and fix it. >>> Recently I have been working to undo my mistakes and python has been >>> very forgiving, but the best part was that all the strange and mystic >>> Workflow setups and PGEConfig.xml files actually started to make a whol= e >>> lot more sense. I am now able to configure and stand up a complete >>> workflow config, then jump into PGEConfig and get the input parameters = to >>> my python code. This means if the input files i need to process change= s I >>> don't need to change my python code, instead I can merely pass in a >>> different set of parameters into the workflow and they will persist to = my >>> wrapped python. >>> In short I know that combing through all the xml config is tough, >>> especially when things are not working as quickly as you would like. I >>> understand how defeated and frustrating it can be to have the component >>> fail and just feel lost, not knowing what is causing the problem. I kn= ow >>> the documentation isn't perfect and sometimes it is missing altogether,= but >>> the people that are on this list will bend over backwards to help you >>> understand (some will even share their config files with line-by-line >>> comments included at no extra charge ;) >>>=20 >>> Thank you Sheryl for being awesome and helpful (you always are). Mike = keep >>> the questions coming and I will be sure to add in my $0.02 when I am ab= le >>> to. >>=20 >> +1. >>=20 >> Cheers, >> Chris >>=20 >>>=20 >>> On Mon, May 7, 2012 at 5:09 PM, Sheryl John wrote: >>>=20 >>>> Hi Mike, >>>>=20 >>>> Yup, you can run your python scripts, java programs etc. from CAS-PGE = which >>>> is used with the Workflow Manager. Check out this cas-pge guide [1] an= d the >>>> other wiki pages related to workflow. >>>>=20 >>>> You can use Resource Manager to run tasks sent from the Workflow Manag= er. >>>> I've recently started testing this but there are others on the list wh= o can >>>> guide you more on the Resource Manager. >>>>=20 >>>> HTH! >>>>=20 >>>> Sheryl >>>>=20 >>>> [1] https://cwiki.apache.org/OODT/cas-pge-learn-by-example.html >>>>=20 >>>>=20 >>>> On Mon, May 7, 2012 at 3:43 PM, Iwunze, Michael C (GSFC-4700)[NOAA-JPS= S] < >>>> michael.iwunze@nasa.gov> wrote: >>>>=20 >>>>>=20 >>>>> I have two questions, I am able to run the Resource Manager with no >>>>> issues. I have some python scripts and possibly some other programs = I >>>>> would like to run using the Resource Manager. From what I know so far= I >>>>> believe the cas-pge component needs to be used in conjunction with th= e >>>>> Resource Manager and is used as a wrapper program for running my scri= pts. >>>>> Can someone give me more information on how this can be accomplished = or >>>> are >>>>> there any examples to view? >>>>>=20 >>>>> I would also like to be able to utilize the Job Scheduler, Monitor an= d >>>>> Job queue classes that are part of the Resource Manager. I can't find= any >>>>> examples of how they are used anywhere. And if examples do exist can >>>>> someone point me in the right direction or give me more information o= n >>>> this? >>>>>=20 >>>>> Thanks >>>>>=20 >>>>> Mike >>>>=20 >>>>=20 >>>>=20 >>>>=20 >>>> -- >>>> -Sheryl >>>>=20 >>=20 >>=20 >> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ >> Chris Mattmann, Ph.D. >> Senior Computer Scientist >> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA >> Office: 171-266B, Mailstop: 171-246 >> Email: chris.a.mattmann@nasa.gov >> WWW: http://sunset.usc.edu/~mattmann/ >> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ >> Adjunct Assistant Professor, Computer Science Department >> University of Southern California, Los Angeles, CA 90089 USA >> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ >>=20 >=20 ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ Chris Mattmann, Ph.D. Senior Computer Scientist NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA Office: 171-266B, Mailstop: 171-246 Email: chris.a.mattmann@nasa.gov WWW: http://sunset.usc.edu/~mattmann/ ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ Adjunct Assistant Professor, Computer Science Department University of Southern California, Los Angeles, CA 90089 USA ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++