oodt-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Mattmann, Chris A (388J)" <chris.a.mattm...@jpl.nasa.gov>
Subject Re: OODT Workflow Wiki
Date Sat, 07 Apr 2012 04:01:58 GMT
Hi Mike,

Thanks, what a great page!

I noticed this comment in the page:

"At the time of this writing, jobs that cannot be added to the queue disappear...."

I think we should be more clear than "disappear". They don't disappear. The 
Scheduler will try and send a Job to the BatchMgr, and if there is an exception,
it tries to re-queue the Job back onto the JobStack. If it's unable to do that, then
there is an issue, but it at the very least tries to re-queue the job if there was an

Also, in general, you will have as many jobs queued in Resource Manager land
as the size of that job stack. So we should probably note that.

Great resource here, thanks for putting it together!


On Apr 5, 2012, at 8:43 AM, Cayanan, Michael D (388J) wrote:

> Hi all,
> I recently added a page to the OODT wiki:
> https://cwiki.apache.org/confluence/display/OODT/Workflow+Manager+Help
> I ran into some issues with the Workflow hanging up and also Workflow jobs being lost
when trying to send them off to the Resource Manager and just wanted to share what I learned
and what to do if you run into these issues as well.
> Cheers,
> Mike

Chris Mattmann, Ph.D.
Senior Computer Scientist
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 171-266B, Mailstop: 171-246
Email: chris.a.mattmann@nasa.gov
WWW:   http://sunset.usc.edu/~mattmann/
Adjunct Assistant Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA

View raw message