reef-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Markus Weimer <mar...@weimo.de>
Subject Re: new runtime: stand-alone distributed runtime?
Date Thu, 31 Dec 2015 07:53:49 GMT
On 2015-12-30 23:37, John Yang wrote:
> Regarding option 2, how do you plan to retrieve the exit status of an
> Evaluator? REEF relies on the underlying resource manager layer to
> report unclean Evaluator exits(e.g., failure, preemption).

Excellent question. I haven't thought about it. While all machines are 
running and have life SSH connections, this should be "easy". It becomes 
tricky of the connections are interrupted. At that time, a SSH runtime 
could just declare the Evaluator `failed`, right?

Markus


Mime
View raw message