mesos-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Benjamin Mahler <benjamin.mah...@gmail.com>
Subject Re: Slave aborting
Date Fri, 28 Jun 2013 21:02:58 GMT
Thanks for the report, will send out a fix shortly!

The bug was introduced here:
https://reviews.apache.org/r/12074/diff/#1.2


On Fri, Jun 28, 2013 at 1:58 PM, Brenden Matthews <
brenden.matthews@airbedandbreakfast.com> wrote:

> Hey guys,
>
> I'm having the following problem right now:
>
> I0628 20:38:41.354115  7628 slave.cpp:2529] Current usage 30.15%. Max
> > allowed age: 4.189215180656065days
> > I0628 20:39:41.355823  7619 slave.cpp:2529] Current usage 30.18%. Max
> > allowed age: 4.187323886213797days
> > I0628 20:40:41.357043  7623 slave.cpp:2529] Current usage 30.21%. Max
> > allowed age: 4.185234990470278days
> > I0628 20:41:41.358212  7613 slave.cpp:2529] Current usage 30.24%. Max
> > allowed age: 4.183099359015254days
> > I0628 20:42:41.360421  7631 slave.cpp:2529] Current usage 30.27%. Max
> > allowed age: 4.180847453502627days
> > I0628 20:43:41.362519  7610 slave.cpp:2529] Current usage 30.30%. Max
> > allowed age: 4.178742828462974days
> > I0628 20:44:41.364753  7613 slave.cpp:2529] Current usage 30.34%. Max
> > allowed age: 4.176281597348264days
> > Result::get() but state == NONE
> > *** Aborted at 1372452289 (unix time) try "date -d @1372452289" if you
> are
> > using GNU date ***
> > PC: @     0x7f4e64237425 (unknown)
> > *** SIGABRT (@0x1d9e) received by PID 7582 (TID 0x7f4e58a6f700) from PID
> > 7582; stack trace: ***
> >     @     0x7f4e645cfcb0 (unknown)
> >     @     0x7f4e64237425 (unknown)
> >     @     0x7f4e6423ab8b (unknown)
> >     @     0x7f4e650d47ba mesos::internal::slave::ProcessIsolator::usage()
> >     @     0x7f4e6505f5b0 std::tr1::_Function_handler<>::_M_invoke()
> >     @     0x7f4e6506d526 process::internal::pdispatcher<>()
> >     @     0x7f4e65060808 std::tr1::_Function_handler<>::_M_invoke()
> >     @     0x7f4e6525215c process::ProcessManager::resume()
> >     @     0x7f4e65252dbc process::schedule()
> >     @     0x7f4e645c7e9a start_thread
> >     @     0x7f4e642f4ccd (unknown)
> > I0628 20:44:52.014605 21274 main.cpp:119] Creating "process" isolator
> > I0628 20:44:52.015563 21274 main.cpp:127] Build: 2013-06-27 22:40:04 by
> > I0628 20:44:52.015847 21274 main.cpp:128] Starting Mesos slave
>
>
> The stack trace shows it blows up at 'src/slave/process_isolator.cpp:420'.
>  Thoughts?
>
> For the time being I've made a copy of the os::Process from process.get().
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message