mesos-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Vinod Kone (JIRA)" <j...@apache.org>
Subject [jira] [Created] (MESOS-1262) MultipleExecutorsTest.TasksExecutorInfoDiffers is flaky
Date Mon, 28 Apr 2014 22:29:16 GMT
Vinod Kone created MESOS-1262:
---------------------------------

             Summary: MultipleExecutorsTest.TasksExecutorInfoDiffers is flaky
                 Key: MESOS-1262
                 URL: https://issues.apache.org/jira/browse/MESOS-1262
             Project: Mesos
          Issue Type: Bug
          Components: test
            Reporter: Vinod Kone
             Fix For: 0.19.0


[ RUN      ] MultipleExecutorsTest.TasksExecutorInfoDiffers
I0428 19:05:48.540652 24547 leveldb.cpp:174] Opened db in 20.021997ms
I0428 19:05:48.541288 24547 leveldb.cpp:181] Compacted db in 562623ns
I0428 19:05:48.541317 24547 leveldb.cpp:196] Created db iterator in 8388ns
I0428 19:05:48.541326 24547 leveldb.cpp:202] Seeked to beginning of db in 1139ns
I0428 19:05:48.541334 24547 leveldb.cpp:271] Iterated through 0 keys in the db in 484ns
I0428 19:05:48.541360 24547 replica.cpp:729] Replica recovered with log positions 0 ->
0 with 1 holes and 0 unlearned
I0428 19:05:48.541730 24584 recover.cpp:425] Starting replica recovery
I0428 19:05:48.541844 24584 recover.cpp:451] Replica is in EMPTY status
I0428 19:05:48.542312 24584 replica.cpp:626] Replica in EMPTY status received a broadcasted
recover request
I0428 19:05:48.542394 24584 recover.cpp:188] Received a recover response from a replica in
EMPTY status
I0428 19:05:48.542531 24584 recover.cpp:542] Updating replica status to STARTING
I0428 19:05:48.543365 24584 leveldb.cpp:304] Persisting metadata (8 bytes) to leveldb took
757883ns
I0428 19:05:48.543386 24584 replica.cpp:320] Persisted replica status to STARTING
I0428 19:05:48.543460 24584 recover.cpp:451] Replica is in STARTING status
I0428 19:05:48.543798 24584 replica.cpp:626] Replica in STARTING status received a broadcasted
recover request
I0428 19:05:48.543859 24584 recover.cpp:188] Received a recover response from a replica in
STARTING status
I0428 19:05:48.543967 24584 recover.cpp:542] Updating replica status to VOTING
I0428 19:05:48.544258 24584 leveldb.cpp:304] Persisting metadata (8 bytes) to leveldb took
239912ns
I0428 19:05:48.544271 24584 replica.cpp:320] Persisted replica status to VOTING
I0428 19:05:48.544307 24584 recover.cpp:556] Successfully joined the Paxos group
I0428 19:05:48.544369 24584 recover.cpp:440] Recover process terminated
I0428 19:05:48.545640 24584 master.cpp:266] Master 20140428-190548-143311683-40673-24547 (minerva.apache.org)
started on 67.195.138.8:40673
I0428 19:05:48.545666 24584 master.cpp:303] Master only allowing authenticated frameworks
to register
I0428 19:05:48.545673 24584 master.cpp:308] Master only allowing authenticated slaves to register
I0428 19:05:48.545680 24584 credentials.hpp:35] Loading credentials for authentication
W0428 19:05:48.545733 24584 credentials.hpp:48] Failed to stat credentials file 'file:///tmp/MultipleExecutorsTest_TasksExecutorInfoDiffers_zyeSd0/credentials':
No such file or directory
I0428 19:05:48.546339 24584 hierarchical_allocator_process.hpp:302] Initializing hierarchical
allocator process with master : master@67.195.138.8:40673
I0428 19:05:48.546376 24584 master.cpp:104] No whitelist given. Advertising offers for all
slaves
I0428 19:05:48.546589 24584 master.cpp:922] The newly elected leader is master@67.195.138.8:40673
with id 20140428-190548-143311683-40673-24547
I0428 19:05:48.546600 24584 master.cpp:932] Elected as the leading master!
I0428 19:05:48.546608 24584 master.cpp:753] Recovering from registrar
I0428 19:05:48.546699 24584 registrar.cpp:275] Recovering registrar
I0428 19:05:48.547029 24584 log.cpp:656] Attempting to start the writer
I0428 19:05:48.547441 24584 replica.cpp:474] Replica received implicit promise request with
proposal 1
I0428 19:05:48.547695 24584 leveldb.cpp:304] Persisting metadata (8 bytes) to leveldb took
239042ns
I0428 19:05:48.547708 24584 replica.cpp:342] Persisted promised to 1
I0428 19:05:48.547904 24584 coordinator.cpp:229] Coordinator attemping to fill missing position
I0428 19:05:48.548322 24584 replica.cpp:375] Replica received explicit promise request for
position 0 with proposal 2
I0428 19:05:48.548477 24584 leveldb.cpp:341] Persisting action (8 bytes) to leveldb took 138289ns
I0428 19:05:48.548493 24584 replica.cpp:664] Persisted action at 0
I0428 19:05:48.559414 24586 replica.cpp:508] Replica received write request for position 0
I0428 19:05:48.559535 24586 leveldb.cpp:436] Reading position from leveldb took 46649ns
I0428 19:05:48.559965 24586 leveldb.cpp:341] Persisting action (14 bytes) to leveldb took
413113ns
I0428 19:05:48.559980 24586 replica.cpp:664] Persisted action at 0
I0428 19:05:48.560171 24586 replica.cpp:643] Replica received learned notice for position
0
I0428 19:05:48.560372 24586 leveldb.cpp:341] Persisting action (16 bytes) to leveldb took
186478ns
I0428 19:05:48.560385 24586 replica.cpp:664] Persisted action at 0
I0428 19:05:48.560395 24586 replica.cpp:649] Replica learned NOP action at position 0
I0428 19:05:48.560659 24586 log.cpp:672] Writer started with ending position 0
I0428 19:05:48.561046 24586 leveldb.cpp:436] Reading position from leveldb took 11812ns
I0428 19:05:48.562821 24586 registrar.cpp:308] Successfully recovered registrar
I0428 19:05:48.562868 24586 registrar.cpp:379] Attempting to update the 'registry'
I0428 19:05:48.564551 24586 log.cpp:680] Attempting to append 137 bytes to the log
I0428 19:05:48.564638 24586 coordinator.cpp:339] Coordinator attempting to write APPEND action
at position 1
I0428 19:05:48.565008 24586 replica.cpp:508] Replica received write request for position 1
I0428 19:05:48.571110 24586 leveldb.cpp:341] Persisting action (156 bytes) to leveldb took
6.053569ms
I0428 19:05:48.571272 24586 replica.cpp:664] Persisted action at 1
I0428 19:05:48.579241 24585 replica.cpp:643] Replica received learned notice for position
1
I0428 19:05:48.579629 24585 leveldb.cpp:341] Persisting action (158 bytes) to leveldb took
332299ns
I0428 19:05:48.579644 24585 replica.cpp:664] Persisted action at 1
I0428 19:05:48.579656 24585 replica.cpp:649] Replica learned APPEND action at position 1
I0428 19:05:48.580095 24585 registrar.cpp:427] Successfully updated 'registry'
I0428 19:05:48.580178 24585 log.cpp:699] Attempting to truncate the log to 1
I0428 19:05:48.580258 24585 master.cpp:780] Recovered 0 slaves from the Registry (99B) ; allowing
10mins for slaves to re-register
I0428 19:05:48.580320 24585 coordinator.cpp:339] Coordinator attempting to write TRUNCATE
action at position 2
I0428 19:05:48.580610 24585 replica.cpp:508] Replica received write request for position 2
I0428 19:05:48.580724 24585 leveldb.cpp:341] Persisting action (16 bytes) to leveldb took
99788ns
I0428 19:05:48.580735 24585 replica.cpp:664] Persisted action at 2
I0428 19:05:48.580899 24585 replica.cpp:643] Replica received learned notice for position
2
I0428 19:05:48.580988 24585 leveldb.cpp:341] Persisting action (18 bytes) to leveldb took
78815ns
I0428 19:05:48.581009 24585 leveldb.cpp:399] Deleting ~1 keys from leveldb took 10110ns
I0428 19:05:48.581018 24585 replica.cpp:664] Persisted action at 2
I0428 19:05:48.581027 24585 replica.cpp:649] Replica learned TRUNCATE action at position 2
I0428 19:05:49.549613 24585 hierarchical_allocator_process.hpp:726] No resources available
to allocate!
I0428 19:05:49.549664 24585 hierarchical_allocator_process.hpp:688] Performed allocation for
0 slaves in 94678ns
I0428 19:05:50.550092 24585 hierarchical_allocator_process.hpp:726] No resources available
to allocate!
I0428 19:05:50.550153 24585 hierarchical_allocator_process.hpp:688] Performed allocation for
0 slaves in 78268ns
2014-04-28 19:05:51,026:24547(0x2b174c200700):ZOO_ERROR@handle_socket_error_msg@1697: Socket
[127.0.0.1:49011] zk retcode=-4, errno=111(Connection refused): server refused to accept the
client
I0428 19:05:51.550614 24585 hierarchical_allocator_process.hpp:726] No resources available
to allocate!
I0428 19:05:51.550695 24585 hierarchical_allocator_process.hpp:688] Performed allocation for
0 slaves in 125181ns
I0428 19:05:52.551409 24585 hierarchical_allocator_process.hpp:726] No resources available
to allocate!
I0428 19:05:52.551448 24585 hierarchical_allocator_process.hpp:688] Performed allocation for
0 slaves in 58922ns
I0428 19:05:53.547530 24584 master.cpp:104] No whitelist given. Advertising offers for all
slaves
I0428 19:05:53.552539 24589 hierarchical_allocator_process.hpp:726] No resources available
to allocate!
I0428 19:05:53.552562 24589 hierarchical_allocator_process.hpp:688] Performed allocation for
0 slaves in 77109ns
2014-04-28 19:05:54,362:24547(0x2b174c200700):ZOO_ERROR@handle_socket_error_msg@1697: Socket
[127.0.0.1:49011] zk retcode=-4, errno=111(Connection refused): server refused to accept the
client
I0428 19:05:54.553315 24590 hierarchical_allocator_process.hpp:726] No resources available
to allocate!
I0428 19:05:54.553380 24590 hierarchical_allocator_process.hpp:688] Performed allocation for
0 slaves in 80265ns
I0428 19:05:55.553812 24587 hierarchical_allocator_process.hpp:726] No resources available
to allocate!
I0428 19:05:55.553856 24587 hierarchical_allocator_process.hpp:688] Performed allocation for
0 slaves in 90273ns
I0428 19:05:56.558269 24589 hierarchical_allocator_process.hpp:726] No resources available
to allocate!
I0428 19:05:56.558323 24589 hierarchical_allocator_process.hpp:688] Performed allocation for
0 slaves in 71014ns
I0428 19:05:57.559363 24589 hierarchical_allocator_process.hpp:726] No resources available
to allocate!
I0428 19:05:57.559417 24589 hierarchical_allocator_process.hpp:688] Performed allocation for
0 slaves in 74060ns
2014-04-28 19:05:57,698:24547(0x2b174c200700):ZOO_ERROR@handle_socket_error_msg@1697: Socket
[127.0.0.1:49011] zk retcode=-4, errno=111(Connection refused): server refused to accept the
client
I0428 19:05:58.550842 24589 master.cpp:104] No whitelist given. Advertising offers for all
slaves
I0428 19:05:58.562800 24589 hierarchical_allocator_process.hpp:726] No resources available
to allocate!
I0428 19:05:58.562861 24589 hierarchical_allocator_process.hpp:688] Performed allocation for
0 slaves in 79561ns
F0428 19:05:58.582340 24547 cluster.hpp:373] Failed to wait for _recover
*** Check failure stack trace: ***
    @     0x2b16240b41ed  google::LogMessage::Fail()
    @     0x2b16240b627f  google::LogMessage::SendToLog()
    @     0x2b16240b3ddc  google::LogMessage::Flush()
    @     0x2b16240b6aed  google::LogMessageFatal::~LogMessageFatal()
    @           0x72b4f9  mesos::internal::tests::Cluster::Masters::start()
    @           0x726233  mesos::internal::tests::MesosTest::StartMaster()
    @           0x76803d  MultipleExecutorsTest_TasksExecutorInfoDiffers_Test::TestBody()
    @           0x8a571d  testing::internal::HandleExceptionsInMethodIfSupported<>()
    @           0x89dd51  testing::Test::Run()
    @           0x89de36  testing::TestInfo::Run()
    @           0x89df77  testing::TestCase::Run()
    @           0x89e2de  testing::internal::UnitTestImpl::RunAllTests()
    @           0x8a529d  testing::internal::HandleExceptionsInMethodIfSupported<>()
    @           0x89d3ae  testing::UnitTest::Run()
    @           0x4a2b90  main
    @     0x2b162550176d  (unknown)
    @           0x4adf81  (unknown)
make[4]: *** [check-local] Aborted




--
This message was sent by Atlassian JIRA
(v6.2#6252)

Mime
View raw message