From "Vinod Kone (JIRA)" <j...@apache.org>
Subject [jira] [Resolved] (MESOS-439) Slave crashes on the duplicate ACK when waiting for the next update
Date Thu, 18 Apr 2013 17:46:13 GMT

     [ https://issues.apache.org/jira/browse/MESOS-439?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Vinod Kone resolved MESOS-439.
------------------------------

    Resolution: Fixed

Pushed to trunk.
                
> Slave crashes on the duplicate ACK when waiting for the next update
> -------------------------------------------------------------------
>
>                 Key: MESOS-439
>                 URL: https://issues.apache.org/jira/browse/MESOS-439
>             Project: Mesos
>          Issue Type: Bug
>            Reporter: Vinod Kone
>            Assignee: Vinod Kone
>
> I0418 15:17:04.299052 43021 slave.cpp:719] Got assigned task 1366298223270-mesos-meta_slave_9-13-6b22df3f-d77f-44ac-a0b2-6047cebf0a93
for framework 201103282247-0000000019-0000
> I0418 15:17:04.305749 43021 slave.cpp:792] Launching task 1366298223270-mesos-meta_slave_9-13-6b22df3f-d77f-44ac-a0b2-6047cebf0a93
for framework 201103282247-0000000019-0000
> I0418 15:17:04.307135 43021 paths.hpp:302] Created executor directory '/var/lib/mesos/slaves/201303281614-1937777162-5050-34776-36/frameworks/201103282247-0000000019-0000/executors/thermos-1366298223270-mesos-meta_slave_9-13-6b22df3f-d77f-44ac-a0b2-6047cebf0a93/runs/e02f6720-c049-4a54-9865-e537a9d47ec6'
> I0418 15:17:04.322979 43021 slave.cpp:940] Queuing task '1366298223270-mesos-meta_slave_9-13-6b22df3f-d77f-44ac-a0b2-6047cebf0a93'
for executor thermos-1366298223270-mesos-meta_slave_9-13-6b22df3f-d77f-44ac-a0b2-6047cebf0a93
of framework '201103282247-0000000019-0000
> I0418 15:17:04.323269 43028 cgroups_isolator.cpp:520] Launching thermos-1366298223270-mesos-meta_slave_9-13-6b22df3f-d77f-44ac-a0b2-6047cebf0a93
(./thermos_executor) in /var/lib/mesos/slaves/201303281614-1937777162-5050-34776-36/frameworks/201103282247-0000000019-0000/executors/thermos-1366298223270-mesos-meta_slave_9-13-6b22df3f-d77f-44ac-a0b2-6047cebf0a93/runs/e02f6720-c049-4a54-9865-e537a9d47ec6
with resources cpus=0.25; mem=128 for framework 201103282247-0000000019-0000 in cgroup mesos/framework_201103282247-0000000019-0000_executor_thermos-1366298223270-mesos-meta_slave_9-13-6b22df3f-d77f-44ac-a0b2-6047cebf0a93_tag_e02f6720-c049-4a54-9865-e537a9d47ec6
> I0418 15:17:04.325932 43028 cgroups_isolator.cpp:655] Changing cgroup controls for executor
thermos-1366298223270-mesos-meta_slave_9-13-6b22df3f-d77f-44ac-a0b2-6047cebf0a93 of framework
201103282247-0000000019-0000 with resources cpus=0.25; mem=128
> I0418 15:17:04.326355 43028 cgroups_isolator.cpp:839] Updated 'cpu.shares' to 256 for
executor thermos-1366298223270-mesos-meta_slave_9-13-6b22df3f-d77f-44ac-a0b2-6047cebf0a93
of framework 201103282247-0000000019-0000
> I0418 15:17:04.326457 43032 slave.cpp:512] Successfully attached file '/var/lib/mesos/slaves/201303281614-1937777162-5050-34776-36/frameworks/201103282247-0000000019-0000/executors/thermos-1366298223270-mesos-meta_slave_9-13-6b22df3f-d77f-44ac-a0b2-6047cebf0a93/runs/e02f6720-c049-4a54-9865-e537a9d47ec6'
> I0418 15:17:04.326828 43028 cgroups_isolator.cpp:977] Updated 'memory.limit_in_bytes'
to 134217728 for executor thermos-1366298223270-mesos-meta_slave_9-13-6b22df3f-d77f-44ac-a0b2-6047cebf0a93
of framework 201103282247-0000000019-0000
> I0418 15:17:04.329628 43028 cgroups_isolator.cpp:1003] Started listening for OOM events
for executor thermos-1366298223270-mesos-meta_slave_9-13-6b22df3f-d77f-44ac-a0b2-6047cebf0a93
of framework 201103282247-0000000019-0000
> Fetching resources into '/var/lib/mesos/slaves/201303281614-1937777162-5050-34776-36/frameworks/201103282247-0000000019-0000/executors/thermos-1366298223270-mesos-meta_slave_9-13-6b22df3f-d77f-44ac-a0b2-6047cebf0a93/runs/e02f6720-c049-4a54-9865-e537a9d47ec6'
> I0418 15:17:05.550911 43022 slave.cpp:1391] Got registration for executor 'thermos-1366298223270-mesos-meta_slave_9-13-6b22df3f-d77f-44ac-a0b2-6047cebf0a93'
of framework 201103282247-0000000019-0000
> I0418 15:17:05.551237 43023 cgroups_isolator.cpp:655] Changing cgroup controls for executor
thermos-1366298223270-mesos-meta_slave_9-13-6b22df3f-d77f-44ac-a0b2-6047cebf0a93 of framework
201103282247-0000000019-0000 with resources cpus=0.35; mem=384; disk=1024
> I0418 15:17:05.551877 43023 cgroups_isolator.cpp:839] Updated 'cpu.shares' to 358 for
executor thermos-1366298223270-mesos-meta_slave_9-13-6b22df3f-d77f-44ac-a0b2-6047cebf0a93
of framework 201103282247-0000000019-0000
> I0418 15:17:05.552738 43023 cgroups_isolator.cpp:977] Updated 'memory.limit_in_bytes'
to 402653184 for executor thermos-1366298223270-mesos-meta_slave_9-13-6b22df3f-d77f-44ac-a0b2-6047cebf0a93
of framework 201103282247-0000000019-0000
> I0418 15:17:05.600024 43030 slave.cpp:1733] Handling status update TASK_STARTING from
task 1366298223270-mesos-meta_slave_9-13-6b22df3f-d77f-44ac-a0b2-6047cebf0a93 of framework
201103282247-0000000019-0000
> I0418 15:17:05.600225 43028 status_update_manager.cpp:289] Received status update TASK_STARTING
from task 1366298223270-mesos-meta_slave_9-13-6b22df3f-d77f-44ac-a0b2-6047cebf0a93 of framework
201103282247-0000000019-0000 with checkpoint=false
> I0418 15:17:05.600306 43028 status_update_manager.cpp:451] Creating StatusUpdate stream
for task 1366298223270-mesos-meta_slave_9-13-6b22df3f-d77f-44ac-a0b2-6047cebf0a93 of framework
201103282247-0000000019-0000
> I0418 15:17:05.600374 43028 status_update_manager.hpp:336] Handling UPDATE for status
update TASK_STARTING from task 1366298223270-mesos-meta_slave_9-13-6b22df3f-d77f-44ac-a0b2-6047cebf0a93
of framework 201103282247-0000000019-0000
> I0418 15:17:05.600409 43028 status_update_manager.cpp:335] Forwarding status update TASK_STARTING
from task 1366298223270-mesos-meta_slave_9-13-6b22df3f-d77f-44ac-a0b2-6047cebf0a93 of framework
201103282247-0000000019-0000 to the master at master@10.34.128.115:5050
> I0418 15:17:05.600632 43021 slave.cpp:1793] Sending ACK for status update TASK_STARTING
from task 1366298223270-mesos-meta_slave_9-13-6b22df3f-d77f-44ac-a0b2-6047cebf0a93 of framework
201103282247-0000000019-0000 to executor executor(1)@10.34.135.114:42980
> I0418 15:17:07.127419 43036 slave.cpp:1733] Handling status update TASK_RUNNING from
task 1366298223270-mesos-meta_slave_9-13-6b22df3f-d77f-44ac-a0b2-6047cebf0a93 of framework
201103282247-0000000019-0000
> I0418 15:17:07.127655 43025 status_update_manager.cpp:289] Received status update TASK_RUNNING
from task 1366298223270-mesos-meta_slave_9-13-6b22df3f-d77f-44ac-a0b2-6047cebf0a93 of framework
201103282247-0000000019-0000 with checkpoint=false
> I0418 15:17:07.127707 43025 status_update_manager.hpp:336] Handling UPDATE for status
update TASK_RUNNING from task 1366298223270-mesos-meta_slave_9-13-6b22df3f-d77f-44ac-a0b2-6047cebf0a93
of framework 201103282247-0000000019-0000
> I0418 15:17:07.127779 43025 slave.cpp:1793] Sending ACK for status update TASK_RUNNING
from task 1366298223270-mesos-meta_slave_9-13-6b22df3f-d77f-44ac-a0b2-6047cebf0a93 of framework
201103282247-0000000019-0000 to executor executor(1)@10.34.135.114:42980
> W0418 15:17:15.601752 43024 status_update_manager.cpp:434] Resending status update TASK_STARTING
from task 1366298223270-mesos-meta_slave_9-13-6b22df3f-d77f-44ac-a0b2-6047cebf0a93 of framework
201103282247-0000000019-0000
> I0418 15:17:15.601836 43024 status_update_manager.cpp:335] Forwarding status update TASK_STARTING
from task 1366298223270-mesos-meta_slave_9-13-6b22df3f-d77f-44ac-a0b2-6047cebf0a93 of framework
201103282247-0000000019-0000 to the master at master@10.34.128.115:5050
> I0418 15:17:18.861799 43021 slave.cpp:1307] Got acknowledgement of status update for
task 1366298223270-mesos-meta_slave_9-13-6b22df3f-d77f-44ac-a0b2-6047cebf0a93 of framework
201103282247-0000000019-0000
> I0418 15:17:18.869899 43025 status_update_manager.cpp:360] Received status update acknowledgement
for task 1366298223270-mesos-meta_slave_9-13-6b22df3f-d77f-44ac-a0b2-6047cebf0a93 of framework
201103282247-0000000019-0000
> I0418 15:17:18.870002 43025 status_update_manager.hpp:336] Handling ACK for status update
TASK_STARTING from task 1366298223270-mesos-meta_slave_9-13-6b22df3f-d77f-44ac-a0b2-6047cebf0a93
of framework 201103282247-0000000019-0000
> I0418 15:17:18.870111 43025 status_update_manager.cpp:335] Forwarding status update TASK_RUNNING
from task 1366298223270-mesos-meta_slave_9-13-6b22df3f-d77f-44ac-a0b2-6047cebf0a93 of framework
201103282247-0000000019-0000 to the master at master@10.34.128.115:5050
> I0418 15:17:18.870247 43025 slave.cpp:1344] Status update manager successfully handled
status update acknowledgement for task 1366298223270-mesos-meta_slave_9-13-6b22df3f-d77f-44ac-a0b2-6047cebf0a93
of framework 201103282247-0000000019-0000
> I0418 15:17:19.301548 43024 slave.cpp:1307] Got acknowledgement of status update for
task 1366298223270-mesos-meta_slave_9-13-6b22df3f-d77f-44ac-a0b2-6047cebf0a93 of framework
201103282247-0000000019-0000
> I0418 15:17:19.301774 43024 status_update_manager.cpp:360] Received status update acknowledgement
for task 1366298223270-mesos-meta_slave_9-13-6b22df3f-d77f-44ac-a0b2-6047cebf0a93 of framework
201103282247-0000000019-0000
> F0418 15:17:19.375743 43024 status_update_manager.hpp:236] Check failed: uuid == UUID::fromBytes(update.uuid())
Unexpected UUID mismatch! (received 72fae945-1afb-4f86-a80e-c0b67df0aa04, expecting e1ea786a-7a7a-4f02-ae74-ae09b525ce11)
for update TASK_RUNNING from task 1366298223270-mesos-meta_slave_9-13-6b22df3f-d77f-44ac-a0b2-6047cebf0a93
of framework 201103282247-0000000019-0000
> I0418 15:17:22.727701 24131 cgroups_isolator.cpp:784] Removing orphaned cgroup 'mesos/framework_201103282247-0000000019-0000_executor_thermos-1366298223270-mesos-meta_slave_9-13-6b22df3f-d77f-44ac-a0b2-6047cebf0a93_tag_e02f6720-c049-4a54-9865-e537a9d47ec6'
> I0418 15:17:22.729791 24126 cgroups.cpp:1175] Trying to freeze cgroup /cgroup/mesos/framework_201103282247-0000000019-0000_executor_thermos-1366298223270-mesos-meta_slave_9-13-6b22df3f-d77f-44ac-a0b2-6047cebf0a93_tag_e02f6720-c049-4a54-9865-e537a9d47ec6
> I0418 15:17:23.363044 24126 cgroups.cpp:1214] Successfully froze cgroup /cgroup/mesos/framework_201103282247-0000000019-0000_executor_thermos-1366298223270-mesos-meta_slave_9-13-6b22df3f-d77f-44ac-a0b2-6047cebf0a93_tag_e02f6720-c049-4a54-9865-e537a9d47ec6
after 7 attempts
> I0418 15:17:23.365375 24123 cgroups.cpp:1190] Trying to thaw cgroup /cgroup/mesos/framework_201103282247-0000000019-0000_executor_thermos-1366298223270-mesos-meta_slave_9-13-6b22df3f-d77f-44ac-a0b2-6047cebf0a93_tag_e02f6720-c049-4a54-9865-e537a9d47ec6
> I0418 15:17:23.365535 24123 cgroups.cpp:1298] Successfully thawed /cgroup/mesos/framework_201103282247-0000000019-0000_executor_thermos-1366298223270-mesos-meta_slave_9-13-6b22df3f-d77f-44ac-a0b2-6047cebf0a93_tag_e02f6720-c049-4a54-9865-e537a9d47ec6
> I0418 15:17:23.475486 24132 cgroups_isolator.cpp:1125] Successfully destroyed cgroup
mesos/framework_201103282247-0000000019-0000_executor_thermos-1366298223270-mesos-meta_slave_9-13-6b22df3f-d77f-44ac-a0b2-6047cebf0a93_tag_e02f6720-c049-4a54-9865-e537a9d47ec6
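
The log above suggests the following order of events: TASK_STARTING is forwarded, TASK_RUNNING is queued behind it, TASK_STARTING is resent after about ten seconds, and then two acknowledgements arrive. The first one retires TASK_STARTING. The second one, presumably the ACK for the resent copy, arrives while the stream is already waiting on TASK_RUNNING, and the UUID check aborts the slave. The Python sketch below is a toy model of that sequence, not the Mesos code and not the fix that was pushed. The class name, the method names, and the choice to remember acknowledged UUIDs in a set are all assumptions made for illustration.

```python
import uuid
from collections import deque


class StatusUpdateStream:
    """Toy model of a per-task stream of status updates awaiting ACKs.

    Hypothetical names throughout; this is not the Mesos implementation.
    """

    def __init__(self):
        self.pending = deque()     # (uuid, state) pairs not yet acknowledged
        self.acknowledged = set()  # UUIDs that were already acknowledged

    def update(self, state):
        """Queue a new status update and return the UUID assigned to it."""
        update_uuid = uuid.uuid4()
        self.pending.append((update_uuid, state))
        return update_uuid

    def acknowledge(self, ack_uuid):
        """Return True if the ACK retired the update at the front of the
        queue, False if it was a duplicate and was ignored."""
        if ack_uuid in self.acknowledged:
            # Duplicate ACK, e.g. one triggered by a resent update.
            return False
        if not self.pending or self.pending[0][0] != ack_uuid:
            # Still treat a genuinely unknown UUID as an error.
            raise ValueError(f"unexpected ACK {ack_uuid}")
        self.pending.popleft()
        self.acknowledged.add(ack_uuid)
        return True


def replay():
    """Replay the order of events inferred from the log."""
    stream = StatusUpdateStream()
    starting = stream.update("TASK_STARTING")
    stream.update("TASK_RUNNING")
    first = stream.acknowledge(starting)   # ACK for TASK_STARTING
    second = stream.acknowledge(starting)  # duplicate ACK after the resend
    return first, second, [state for _, state in stream.pending]
```

In this model the duplicate acknowledgement is dropped and TASK_RUNNING stays pending until its own ACK arrives, while an ACK for a UUID the stream has never seen still raises an error.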

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators.
For more information on JIRA, see: http://www.atlassian.com/software/jira
