mesos-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Badal Naik <think_h...@icloud.com>
Subject After restarting cluster task disappeared
Date Fri, 09 Oct 2015 12:16:48 GMT
Hello Mesos-Users,

I have set up 3 node mess cluster with ubuntu 14.04. i have started zookeeper,Mesos and marathon.
Every thing working fine expect three things.

1) When i restart the whole cluster mesos does not show completed tasks. is it expected behaviour?
if not what i should do?

2) in mesos web ui i’m not able to see staged/started/finished/killed/failed/lost task numbers
even when tasks are running.

3) Every zookeeper instance throws this exception regularly:

 2015-10-09 17:27:26,302 [myid:3] - WARN  [SendWorker:1:QuorumCnxManager$SendWorker@679] -
Interrupted while waiting for message on queue
java.lang.InterruptedException
	at java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject.reportInterruptAfterWait(AbstractQueuedSynchronizer.java:2014)
	at java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject.awaitNanos(AbstractQueuedSynchronizer.java:2088)
	at java.util.concurrent.ArrayBlockingQueue.poll(ArrayBlockingQueue.java:418)
	at org.apache.zookeeper.server.quorum.QuorumCnxManager.pollSendQueue(QuorumCnxManager.java:831)
	at org.apache.zookeeper.server.quorum.QuorumCnxManager.access$500(QuorumCnxManager.java:62)
	at org.apache.zookeeper.server.quorum.QuorumCnxManager$SendWorker.run(QuorumCnxManager.java:667)




Here is my Mesos-master configuration:

mesos master --ip=10.1.0.72  --work_dir=/var/lib/mesos-master --zk=file:///etc/mesos/conf/zk
--quorum=file:///etc/mesos/conf/quorum
	
	Where zk=zk://zoo.service.consul:2181/mesos
              quorum=2
              


Mesos-Slave Configuration:

mesos slave --work_dir=/var/lib/mesos-slave --ip=10.1.0.72 --hostname=10.1.0.72 --strict=false
 --master=file:///etc/mesos/conf/master FrameworkInfo.checkpoint=True
      


Marathon Configuration:

java -jar /opt/marathon.jar  --master zk://zoo.service.consul:2181/mesos  --zk zk://zoo.service.consul:2181/marathon
 --ha --hostname 10.1.0.72  --checkpoint




Zookeeper configs with java version "1.8.0_45":



dataDir=/var/lib/zookeeper
clientPort=2181
tickTime=2000
initLimit=10
syncLimit=20


autopurge.purgeInterval=0


zookeeper.connection.timeout.ms=6000
server.1=10.1.0.70:2888:3888
server.2=10.1.0.71:2888:3888
server.3=10.1.0.72:2888:3888

And different myid has been given.


Can Anyone Help!!!



Mime
View raw message