spark-issues mailing list archives

From "zengqiuyang (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (SPARK-9629) Client session timed out, have not heard from server in
Date Fri, 07 Aug 2015 02:30:45 GMT

    [ https://issues.apache.org/jira/browse/SPARK-9629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14661211#comment-14661211 ]

zengqiuyang commented on SPARK-9629:
------------------------------------

Maybe. I'm not sure, because the error log is not identical, only similar.
For now I am waiting for the error to appear again.
It has been working well these past few days.
If the error does not appear again, I will reclassify the issue as a ZooKeeper problem.

Thanks

>  Client session timed out, have not heard from server in
> --------------------------------------------------------
>
>                 Key: SPARK-9629
>                 URL: https://issues.apache.org/jira/browse/SPARK-9629
>             Project: Spark
>          Issue Type: Bug
>          Components: Deploy
>    Affects Versions: 1.4.0, 1.4.1
>         Environment: spark1.4.1    ./make-distribution.sh --tgz -Dhadoop.version=2.5.2 -Dyarn.version=2.5.2 -Phive -Phive-thriftserver -Pyarn
> zookeeper-3.4.6.tar.gz
> standalone HA
> Linux version 2.6.32-358.el6.x86_64 (mockbuild@c6b8.bsys.dev.centos.org) (gcc version 4.4.7 20120313 (Red Hat 4.4.7-3) (GCC) ) #1 SMP Fri Feb 22 00:31:26 UTC 2013
>            Reporter: zengqiuyang
>            Priority: Critical
>
> Spark standalone HA runs for a few days, and then "Client session timed out" appears.
> The log says it is attempting to reconnect, but the reconnect does not happen, and the master shuts down.
> Logs:
> 15/08/05 05:32:57 INFO zookeeper.ClientCnxn: Client session timed out, have not heard from server in 37753ms for sessionid 0x34ee39684b70005, closing socket connection and attempting reconnect
> 15/08/05 05:32:57 INFO state.ConnectionStateManager: State change: SUSPENDED
> 15/08/05 05:32:57 WARN state.ConnectionStateManager: There are no ConnectionStateListeners registered.
> 15/08/05 05:32:57 INFO zookeeper.ClientCnxn: Opening socket connection to server h5/192.168.0.18:2181. Will not attempt to authenticate using SASL (unknown error)
> 15/08/05 05:32:57 INFO zookeeper.ClientCnxn: Socket connection established to h5/192.168.0.18:2181, initiating session
> 15/08/05 05:32:57 INFO zookeeper.ClientCnxn: Session establishment complete on server h5/192.168.0.18:2181, sessionid = 0x34ee39684b70005, negotiated timeout = 40000
> 15/08/05 05:32:57 INFO state.ConnectionStateManager: State change: RECONNECTED
> 15/08/05 05:32:57 WARN state.ConnectionStateManager: There are no ConnectionStateListeners registered.
> 15/08/05 05:32:58 INFO zookeeper.ClientCnxn: Client session timed out, have not heard from server in 37753ms for sessionid 0x34ee39684b70006, closing socket connection and attempting reconnect
> 15/08/05 05:32:58 INFO state.ConnectionStateManager: State change: SUSPENDED
> 15/08/05 05:32:58 INFO master.ZooKeeperLeaderElectionAgent: We have lost leadership
> 15/08/05 05:32:58 ERROR master.Master: Leadership has been revoked -- master shutting down.
> 15/08/05 05:32:58 INFO util.Utils: Shutdown hook called



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

