zookeeper-dev mailing list archives

From "zhangbo (JIRA)" <j...@apache.org>
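Subject [jira] [Commented] (ZOOKEEPER-3211) Standalone ZooKeeper deployment on the CentOS 7.0 kernel occasionally hits a fairly serious problem: all 60 default server-side connections enter CLOSE_WAIT and stay there for a long time, leaving zk unable to serve requests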
Date Tue, 11 Dec 2018 08:59:00 GMT

    [ https://issues.apache.org/jira/browse/ZOOKEEPER-3211?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16716595#comment-16716595
] 

zhangbo commented on ZOOKEEPER-3211:
------------------------------------

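1. Is this cluster mode or standalone mode? The log file shows myid:3.

2. For the OutOfMemory problem, run `ulimit -u` to check whether too few user threads are allowed.

3. Did this happen in a production environment? How many applications connect to zk?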
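
For point 2, here is a minimal sketch of reading the same limit from Python, run as the user that starts ZooKeeper. It assumes a Linux host; the soft value of RLIMIT_NPROC is what `ulimit -u` prints. The helper names `nproc_limits` and `describe` are made up for this example, and whether this limit is really behind the OutOfMemory error cannot be told from this message alone.

```python
import resource


def nproc_limits():
    """Return the (soft, hard) RLIMIT_NPROC pair for the current user."""
    return resource.getrlimit(resource.RLIMIT_NPROC)


def describe(limit):
    """Render a limit value the way a person would read it."""
    return "unlimited" if limit == resource.RLIM_INFINITY else str(limit)


if __name__ == "__main__":
    soft, hard = nproc_limits()
    # On Linux, threads count against this limit as well as processes.
    print(f"max user processes: soft={describe(soft)} hard={describe(hard)}")
```

A low soft value here would be worth comparing against the number of threads the JVM actually runs.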

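> Standalone ZooKeeper deployment on the CentOS 7.0 kernel occasionally hits a fairly serious problem: all 60 default server-side connections enter CLOSE_WAIT and stay there for a long time, leaving zk unable to serve requests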
> ------------------------------------------------------------------------------------------
>
>                 Key: ZOOKEEPER-3211
>                 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-3211
>             Project: ZooKeeper
>          Issue Type: Bug
>          Components: server
>    Affects Versions: 3.4.5
>         Environment: 1. Deployment configuration
> server.1=127.0.0.1:2902:2903
> 2. Deployed versions
> Kernel: Linux localhost.localdomain 3.10.0-123.el7.x86_64 #1 SMP Tue Feb 12 19:44:50 EST 2019 x86_64 x86_64 x86_64 GNU/Linux
> JDK:
> java version "1.7.0_181"
> OpenJDK Runtime Environment (rhel-2.6.14.5.el7-x86_64 u181-b00)
> OpenJDK 64-Bit Server VM (build 24.181-b00, mixed mode)
> zk: 3.4.5
>            Reporter: yeshuangshuang
>            Priority: Blocker
>             Fix For: 3.4.5
>
>         Attachments: 1.log, zklog.rar
>
>   Original Estimate: 72h
>  Remaining Estimate: 72h
>
> 1. Deployment configuration
> server.1=127.0.0.1:2902:2903
> 2. Deployed versions
> Kernel: Linux localhost.localdomain 3.10.0-123.el7.x86_64 #1 SMP Tue Feb 12 19:44:50 EST 2019 x86_64 x86_64 x86_64 GNU/Linux
> JDK:
> java version "1.7.0_181"
> OpenJDK Runtime Environment (rhel-2.6.14.5.el7-x86_64 u181-b00)
> OpenJDK 64-Bit Server VM (build 24.181-b00, mixed mode)
> zk: 3.4.5
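> 3. Symptom: the problem does not occur every time, but it reproduces very often. It starts with read/write timeouts of roughly 6 s; a few minutes later all connections (including long-lived connections) are in the CLOSE_WAIT state.
> 4. Current workaround: once all connections are found in CLOSE_WAIT, manually restart the ZooKeeper server.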
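
To watch for the symptom in item 3 before it takes the server down, one option is to count server-side sockets per TCP state. The sketch below parses the Linux /proc/net/tcp table, where state code 08 is CLOSE_WAIT (the peer has closed and the local process has not yet closed its end). The client port 2181 is an assumption (the ZooKeeper default; the report does not state it), and `count_states` is a name made up for this example.

```python
import os
from collections import Counter

# Subset of the kernel's TCP state codes as printed in /proc/net/tcp.
TCP_STATES = {"01": "ESTABLISHED", "06": "TIME_WAIT",
              "08": "CLOSE_WAIT", "0A": "LISTEN"}


def count_states(table_text, local_port):
    """Count sockets per TCP state for one local port.

    table_text is the content of /proc/net/tcp or /proc/net/tcp6.
    """
    counts = Counter()
    for line in table_text.splitlines()[1:]:  # first line is the header
        fields = line.split()
        if len(fields) < 4:
            continue
        # Local address is HEXIP:HEXPORT; the port follows the last colon.
        port = int(fields[1].rsplit(":", 1)[1], 16)
        if port == local_port:
            counts[TCP_STATES.get(fields[3], fields[3])] += 1
    return counts


if __name__ == "__main__":
    total = Counter()
    # A JVM often listens on an IPv6 socket, so check both tables.
    for path in ("/proc/net/tcp", "/proc/net/tcp6"):
        if os.path.exists(path):
            with open(path) as f:
                total += count_states(f.read(), 2181)  # assumed client port
    print(dict(total))
```

A CLOSE_WAIT count that keeps climbing toward the connection limit would match the behaviour described in the report.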



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
