Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id 8569B200B43 for ; Wed, 29 Jun 2016 05:46:47 +0200 (CEST) Received: by cust-asf.ponee.io (Postfix) id 82943160A6C; Wed, 29 Jun 2016 03:46:47 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id CB639160A56 for ; Wed, 29 Jun 2016 05:46:46 +0200 (CEST) Received: (qmail 1948 invoked by uid 500); 29 Jun 2016 03:46:46 -0000 Mailing-List: contact commits-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@cassandra.apache.org Delivered-To: mailing list commits@cassandra.apache.org Received: (qmail 1927 invoked by uid 99); 29 Jun 2016 03:46:45 -0000 Received: from arcas.apache.org (HELO arcas) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 29 Jun 2016 03:46:45 +0000 Received: from arcas.apache.org (localhost [127.0.0.1]) by arcas (Postfix) with ESMTP id 7FCAA2C029F for ; Wed, 29 Jun 2016 03:46:45 +0000 (UTC) Date: Wed, 29 Jun 2016 03:46:45 +0000 (UTC) From: "peng xiao (JIRA)" To: commits@cassandra.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Updated] (CASSANDRA-12103) Cassandra is hang and cqlsh was not able to login with OperationTimeout error MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 archived-at: Wed, 29 Jun 2016 03:46:47 -0000 [ https://issues.apache.org/jira/browse/CASSANDRA-12103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] peng xiao updated CASSANDRA-12103: ---------------------------------- Component/s: Core > Cassandra is hang and cqlsh was not able to login with OperationTimeout error > ----------------------------------------------------------------------------- > > Key: CASSANDRA-12103 > URL: https://issues.apache.org/jira/browse/CASSANDRA-12103 > Project: Cassandra > Issue Type: Bug > Components: Core, Local Write-Read Paths > Environment: centos 6.5 cassandra 2.1.9 > Reporter: peng xiao > Priority: Critical > Attachments: system.log.2016-06-28_1257.gz > > > Hi, > We have two DCs(DC1 and DC2) with DC1 3 nodes and DC2 9 nodes. > And we experienced a Timeout error today,all applications connected to DC1 were hang and no response,even cqlsh was not able to log into any node in DC1. > I restarted the 3 nodes in DC1,the problem was not resolved. > Then we switched to DC2,then applications back to normal. > Could you please help to take a look? > Thanks > many errors like below: > ERROR [SharedPool-Worker-43] 2016-06-28 11:58:49,705 Message.java:538 - Unexpected exception during request; channel = [id: 0x87e315d6, /172.16.10.198:13604 => /172.16.11.13:9042] > java.lang.RuntimeException: org.apache.cassandra.exceptions.ReadTimeoutException: Operation timed out - received only 0 responses. > at org.apache.cassandra.auth.Auth.selectUser(Auth.java:276) ~[apache-cassandra-2.1.9.jar:2.1.9] > at org.apache.cassandra.auth.Auth.isExistingUser(Auth.java:86) ~[apache-cassandra-2.1.9.jar:2.1.9] > at org.apache.cassandra.service.ClientState.login(ClientState.java:206) ~[apache-cassandra-2.1.9.jar:2.1.9] > at org.apache.cassandra.transport.messages.AuthResponse.execute(AuthResponse.java:82) ~[apache-cassandra-2.1.9.jar:2.1.9] > at org.apache.cassandra.transport.Message$Dispatcher.channelRead0(Message.java:439) [apache-cassandra-2.1.9.jar:2.1.9] > at org.apache.cassandra.transport.Message$Dispatcher.channelRead0(Message.java:335) [apache-cassandra-2.1.9.jar:2.1.9] > at io.netty.channel.SimpleChannelInboundHandler.channelRead(SimpleChannelInboundHandler.java:105) [netty-all-4.0.23.Final.jar:4.0.23.Final] > at io.netty.channel.AbstractChannelHandlerContext.invokeChannelRead(AbstractChannelHandlerContext.java:333) [netty-all-4.0.23.Final.jar:4.0.23.Final] > at io.netty.channel.AbstractChannelHandlerContext.access$700(AbstractChannelHandlerContext.java:32) [netty-all-4.0.23.Final.jar:4.0.23.Final] > at io.netty.channel.AbstractChannelHandlerContext$8.run(AbstractChannelHandlerContext.java:324) [netty-all-4.0.23.Final.jar:4.0.23.Final] > at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511) [na:1.8.0] > at org.apache.cassandra.concurrent.AbstractTracingAwareExecutorService$FutureTask.run(AbstractTracingAwareExecutorService.java:164) [apache-cassandra-2.1.9.jar:2.1.9] > at org.apache.cassandra.concurrent.SEPWorker.run(SEPWorker.java:105) [apache-cassandra-2.1.9.jar:2.1.9] > at java.lang.Thread.run(Thread.java:744) [na:1.8.0] -- This message was sent by Atlassian JIRA (v6.3.4#6332)