From: Nate McCall
Date: Thu, 31 Aug 2017 15:31:07 +1200
Subject: Re: Cassandra All host(s) tried for query failed (no host was tried)
To: Cassandra Users
Reply-To: user@cassandra.apache.org

If these app instances sit idle for a while, they might just be timing out their sockets. You can tweak socket settings on the driver as described here:
https://github.com/datastax/java-driver/tree/3.x/manual/socket_options

Perhaps start by explicitly setting keepAlive to true, as it may or may not be set depending on whether the driver is using the native epoll extension or NIO directly (more details on the page above).

On Thu, Aug 31, 2017 at 3:10 AM, Ivan Iliev wrote:

> Hello everyone,
>
> We are using Cassandra 3.9 for storing quite a lot of data produced by
> our tester machines.
>
> Occasionally, we see apps failing to communicate with the Cassandra
> nodes and returning the following errors (captured in the servicemix
> logs):
>
>> by: com.datastax.driver.core.exceptions.NoHostAvailableException: All host(s) tried for query failed (no host was tried)
>> at com.datastax.driver.core.RequestHandler.reportNoMoreHosts(RequestHandler.java:218)
>> at com.datastax.driver.core.RequestHandler.access$1000(RequestHandler.java:43)
>> at com.datastax.driver.core.RequestHandler$SpeculativeExecution.sendRequest(RequestHandler.java:284)
>> at com.datastax.driver.core.RequestHandler.startNewExecution(RequestHandler.java:115)
>> at com.datastax.driver.core.RequestHandler.sendRequest(RequestHandler.java:91)
>> at com.datastax.driver.core.SessionManager.executeAsync(SessionManager.java:132)
>> ... 107 more
>
> As a result, apps that try to send data to Cassandra crash after running
> out of memory, and we have to restart the containers in which they run.
>
> So far I have not been able to identify the cause, as I could not find
> anything relevant at those timestamps in the Cassandra debug and system
> logs.
>
> Could you share some insight on this? What should I check, and where
> should I start, in order to troubleshoot it?
>
> Thanks!
> Ivan

--
-----------------
Nate McCall
Wellington, NZ
@zznate

CTO
Apache Cassandra Consulting
http://www.thelastpickle.com
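
To make the keepAlive suggestion above concrete, here is a minimal JDK-only sketch of what the option means at the socket level. It does not use the Cassandra driver: it reads the SO_KEEPALIVE flag on a plain `java.net.Socket`, sets it explicitly, and reads it back. The class and method names (`KeepAliveSketch`, `keepAliveBeforeAndAfter`) are made up for illustration. In the 3.x Java driver the corresponding knob should be `SocketOptions.setKeepAlive(true)`, passed to the cluster builder via `withSocketOptions(...)`; treat the linked manual page as the authority on that.

```java
import java.io.IOException;
import java.io.UncheckedIOException;
import java.net.Socket;

// Hypothetical illustration class; not part of the Cassandra driver.
class KeepAliveSketch {

    /**
     * Returns {flagBefore, flagAfter}: the SO_KEEPALIVE setting of a fresh
     * socket, and the setting after enabling it explicitly.
     */
    static boolean[] keepAliveBeforeAndAfter() {
        // An unconnected socket is enough to inspect and set the option.
        try (Socket socket = new Socket()) {
            // Whatever the platform default is (commonly disabled).
            boolean before = socket.getKeepAlive();

            // Set it explicitly instead of relying on the default.
            socket.setKeepAlive(true);
            boolean after = socket.getKeepAlive();

            return new boolean[] {before, after};
        } catch (IOException e) {
            // Wrap so callers do not have to declare a checked exception.
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        boolean[] flags = keepAliveBeforeAndAfter();
        System.out.println("keepAlive before: " + flags[0] + ", after: " + flags[1]);
    }
}
```

Keepalive probes only help if they are sent before an intermediate firewall or load balancer drops the idle connection, so the operating system's keepalive interval may need checking as well.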