Return-Path: X-Original-To: apmail-cassandra-commits-archive@www.apache.org Delivered-To: apmail-cassandra-commits-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 5137C177FD for ; Mon, 13 Oct 2014 15:45:34 +0000 (UTC) Received: (qmail 17773 invoked by uid 500); 13 Oct 2014 15:45:34 -0000 Delivered-To: apmail-cassandra-commits-archive@cassandra.apache.org Received: (qmail 17745 invoked by uid 500); 13 Oct 2014 15:45:33 -0000 Mailing-List: contact commits-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@cassandra.apache.org Delivered-To: mailing list commits@cassandra.apache.org Received: (qmail 17705 invoked by uid 99); 13 Oct 2014 15:45:33 -0000 Received: from arcas.apache.org (HELO arcas.apache.org) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 13 Oct 2014 15:45:33 +0000 Date: Mon, 13 Oct 2014 15:45:33 +0000 (UTC) From: "T Jake Luciani (JIRA)" To: commits@cassandra.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Commented] (CASSANDRA-7821) Add Optional Backoff on Retry to Cassandra Stress MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/CASSANDRA-7821?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14169416#comment-14169416 ] T Jake Luciani commented on CASSANDRA-7821: ------------------------------------------- bq. Of course writing a new retry policy would be acceptable too, a BackingOffRetryPolicy.java or something alike? Yes this is how I think it should be done > Add Optional Backoff on Retry to Cassandra Stress > ------------------------------------------------- > > Key: CASSANDRA-7821 > URL: https://issues.apache.org/jira/browse/CASSANDRA-7821 > Project: Cassandra > Issue Type: Improvement > Reporter: Russell Alexander Spitzer > Assignee: Russell Alexander Spitzer > Attachments: CASSANDRA-7821-2.1.patch > > > Currently when stress is running against a cluster which occasionally has nodes marked as down, it will almost immediately stop. This occurs because the retry loop can execute extremely quickly if each execution terminates with a {{com.datastax.driver.core.exceptions.NoHostAvailableException}} or {{com.datastax.driver.core.exceptions.UnavailableException}}. > In case of these exceptions is will most likely be unable to succeed if the retries are performed as fast as possible. To get around this, we could add an optional delay on retries giving the cluster time to recover rather than terminating the stress run. > We could make this configurable, with options such as: > * Constant # Delays the same amount after each retry > * Linear # Backoff a set amount * the trial number > * Exponential # Backoff set amount * 2 ^ trial number > This may also require adjusting the "thread is stuck check" to make sure that the max retry timeout will not cause the thread to be terminated early. -- This message was sent by Atlassian JIRA (v6.3.4#6332)