Date: Thu, 12 May 2016 13:48:13 +0000 (UTC)
From: "Ariel Weisberg (JIRA)"
To: commits@cassandra.apache.org
Reply-To: dev@cassandra.apache.org
Subject: [jira] [Commented] (CASSANDRA-11752) histograms/metrics in 2.2 do not appear recency biased

[ https://issues.apache.org/jira/browse/CASSANDRA-11752?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15281528#comment-15281528 ]

Ariel Weisberg commented on CASSANDRA-11752:
--------------------------------------------

Maybe the server should provide the
raw histogram, but for the percentiles provide a value that is for a recent window of time. IOW, do the work of munging the histogram for the monitoring programs instead of forcing them to provide an integration for munging EstimatedHistogram. This would also make it less of a change in behavior for people who are upgrading.

I think that a percentile based on a window of time going back to when the server was started is an inappropriate metric for what JMX is/should be used for. Providing it has zero value, so let's use that API for something more useful and similar to what existed before.

> histograms/metrics in 2.2 do not appear recency biased
> ------------------------------------------------------
>
>                 Key: CASSANDRA-11752
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-11752
>             Project: Cassandra
>          Issue Type: Bug
>          Components: Core
>            Reporter: Chris Burroughs
>              Labels: metrics
>         Attachments: boost-metrics.png, c-jconsole-comparison.png, c-metrics.png, default-histogram.png
>
>
> In addition to upgrading to metrics3, CASSANDRA-5657 switched to using a custom histogram implementation. After upgrading to Cassandra 2.2, histograms/timer metrics are now suspiciously flat. To be useful for graphing and alerting, metrics need to be biased towards recent events.
> I have attached images that I think illustrate this.
>  * The first two are a comparison between latency observed by a C* 2.2 (us) cluster showing very flat lines and a client (using metrics 2.2.0, ms) showing server performance problems. We can't rule out with total certainty that something else is the cause (that's why we measure from both the client & server), but they very rarely disagree.
>  * The 3rd image compares jconsole viewing of metrics on a 2.2 and 2.1 cluster over several minutes. Not a single digit changed on the 2.2 cluster.

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
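
To illustrate the windowed-percentile idea in the comment above, here is a minimal sketch. It is not Cassandra's EstimatedHistogram or any metrics-library API: the class WindowedPercentile, its methods, and the bucket layout are all hypothetical names chosen for illustration. It assumes fixed bucket upper bounds and single-threaded access. It keeps cumulative counts since start (the "raw histogram") and derives a recent percentile by subtracting a snapshot taken at the start of the current window.

```java
import java.util.Arrays;

// Hypothetical illustration only; not Cassandra's EstimatedHistogram API.
// Not thread-safe: a real metric would need atomic counters or striping.
final class WindowedPercentile {
    private final long[] bucketUpperBounds; // ascending upper bound of each bucket
    private final long[] cumulative;        // counts since "server start"
    private long[] windowStart;             // copy of cumulative at the start of the window

    WindowedPercentile(long[] bucketUpperBounds) {
        this.bucketUpperBounds = bucketUpperBounds.clone();
        this.cumulative = new long[bucketUpperBounds.length];
        this.windowStart = new long[bucketUpperBounds.length];
    }

    // Count a value in the first bucket whose upper bound is >= value.
    void record(long value) {
        int i = Arrays.binarySearch(bucketUpperBounds, value);
        if (i < 0) {
            i = -i - 1; // insertion point
        }
        if (i >= cumulative.length) {
            i = cumulative.length - 1; // clamp overflow into the last bucket
        }
        cumulative[i]++;
    }

    // Start a new window: later "recent" percentiles ignore everything recorded so far.
    void rollWindow() {
        windowStart = cumulative.clone();
    }

    // Percentile over all values since start (the behavior the issue describes as flat).
    long percentileSinceStart(double p) {
        return percentile(cumulative, p);
    }

    // Percentile over values recorded since the last rollWindow().
    long percentileRecent(double p) {
        long[] delta = new long[cumulative.length];
        for (int i = 0; i < delta.length; i++) {
            delta[i] = cumulative[i] - windowStart[i];
        }
        return percentile(delta, p);
    }

    // Returns the upper bound of the bucket containing the p-th quantile, or 0 if empty.
    private long percentile(long[] counts, double p) {
        long total = 0;
        for (long c : counts) {
            total += c;
        }
        if (total == 0) {
            return 0;
        }
        long threshold = Math.max(1, (long) Math.ceil(p * total));
        long seen = 0;
        for (int i = 0; i < counts.length; i++) {
            seen += counts[i];
            if (seen >= threshold) {
                return bucketUpperBounds[i];
            }
        }
        return bucketUpperBounds[bucketUpperBounds.length - 1];
    }
}
```

With bounds {1, 10, 100, 1000}, recording 1000 values of 5, calling rollWindow(), and then recording 10 values of 500 gives percentileSinceStart(0.99) == 10 but percentileRecent(0.99) == 1000. The since-start figure barely moves once enough history has accumulated, which matches the flat lines described in the issue, while the windowed figure reflects the recent slowdown. The raw cumulative counts remain available for monitoring tools that prefer to do their own subtraction.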