Subject: Re: Adding custom monitoring to Flink
To: dev@flink.apache.org
From: Chesnay Schepler
Message-ID: <570FF03B.8050208@apache.org>
Date: Thu, 14 Apr 2016 21:32:11 +0200

I'm currently working on a metric system that
a) exposes several TaskManager metrics
b) allows gathering metrics in various parts of a task, most notably user-defined functions.

The first version makes these metrics available via JMX on each TaskManager.
While a mechanism to make that pluggable is /planned/, there are no details on that yet. I /guess/ that once it is merged you should be able to modify one of the classes so that the data is exported directly to your tool, but I would have to know more about it to make a definite assessment.

There are no plans to funnel all those metrics unaggregated through Flink's accumulator mechanism; only a selection will be aggregated locally and on the JobManager for display in the Dashboard.

Out of curiosity, what metrics are you interested in?

On 14.04.2016 20:59, Maxim wrote:
> Hi!
> I'm looking into integrating Flink into our stack and one of the
> requirements is to report metrics to an internal system. The current
> Accumulators are not adequate to provide visibility that we need to run
> such a system in production. We want much more information about the
> internal cluster state and ability to calculate aggregates ourselves. The
> core reporting API accepts a metric name, metric type (gauge, counter,
> timer) and a set of key value pairs that act as dimensions.
>
> The ideal solution for us would report the metrics through such API and
> provide default binding to existing Accumulators, but allow overriding it
> to our internal reporting client.
>
> Is it something that could be added to the Flink or there are other plans
> for monitoring?
>
> Thanks!
>
> Maxim.
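
For illustration only, the kind of reporting API described in the quoted message (a metric name, a type of gauge, counter or timer, and key-value dimensions, with a reporter that can be swapped out) might look roughly like the sketch below. Every name in it (MetricReporterSketch, MetricType, MetricEvent, MetricReporter, InMemoryReporter) is hypothetical; none of this is Flink API or the design being worked on, and the in-memory reporter merely stands in for whatever default binding would exist.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical sketch: none of these types are part of Flink.
class MetricReporterSketch {

    /** The three metric types named in the quoted message. */
    enum MetricType { GAUGE, COUNTER, TIMER }

    /** One reported data point: name, type, value and dimensions. */
    static final class MetricEvent {
        final String name;
        final MetricType type;
        final double value;
        final Map<String, String> dimensions;

        MetricEvent(String name, MetricType type, double value,
                    Map<String, String> dimensions) {
            this.name = name;
            this.type = type;
            this.value = value;
            // Sorted, read-only copy: stable ordering, no mutation by callers.
            this.dimensions =
                Collections.unmodifiableMap(new TreeMap<>(dimensions));
        }

        @Override
        public String toString() {
            return type + " " + name + "=" + value + " " + dimensions;
        }
    }

    /** The pluggable part: implement this to forward to an internal client. */
    interface MetricReporter {
        void report(MetricEvent event);
    }

    /** Stand-in for a default binding: simply keeps events in memory. */
    static final class InMemoryReporter implements MetricReporter {
        private final List<MetricEvent> events = new ArrayList<>();

        @Override
        public synchronized void report(MetricEvent event) {
            events.add(event);
        }

        /** Returns a copy of everything reported so far. */
        synchronized List<MetricEvent> snapshot() {
            return new ArrayList<>(events);
        }
    }

    public static void main(String[] args) {
        InMemoryReporter reporter = new InMemoryReporter();

        Map<String, String> dims = new TreeMap<>();
        dims.put("taskmanager", "tm-1");
        dims.put("job", "wordcount");

        reporter.report(
            new MetricEvent("records.in", MetricType.COUNTER, 42, dims));

        for (MetricEvent e : reporter.snapshot()) {
            System.out.println(e);
        }
    }
}
```

Overriding the default would then amount to passing a different MetricReporter implementation, for example one that wraps an internal reporting client, wherever the in-memory one is constructed.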