From: "Dave Webb"
To: "Storm User"
Subject: Parallelism of IMetricsConsumer
Date: Thu, 1 Sep 2016 15:16:33 +0200

Hello!

I want to gather some metrics from all topologies running in Storm and came across the IMetricsConsumer. To add metrics gathering to all topologies, I added my implementation in the "storm.yaml" config. Although it works great so far, I have some conceptual questions.

As far as I understand it, the MetricsConsumer is internally implemented as a Bolt which is connected to all other Bolts. As with Bolts, I can adjust the degree of parallelism by specifying a "parallelism.hint", which basically defines the number of Executors (i.e. threads) that execute this Bolt.

Question 1: With which kind of grouping is the MetricsConsumerBolt connected to the other Bolts? I would have guessed "shuffle grouping" or "local or shuffle grouping". Is this true?
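
For context, a "storm.yaml" registration of the kind described above would look roughly like the sketch below. The class name com.example.MyMetricsConsumer is a hypothetical placeholder, since the actual implementation class is not given in this message; topology.metrics.consumer.register and parallelism.hint are the config keys Storm uses for registering a metrics consumer.

```yaml
# Sketch only: registers one IMetricsConsumer implementation for all topologies.
topology.metrics.consumer.register:
  - class: "com.example.MyMetricsConsumer"   # hypothetical class name, not from the original message
    parallelism.hint: 2                      # number of executors for the consumer bolt
```
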

Question 2: Besides being able to define the number of Executors, can I specify how the MetricsConsumerBolt "instances" are distributed? Imagine that my topology is split across 2 workers which are running on separate machines and I've set "parallelism.hint: 2". Where will the two MetricsConsumerBolt Tasks run? One Task on each Worker? Both Tasks on one Worker?

Thank you!
Dave Webb