Date: Thu, 24 Jul 2014 00:48:39 +0000 (UTC)
From: "ASF GitHub Bot (JIRA)"
To: dev@storm.incubator.apache.org
Subject: [jira] [Commented] (STORM-166) Highly available Nimbus

    [ https://issues.apache.org/jira/browse/STORM-166?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14072647#comment-14072647 ]

ASF GitHub Bot
commented on STORM-166:
--------------------------------------

Github user ptgoetz commented on the pull request:

    https://github.com/apache/incubator-storm/pull/61#issuecomment-49955263

    @yveschina any update on the concerns raised?


> Highly available Nimbus
> -----------------------
>
>                 Key: STORM-166
>                 URL: https://issues.apache.org/jira/browse/STORM-166
>             Project: Apache Storm (Incubating)
>          Issue Type: New Feature
>            Reporter: James Xu
>            Priority: Minor
>
> https://github.com/nathanmarz/storm/issues/360
> The goal of this feature is to be able to run multiple Nimbus servers so that if one goes down another one will transparently take over. Here's what needs to happen to implement this:
> 1. Everything currently stored on local disk on Nimbus needs to be stored in a distributed and reliable fashion. A DFS is perfect for this. However, as we do not want to make a DFS a mandatory requirement to run Storm, the storage of these artifacts should be pluggable (default to local filesystem, but the interface should support DFS). You would only be able to run multiple Nimbus servers if you use the right storage, and the storage interface chosen should have a flag indicating whether it's suitable for HA mode or not. If you choose local storage and try to run multiple Nimbus servers, one of them should fail to launch.
> 2. Nimbus servers should register themselves in Zookeeper. They should use a leader election protocol to decide which one is currently responsible for launching and monitoring topologies.
> 3. StormSubmitter should find the Nimbus to connect to via Zookeeper. In case the leader changes during submission, it should use a retry protocol to try reconnecting to the new leader and attempting submission again.



--
This message was sent by Atlassian JIRA
(v6.2#6252)
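
Points 1 and 3 of the issue description above can be illustrated with a rough sketch. It is written in Python purely for brevity (Storm itself runs on the JVM), and every name in it (NimbusStorage, supports_ha, check_launch, NotLeaderError, submit_with_retry) is hypothetical and not part of any Storm API. In-memory dictionaries stand in for local disk and for a DFS, and the leader lookup is passed in as a plain function where a real client would presumably ask Zookeeper.

```python
from abc import ABC, abstractmethod


class NimbusStorage(ABC):
    """Hypothetical pluggable storage for Nimbus artifacts (not a Storm API)."""

    # Flag from point 1: may several Nimbus servers share this backend?
    supports_ha = False

    @abstractmethod
    def put(self, key, data):
        ...

    @abstractmethod
    def get(self, key):
        ...


class _DictStorage(NimbusStorage):
    """In-memory stand-in so the sketch runs without a filesystem."""

    def __init__(self):
        self._blobs = {}

    def put(self, key, data):
        self._blobs[key] = data

    def get(self, key):
        return self._blobs[key]


class LocalStorage(_DictStorage):
    """Default backend: state is visible to a single Nimbus only."""
    supports_ha = False


class SharedStorage(_DictStorage):
    """Stand-in for a DFS-backed store; a real one would write to the DFS."""
    supports_ha = True


def check_launch(storage, other_nimbus_count):
    """Refuse to start an additional Nimbus on storage that is not HA-capable."""
    if other_nimbus_count > 0 and not storage.supports_ha:
        raise RuntimeError("storage backend does not support multiple Nimbus servers")


class NotLeaderError(Exception):
    """Raised (hypothetically) by a Nimbus that is not, or is no longer, the leader."""


def submit_with_retry(find_leader, submit, topology, max_attempts=3):
    """Point 3: look up the leader, submit, and look it up again if it changed."""
    for _ in range(max_attempts):
        leader = find_leader()  # assumed to be backed by Zookeeper in practice
        try:
            return submit(leader, topology)
        except NotLeaderError:
            continue  # leadership moved during submission; retry with a fresh lookup
    raise RuntimeError("submission failed after %d attempts" % max_attempts)
```

With these definitions, check_launch(LocalStorage(), 1) raises, while check_launch(SharedStorage(), 1) returns normally. A production retry loop would likely also add a delay between attempts; that is left out here to keep the sketch short.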