Return-Path: X-Original-To: apmail-zookeeper-bookkeeper-user-archive@minotaur.apache.org Delivered-To: apmail-zookeeper-bookkeeper-user-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id C85A69233 for ; Thu, 3 May 2012 17:26:05 +0000 (UTC) Received: (qmail 14488 invoked by uid 500); 3 May 2012 17:26:05 -0000 Delivered-To: apmail-zookeeper-bookkeeper-user-archive@zookeeper.apache.org Received: (qmail 14467 invoked by uid 500); 3 May 2012 17:26:05 -0000 Mailing-List: contact bookkeeper-user-help@zookeeper.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: bookkeeper-user@zookeeper.apache.org Delivered-To: mailing list bookkeeper-user@zookeeper.apache.org Received: (qmail 14459 invoked by uid 99); 3 May 2012 17:26:05 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 03 May 2012 17:26:05 +0000 X-ASF-Spam-Status: No, hits=0.7 required=5.0 tests=SPF_NEUTRAL X-Spam-Check-By: apache.org Received-SPF: neutral (nike.apache.org: local policy) Received: from [193.1.99.77] (HELO gir.skynet.ie) (193.1.99.77) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 03 May 2012 17:25:56 +0000 Received: from moloch.vm.bytemark.co.uk (moloch.bleurgh.com [80.68.94.101]) (using TLSv1 with cipher DHE-RSA-AES128-SHA (128/128 bits)) (No client certificate requested) by gir.skynet.ie (Postfix) with ESMTPSA id B211512371 for ; Thu, 3 May 2012 18:25:35 +0100 (IST) Date: Thu, 3 May 2012 19:25:34 +0200 From: Ivan Kelly To: bookkeeper-user@zookeeper.apache.org Subject: Re: topic naming, grouping of messages and topic redistribution in Hedwig Message-ID: <20120503172533.GG10622@moloch.vm.bytemark.co.uk> References: <4FA26705.5050906@apache.org> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <4FA26705.5050906@apache.org> User-Agent: Mutt/1.5.20 (2009-06-14) > - Are topic names unique across Hedwig instances (spread on > different datacenters for example)? Is there a naming scheme? No, a topic name is global to the system. If you publish to TopicA on region1, a subscriber on region2 will get the message. > - How can one ensure that a group of messages can be handled > atomically by Hedwig (in particular 1 message persisted => all > messages from the group persisted)? One way could be to create a > single message containing all messages from the group, and send that > to Hedwig, but maybe there is a better way? Batching into one message is the only way to do this. > - when a hub fails, its topics are redistributed to other hubs, but > would it be possible to control the redistribution? This could be > useful in order to limit the number of hubs a client would have to > connect to (in the case all topics are reallocated to distinct > hubs). It seems that the current logic is in ZkTopicManager and > redistribution of topic is based on load information gathered from > the available hubs. But could this be made customizable? There's actually work planned to make the TopicManager interfaces more pluggable so that other backends could be plugged in. This work could be folded into that maybe. -Ivan