From: Andrew Otto
To: users@kafka.apache.org
Subject: Re: Kafka Mirroring setup
Date: Tue, 20 Aug 2013 13:56:27 -0400
Message-Id: <2A115330-EEE1-4463-B56B-8FA30629C910@wikimedia.org>
References: <183A2B11-3C40-4993-B98F-3E2D4BFBC5E6@gmail.com> <177597C7-93B8-4461-A303-E225B7336F78@wikimedia.org>

In our case, our aggregator/analytics cluster is in our main datacenter, so there's no risk of the main producers becoming disconnected from it. It seems nicer to have a dedicated aggregator cluster that only gets its data via MirrorMaker (Option A), but in our case this isn't necessary.

The aggregator cluster could use MirrorMaker to consume from remote datacenters, but still have regular local producers send it data directly (Option B).

On Aug 20, 2013, at 1:47 PM, Jay Kreps wrote:

> We do something like A (though I'm not sure I understand B):
> http://kafka.apache.org/documentation.html#datacenters
>
> Essentially what we wanted was that each datacenter stood alone so that we
> would not lose data if the datacenters became disconnected. Network
> partitions within our data centers are extremely rare but between
> datacenters relatively common.
>
> -Jay
>
> On Tue, Aug 20, 2013 at 10:35 AM, Andrew Otto wrote:
>
>> Hi all!
>>
>> Wikimedia is investigating how best to set up Broker clusters in multiple
>> data centers. Our main analytics Broker cluster is currently in our main
>> datacenter. It is possible for all of the main DC's frontend producers to
>> produce directly to our analytics cluster, but we're not sure if this is a
>> best practice. So! What does LinkedIn recommend?
>>
>> Option A: N + 1 clusters.
>> - N production Broker Clusters (1 for each DC).
>> - +1 aggregator/analytics Broker cluster that mirrors all of the
>> production clusters.
>>
>> Option B: N total Broker clusters.
>> - Frontend producers in the main cluster produce directly to the
>> aggregator/analytics cluster.
>> - Other DC's clusters are mirrored to the aggregator/analytics cluster.
>>
>> Thanks!
>> -Andrew