From: Alexander Shraer
Date: Wed, 12 Jul 2017 15:02:13 -0700
Subject: Re: New to zookeeper
To: "user@zookeeper.apache.org"

Just a small comment - 3.5.3 is in beta. The getConfig API returns a list of
servers in the cluster, including their ports and roles in the ensemble.

Alex

On Wed, Jul 12, 2017 at 7:53 AM, Washko, Daniel wrote:

> I speak strictly from my experience with Zookeeper and not in any official
> capacity of the project or of Exhibitor.
>
> Exhibitor works great and allows you to easily automate clustering
> zookeeper nodes into an ensemble and discovering the individual nodes in
> the ensemble via an HTTP call. We ran into a problem, though, after we
> implemented Exhibitor across our infrastructure. Every so often our
> Zookeeper ensembles lost the data they stored. While I cannot say this was
> caused by Exhibitor, we have Solr clouds where Exhibitor was not used and
> they never had this problem. My suspicion is that there was a problem with
> a zookeeper node and Exhibitor removed that node from the ensemble, then did
> a rolling restart. When that node recovered, for some reason the data was
> corrupted or lost. Exhibitor pulled that node back into the ensemble and
> did a rolling restart. That node became leader, and when the others joined
> they synced from it. Those nodes then dumped their stored data to be in sync
> with the leader. This is my speculation; I have had a very hard time
> replicating this and have not heard of anyone else having this problem.
> Again, I am not definitively saying Exhibitor is the cause of this, but
> since we removed Exhibitor this problem has not occurred.
>
> The Zookeeper 3.5.x branch adds discovery functionality and does automated
> clustering. It’s great, but from what I understand it is still in alpha.
>
> Prior to the 3.5.x branch I know of no way to discover what nodes are
> actually in the ensemble. The 4 letter commands will tell you whether a
> node is in an ensemble and whether it is a leader or follower, but they will
> not tell you what ensemble it is in or list any other node information. If
> someone has a way to do this please post, because I have looked all over.
>
> We make use of Scalr and that adds an additional layer to automation. I
> run orchestration scripts in Scalr that discover the other running
> zookeeper nodes in (what Scalr calls) the same Farm Role. This script
> configures each node with the information for the other nodes and does a
> restart of Zookeeper to bring them into an ensemble. Then it collects this
> information and stores the IP addresses in a Global Variable in Scalr
> that is then available to Solr. Changes to the ensemble are reflected in
> this variable, which is then passed to the Solr cloud, where a restart of the
> service will update the zookeeper information in Solr. We are working
> towards moving this functionality to Consul, where it will register the
> zookeeper ensemble information, allowing Solr to pull it from Consul as
> opposed to relying on Global Variables. What I am getting at is that
> outside the 3.5.x branch, automating this takes a bit of work.
>
>
> --
> Daniel S Washko
> Solutions Architect
>
> dwashko@gannett.com
>
> On 7/11/17, 6:58 PM, "Luigi Tagliamonte" wrote:
>
> Hello, Zookeeper Users!
> I'm currently configuring/exploring zookeeper.
> I'm reading a lot about ensembles and scaling and I have some questions
> that I'd like to submit to an expert audience.
> I need zookeeper as a Kafka dependency, so my deployment goal is
> ensemble reliability, especially because the latest Kafka version uses
> zookeeper only to store the leader partition.
>
> Here are my questions:
>
> - To manage the ensemble I decided to use Exhibitor - what do you think
> about it? Should I look at something else?
>
> - Is there a way to discover all the servers of an ensemble apart from
> using 4LTR? I wonder if it is possible to do something like in Cassandra,
> where you contact one node and you can get the whole cluster info from it.
> Should I configure just a DNS entry per zookeeper server? This doesn't
> scale well in a dynamic env like servers in autoscaling.
>
> - Is there any white paper that shows a real scalable and reliable
> Zookeeper installation? Any resources are welcome!
>
> Thank you all in advance!
> Regards
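
For readers who want to turn the server list Alex mentions into structured data, here is a minimal Python sketch. It is not an official client and makes several assumptions: that the configuration text follows the 3.5.x dynamic-configuration line shape `server.<id>=<host>:<quorumPort>:<electionPort>[:<role>];[<clientAddress>:]<clientPort>` followed by a `version=` line, and that a missing role means participant. The `Server` type, the `parse_ensemble_config` function, and the hostnames in `SAMPLE` are hypothetical names made up for illustration. Fetching the text from a live ensemble is deliberately left out.

```python
import re
from typing import List, NamedTuple, Optional


class Server(NamedTuple):
    """One ensemble member as described by a single config line."""
    server_id: int
    host: str
    quorum_port: int
    election_port: int
    role: str
    client_port: Optional[int]


# Assumed line shape (see lead-in); IPv6 literals are not handled here.
_SERVER_LINE = re.compile(
    r"^server\.(\d+)=([^:;]+):(\d+):(\d+)"
    r"(?::(participant|observer))?"
    r"(?:;(?:[^:;]+:)?(\d+))?$"
)


def parse_ensemble_config(text: str) -> List[Server]:
    """Parse config text into Server entries, skipping non-server lines."""
    servers = []
    for line in text.splitlines():
        match = _SERVER_LINE.match(line.strip())
        if match is None:
            continue  # e.g. the trailing "version=..." line or blank lines
        sid, host, qport, eport, role, cport = match.groups()
        servers.append(
            Server(
                server_id=int(sid),
                host=host,
                quorum_port=int(qport),
                election_port=int(eport),
                role=role or "participant",  # assumption: default role
                client_port=int(cport) if cport else None,
            )
        )
    return servers


# Illustrative input only; these hosts and the version value are made up.
SAMPLE = """\
server.1=zk1.example.com:2888:3888:participant;2181
server.2=zk2.example.com:2888:3888:participant;0.0.0.0:2181
server.3=zk3.example.com:2888:3888:observer;2181
version=100000000
"""

if __name__ == "__main__":
    for server in parse_ensemble_config(SAMPLE):
        print(server.server_id, server.host, server.role, server.client_port)
```

A client could feed the resulting host and client-port pairs into whatever builds its connection string, which is the kind of single-node discovery Luigi asks about. The `version=` line is skipped because it does not match the server pattern.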