Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id DF69D200C32 for ; Wed, 22 Feb 2017 15:52:09 +0100 (CET) Received: by cust-asf.ponee.io (Postfix) id DDE44160B49; Wed, 22 Feb 2017 14:52:09 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id 0CCC6160B67 for ; Wed, 22 Feb 2017 15:52:08 +0100 (CET) Received: (qmail 99808 invoked by uid 500); 22 Feb 2017 14:52:07 -0000 Mailing-List: contact user-help@zookeeper.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@zookeeper.apache.org Delivered-To: mailing list user@zookeeper.apache.org Received: (qmail 99794 invoked by uid 99); 22 Feb 2017 14:52:07 -0000 Received: from mail-relay.apache.org (HELO mail-relay.apache.org) (140.211.11.15) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 22 Feb 2017 14:52:07 +0000 Received: from mail-wr0-f179.google.com (mail-wr0-f179.google.com [209.85.128.179]) by mail-relay.apache.org (ASF Mail Server at mail-relay.apache.org) with ESMTPSA id 4E92C1A00A2 for ; Wed, 22 Feb 2017 14:52:07 +0000 (UTC) Received: by mail-wr0-f179.google.com with SMTP id z61so3639293wrc.1 for ; Wed, 22 Feb 2017 06:52:07 -0800 (PST) X-Gm-Message-State: AMke39malkYaRaD3FPIVXiEhS3KH+m9UNCTdOmlCFl5p62Exx1wkzXcnAMc3gbqFmNpHNd6RWdGqqlYqdTg7fw== X-Received: by 10.223.139.2 with SMTP id n2mr24965222wra.67.1487775126084; Wed, 22 Feb 2017 06:52:06 -0800 (PST) MIME-Version: 1.0 Received: by 10.80.210.193 with HTTP; Wed, 22 Feb 2017 06:52:05 -0800 (PST) In-Reply-To: References: <82099c83-186e-e5ba-f18d-3b4f78785386@wingcon.com> From: Camille Fournier Date: Wed, 22 Feb 2017 09:52:05 -0500 X-Gmail-Original-Message-ID: Message-ID: Subject: Re: etcd performance comparison To: "user@zookeeper.apache.org" Content-Type: multipart/alternative; boundary=f403045ea778e5681005491fa38e archived-at: Wed, 22 Feb 2017 14:52:10 -0000 --f403045ea778e5681005491fa38e Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable Even just writing about what objective tests might look like would be a good start! I'm happy to read draft posts by anyone who wishes to write on the topic. C On Wed, Feb 22, 2017 at 9:36 AM, Jordan Zimmerman < jordan@jordanzimmerman.com> wrote: > IMO there is tremendous FUD in the etcd world. It's the new cool toy and > ZK feels old. To suggest that ZK does not do Service Discovery is > ludicrous. That was one of the very first Curator recipes. > > It might be useful to counter this trend objectively. I'd be interested i= n > helping. Anyone else? We can create objective tests that compare common u= se > cases. > > =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D > Jordan Zimmerman > > > On Feb 22, 2017, at 11:21 AM, Camille Fournier > wrote: > > > > I think that my biggest feeling about this blog post (besides not > > disclosing the disk setup clearly) is that, ZK is really not designed t= o > > have massive write throughput. I would not traditionally recommend > someone > > use ZK in that manner. If we think that evolving it to be useful for su= ch > > workloads would be good, it could be an interesting community discussio= n, > > but it's really not the purpose of the system design. > > > > I'd love to see a more read/write mixed load test for the systems, as > well > > as a blog post about why you might choose different systems for differe= nt > > workloads. I think developers have a hard time really understanding the > > tradeoffs they are choosing in these systems, because of the nuance > around > > them. > > > > For me, I'm more concerned about the fact that I saw a talk yesterday > that > > mentioned both etcd and consul as options for service discovery but not > ZK. > > That feels like a big hit for our community. Orthogonal to this topic, > just > > feels worth mentioning. > > > > C > > > > On Wed, Feb 22, 2017 at 4:05 AM, Alexander Binzberger < > > alexander.binzberger@wingcon.com> wrote: > > > >> 1. Seams like it might make sense to increase snapCount for those test= s. > >> > >> 2. ZK write performance also depends on the number of watches - afaik. > >> This is not mentioned and not tested. > >> > >> 3. Does it really make sense to "blast" the store? Wouldn't it make mo= re > >> sense to compare fixed write/read per clients rates? > >> > >> > >> > >>> Am 22.02.2017 um 05:53 schrieb Michael Han: > >>> > >>> Kudus to etcd team for making this blog and thanks for sharing. > >>> > >>> I feel like they're running a questionable configuration. > >>>>> > >>>> Looks like the test configuration > >>> >>> 235c20878a8637f24608c/agent/agent_zookeeper.go#L29> > >>> does not have separate directory for transaction logs and snapshots a= s > it > >>> does not have configuration for dataLogDir. So the configuration is n= ot > >>> optimal. Would be interesting to see the numbers with updated > >>> configuration. > >>> > >>> They mention that ZK snapshots "stop the world", and maybe I'm > mistaken, > >>>>> but > >>>>> > >>>> I didn't think that was right > >>> > >>> Right, ZK snapshots does not block processing pipeline as it is fuzzy > and > >>> it is done in a separate thread. The warning message "*To busy to sna= p, > >>> skipping*" mentioned in the blog is a sign that a snap shot is also > >>> generating in progress, which could be caused by the write contention= s > >>> created from serializing transaction logs that leads to longer than > >>> expected snap shot generation. So "stop the world" is a side effect o= f > >>> resource contention, but not a design intention IMO. > >>> > >>> Also the blog mentions ZooKeeper as a key value store and I also want > to > >>> point out that ZooKeeper is more than a (metadata) key value store ha= s > >>> features such as sessions, ephemerals, and watchers, and these design > >>> choices were made I believe to make ZK more useful as a coordination > >>> kernel, and these design choice also (negatively) contribute to the > >>> performance and scalability of ZooKeeper. > >>> > >>> > >>> On Tue, Feb 21, 2017 at 4:32 PM, Dan Benediktson < > >>> dbenediktson@twitter.com.invalid> wrote: > >>> > >>> I kind of wonder about them only using one disk. I haven't experiment= ed > >>>> with this in ZooKeeper, nor have I ever been a DBA, but with > traditional > >>>> database systems (which ZooKeeper should be basically identical to, = in > >>>> this > >>>> regard), it's a pretty common recommendation to put snapshots and > TxLogs > >>>> on > >>>> different drives, for the obvious reason of avoiding one of the > biggest > >>>> problems laid out in that blog post: when snapshot happens, it > contends > >>>> with your log flushes, causing write latencies to explode. Suddenly > you > >>>> have tons more IO, and where it used to be nicely sequential, now it= 's > >>>> heavily randomized because of the two competing writers. It's kind o= f > the > >>>> nature of benchmarks that there's always something you can nitpick, > but > >>>> still, I feel like they're running a questionable configuration. > >>>> > >>>> They mention that ZK snapshots "stop the world", and maybe I'm > mistaken, > >>>> but I didn't think that was right - I thought they were just slowing > >>>> everything down because they write a lot and contend a lot. I'm pret= ty > >>>> sure > >>>> ZK snapshots are fuzzy over a range of transactions, and transaction= s > >>>> keep > >>>> applying during the snapshot, right? > >>>> > >>>> Thanks, > >>>> Dan > >>>> > >>>> On Tue, Feb 21, 2017 at 2:24 PM, Benjamin Mahler < > bmahler@mesosphere.io> > >>>> wrote: > >>>> > >>>> I'm curious if folks here have seen the following write performance > >>>>> comparison that was done by CoreOS on etc, Consul, and ZooKeeper: > >>>>> https://coreos.com/blog/performance-of-etcd.html > >>>>> > >>>>> Sounds like performance comparison of reads and updates are coming > next. > >>>>> Are there any thoughts from folks here on this comparison so far? > >>>>> > >>>>> Thanks, > >>>>> Ben > >>>>> > >>>>> > >> -- > >> Alexander Binzberger > >> System Designer - WINGcon AG > >> Tel. +49 7543 966-119 > >> > >> Sitz der Gesellschaft: Langenargen > >> Registergericht: ULM, HRB 734260 > >> USt-Id.: DE232931635, WEEE-Id.: DE74015979 > >> Vorstand: thomasThomas Ehrle (Vorsitz), Fritz R. Paul (Stellvertreter)= , > >> Tobias Tre=C3=9F > >> Aufsichtsrat: J=C3=BCrgen Maucher (Vorsitz), Andreas Paul (Stellvertre= ter), > >> Martin Sauter > >> > >> > --f403045ea778e5681005491fa38e--