Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id 75C47200C47 for ; Thu, 30 Mar 2017 19:24:30 +0200 (CEST) Received: by cust-asf.ponee.io (Postfix) id 742B7160B8B; Thu, 30 Mar 2017 17:24:30 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id B9925160B7E for ; Thu, 30 Mar 2017 19:24:29 +0200 (CEST) Received: (qmail 9242 invoked by uid 500); 30 Mar 2017 17:24:27 -0000 Mailing-List: contact dev-help@aurora.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@aurora.apache.org Delivered-To: mailing list dev@aurora.apache.org Received: (qmail 9231 invoked by uid 99); 30 Mar 2017 17:24:27 -0000 Received: from mail-relay.apache.org (HELO mail-relay.apache.org) (140.211.11.15) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 30 Mar 2017 17:24:27 +0000 Received: from mail-yb0-f181.google.com (mail-yb0-f181.google.com [209.85.213.181]) by mail-relay.apache.org (ASF Mail Server at mail-relay.apache.org) with ESMTPSA id 5D94E1A0323 for ; Thu, 30 Mar 2017 17:24:27 +0000 (UTC) Received: by mail-yb0-f181.google.com with SMTP id l201so491000ybf.0 for ; Thu, 30 Mar 2017 10:24:27 -0700 (PDT) X-Gm-Message-State: AFeK/H1JHgJD+h+yuq5X4Abi0a/OrW3nWkBxODZHlFzc/9Tew9tCxNBTvkDhEqjtxzSmCrDgNX88UIJZnmeDUYWj X-Received: by 10.37.219.141 with SMTP id g135mr694890ybf.19.1490894666241; Thu, 30 Mar 2017 10:24:26 -0700 (PDT) MIME-Version: 1.0 Received: by 10.13.225.72 with HTTP; Thu, 30 Mar 2017 10:24:05 -0700 (PDT) In-Reply-To: References: From: Zameer Manji Date: Thu, 30 Mar 2017 10:24:05 -0700 X-Gmail-Original-Message-ID: Message-ID: Subject: Re: Future of storage in Aurora To: dev@aurora.apache.org Content-Type: multipart/alternative; boundary=94eb2c186a96fb910f054bf5f68c archived-at: Thu, 30 Mar 2017 17:24:30 -0000 --94eb2c186a96fb910f054bf5f68c Content-Type: text/plain; charset=UTF-8 I don't object to changes to storage so long as we have a migration plan and a design doc. I'm also not opposed to radical revisits of storage, including overhauling what we store and where we store it. For example, instead of storing our `TaskConfig` objects could we store Mesos `TaskInfo` objects instead? Could we store data outside of the scheduler like in Cassandra? Should we have a high level 'Job' store to make querying for job level data easier? On Thu, Mar 30, 2017 at 10:16 AM, David McLaughlin wrote: > Hi all, > > I'd like to start a discussion around storage in Aurora. > > I think one of the biggest mistakes we made in migrating our storage to H2 > was deleting the memory stores as we moved. We made a pretty big bet that > we could eventually make H2/relational databases work. I don't think that > bet has paid off and that we need to revisit the direction we're taking. > > My belief is that the current H2/MyBatis approach is untenable for large > production clusters, at least without changing our current single-master > architecture. At Twitter we are already having to fight to keep GC > manageable even without DbTaskStore enabled, so I don't see a path forward > where we could eventually enable that. So far experiments with H2 off-heap > storage have provided marginal (if any) gains. > > Would anyone object to restoring the in-memory stores and creating new > implementations for the missing ones (UpdateStore)? I'd even go further and > propose that we consider in-memory H2 and MyBatis a failed experiment and > we drop that storage layer completely. > > Cheers, > David > > -- > Zameer Manji > --94eb2c186a96fb910f054bf5f68c--