Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id 684AD200D11 for ; Mon, 2 Oct 2017 22:07:45 +0200 (CEST) Received: by cust-asf.ponee.io (Postfix) id 66BD9160BCB; Mon, 2 Oct 2017 20:07:45 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id B408F1609EF for ; Mon, 2 Oct 2017 22:07:44 +0200 (CEST) Received: (qmail 34520 invoked by uid 500); 2 Oct 2017 20:07:43 -0000 Mailing-List: contact dev-help@aurora.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@aurora.apache.org Delivered-To: mailing list dev@aurora.apache.org Received: (qmail 34434 invoked by uid 99); 2 Oct 2017 20:07:43 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd3-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 02 Oct 2017 20:07:43 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd3-us-west.apache.org (ASF Mail Server at spamd3-us-west.apache.org) with ESMTP id B5C95199F0D; Mon, 2 Oct 2017 20:07:42 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd3-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 2.379 X-Spam-Level: ** X-Spam-Status: No, score=2.379 tagged_above=-999 required=6.31 tests=[DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, HTML_MESSAGE=2, RCVD_IN_DNSWL_NONE=-0.0001, RCVD_IN_MSPIKE_H3=-0.01, RCVD_IN_MSPIKE_WL=-0.01, RCVD_IN_SORBS_SPAM=0.5, SPF_PASS=-0.001] autolearn=disabled Authentication-Results: spamd3-us-west.apache.org (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com Received: from mx1-lw-eu.apache.org ([10.40.0.8]) by localhost (spamd3-us-west.apache.org [10.40.0.10]) (amavisd-new, port 10024) with ESMTP id GKAIjU10qF2W; Mon, 2 Oct 2017 20:07:41 +0000 (UTC) Received: from mail-oi0-f41.google.com (mail-oi0-f41.google.com [209.85.218.41]) by mx1-lw-eu.apache.org (ASF Mail Server at mx1-lw-eu.apache.org) with ESMTPS id DC4995FB40; Mon, 2 Oct 2017 20:07:40 +0000 (UTC) Received: by mail-oi0-f41.google.com with SMTP id p187so10955161oif.4; Mon, 02 Oct 2017 13:07:40 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=mime-version:from:date:message-id:subject:to:cc; bh=aBo72N0YXG9E2WqAyte8sHcoRSdY4PN878aHSwpUqxI=; b=Ia5xeYmk0w/6c9in8JgzLFEzKJdp1ob5pdYw7AL2mEEgm2zKKJPtbeqBgEA1H89XOn s6vnWVhN0DGWuMnHx1dvh2zMfcsAJRZzClsUXs3MZ50PBdkJeE/T9W4AKh6HAl7Uw8A8 BS40bTJ0RRahMZCNHfq0kz0CgsxDdOU2UUzhzDu4+muMd244ZmwQuV/rPg+iaIy1wS4k p6Ckfskgdpfl++ElvmxBGrLgqjocFRLEj5bmtzNA2N3pIXWFCNDyKsd8zNmmmYzFOqYP kPXXkBuAn+q1JQEFHPWTtXIK82tJ24JYD0pLercbP54A+XJuppOHekgKqnnLgtqabyHv Wkpg== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:from:date:message-id:subject:to:cc; bh=aBo72N0YXG9E2WqAyte8sHcoRSdY4PN878aHSwpUqxI=; b=Yz8pv1BWs0PJtBZah1ZVcOjdZ2sR1AcNiz8Q9qBPIwni6JxgyxwRCbgMu0K5/HlJGs 2Y/SYLspLIaxJucVUMPLRvqbUcqUp8rrQEQcQaj1RKGKfk8MPDdXoEawok4wAobouf19 njZ+4UrGEaagESsxq75ux4ndYeuchj14nTwJmFMD5e9U5mV/FMCKRZYC66jfdULITVlz nJtFxgKvYTffOxU7qFRXLOEBQtEyRgPrBYtIGZTKW8BmLpyFHvXblVcFGKUmvNzT3/36 us+IwgvQjE+LSvJ5b7IMDRMXT/AbNMxawt6uKv0mx06fKXf6QemfN6rvbxOdU2jgOaug r4Iw== X-Gm-Message-State: AMCzsaWFks+T2c25fA/KIwiEN9RXRmjloZgXlKOzGn2UqWVCakNEYiv4 4ZwYAQy6ZPgHUiCUVwU/OalDmDhuqHaof+SeAza2HQ== X-Google-Smtp-Source: AOwi7QDgGozbyVlxKEeHgbmDoETojxRfKk9EnFQ1fCNb0NEuUFeWXxwD+kNQuQrBCI4P/j9Y2D4T3UwZOMdtjLKEImk= X-Received: by 10.157.43.88 with SMTP id f24mr3728623otd.272.1506974859241; Mon, 02 Oct 2017 13:07:39 -0700 (PDT) MIME-Version: 1.0 Received: by 10.157.69.151 with HTTP; Mon, 2 Oct 2017 13:06:58 -0700 (PDT) From: Renan DelValle Rueda Date: Mon, 2 Oct 2017 13:06:58 -0700 Message-ID: Subject: Extending Aurora Pluggable Scheduling Proposal To: dev@aurora.apache.org Cc: user@aurora.apache.org Content-Type: multipart/alternative; boundary="001a113d1da42bb109055a95edee" archived-at: Mon, 02 Oct 2017 20:07:45 -0000 --001a113d1da42bb109055a95edee Content-Type: text/plain; charset="UTF-8" Hello fellow Aurorans, I'd like to share a proposal doc that seeks to lay out a roadmap for bringing in new scheduling features to Aurora. David McLaughlin did a fantastic job of getting the ball rolling with the pluggable scheduling patches he contributed (1) and I'd like to expand upon that work. The overarching idea of this proposal is that everyone has different scheduling needs and it would be great to enhance Aurora to allow operators to meet organization specific scheduling needs without imposing them on the rest of the community. The features outlined in this proposal are based upon principles from Fenzo(2) which have enjoyed great success powering Mantis(3) and Titus(4) at Netflix. Finally, since this proposal is about scheduling enhancements, I also thought it would be pertinent to include talk of a feature that attempts to avoid hosting tasks on misbehaving agents. This is due to the fact that some of the scheduling policies introduced by this proposal can amplify the negative effect a bad node can have on performance. (I.e. we keep on choosing the "bad" node to schedule on and the task keeps on failing through no fault of its own.) Would love to hear some feedback on these ideas and/or opinions on what the next steps should be if we were to embark on this journey. https://docs.google.com/document/d/11ArMA53chtK-Zb_ KPMV7l_bCvTrUb005XlqzGQ2fTP4/edit# Thanks! -Renan 1. https://lists.apache.org/thread.html/50caf01283144ee9dacd24d3fb481a 2ca6120ceaa1289fd5b48620a4@%3Cdev.aurora.apache.org%3E 2. https://github.com/Netflix/Fenzo 3. https://medium.com/netflix-techblog/stream-processing- with-mantis-78af913f51a6 4. https://medium.com/netflix-techblog/the-evolution-of- container-usage-at-netflix-3abfc096781b --001a113d1da42bb109055a95edee--