Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id 2FF4C200C1B for ; Tue, 14 Feb 2017 22:12:16 +0100 (CET) Received: by cust-asf.ponee.io (Postfix) id 2E7AC160B5F; Tue, 14 Feb 2017 21:12:16 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id 770DF160B45 for ; Tue, 14 Feb 2017 22:12:15 +0100 (CET) Received: (qmail 89762 invoked by uid 500); 14 Feb 2017 21:12:14 -0000 Mailing-List: contact issues-help@aurora.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@aurora.apache.org Delivered-To: mailing list issues@aurora.apache.org Received: (qmail 89753 invoked by uid 99); 14 Feb 2017 21:12:14 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd2-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 14 Feb 2017 21:12:14 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd2-us-west.apache.org (ASF Mail Server at spamd2-us-west.apache.org) with ESMTP id 4F6D11A0116 for ; Tue, 14 Feb 2017 21:12:14 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd2-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: -1.999 X-Spam-Level: X-Spam-Status: No, score=-1.999 tagged_above=-999 required=6.31 tests=[KAM_LAZY_DOMAIN_SECURITY=1, RP_MATCHES_RCVD=-2.999] autolearn=disabled Received: from mx1-lw-eu.apache.org ([10.40.0.8]) by localhost (spamd2-us-west.apache.org [10.40.0.9]) (amavisd-new, port 10024) with ESMTP id n5-OcGsvZOpF for ; Tue, 14 Feb 2017 21:12:13 +0000 (UTC) Received: from mailrelay1-us-west.apache.org (mailrelay1-us-west.apache.org [209.188.14.139]) by mx1-lw-eu.apache.org (ASF Mail Server at mx1-lw-eu.apache.org) with ESMTP id 5500D5FB0F for ; Tue, 14 Feb 2017 21:12:13 +0000 (UTC) Received: from jira-lw-us.apache.org (unknown [207.244.88.139]) by mailrelay1-us-west.apache.org (ASF Mail Server at mailrelay1-us-west.apache.org) with ESMTP id 56482E00C7 for ; Tue, 14 Feb 2017 21:11:42 +0000 (UTC) Received: from jira-lw-us.apache.org (localhost [127.0.0.1]) by jira-lw-us.apache.org (ASF Mail Server at jira-lw-us.apache.org) with ESMTP id 0A28B24119 for ; Tue, 14 Feb 2017 21:11:42 +0000 (UTC) Date: Tue, 14 Feb 2017 21:11:42 +0000 (UTC) From: "David McLaughlin (JIRA)" To: issues@aurora.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Created] (AURORA-1893) Add ability to prune tasks using aurora_admin MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 archived-at: Tue, 14 Feb 2017 21:12:16 -0000 David McLaughlin created AURORA-1893: ---------------------------------------- Summary: Add ability to prune tasks using aurora_admin Key: AURORA-1893 URL: https://issues.apache.org/jira/browse/AURORA-1893 Project: Aurora Issue Type: Task Reporter: David McLaughlin Currently task history pruning is configured with three main settings: maximum terminal tasks per job, maximum time to retain terminal tasks and a minimum time to retain terminal tasks. There are times where a combination of bad actors in the cluster and the minimum time to retain terminal tasks can lead to incredibly bloated task store sizes, leading to serious problems with GC pressure during task store queries, and also when creating and persisting snapshots. At Twitter we've run into this and have had to respond by redeploying the Scheduler with more aggressive task pruning settings - which affects every user in the cluster. What we'd like is an endpoint in aurora_admin that accepts a TaskQuery and will prune all inactive tasks that match. This should allow us to limit the pruning by role, environment and also limit the number of tasks pruned. -- This message was sent by Atlassian JIRA (v6.3.15#6346)