Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id EC6C4200C44 for ; Mon, 13 Mar 2017 03:50:08 +0100 (CET) Received: by cust-asf.ponee.io (Postfix) id EAF48160B8A; Mon, 13 Mar 2017 02:50:08 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id 3F20A160B77 for ; Mon, 13 Mar 2017 03:50:08 +0100 (CET) Received: (qmail 165 invoked by uid 500); 13 Mar 2017 02:50:07 -0000 Mailing-List: contact commits-help@airflow.incubator.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@airflow.incubator.apache.org Delivered-To: mailing list commits@airflow.incubator.apache.org Received: (qmail 156 invoked by uid 99); 13 Mar 2017 02:50:07 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd2-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 13 Mar 2017 02:50:07 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd2-us-west.apache.org (ASF Mail Server at spamd2-us-west.apache.org) with ESMTP id 01F4B1A7B7B for ; Mon, 13 Mar 2017 02:50:07 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd2-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 0.651 X-Spam-Level: X-Spam-Status: No, score=0.651 tagged_above=-999 required=6.31 tests=[RP_MATCHES_RCVD=-0.001, SPF_NEUTRAL=0.652] autolearn=disabled Received: from mx1-lw-us.apache.org ([10.40.0.8]) by localhost (spamd2-us-west.apache.org [10.40.0.9]) (amavisd-new, port 10024) with ESMTP id MuRCk1cDdpkR for ; Mon, 13 Mar 2017 02:50:06 +0000 (UTC) Received: from mailrelay1-us-west.apache.org (mailrelay1-us-west.apache.org [209.188.14.139]) by mx1-lw-us.apache.org (ASF Mail Server at mx1-lw-us.apache.org) with ESMTP id ED36060E08 for ; Mon, 13 Mar 2017 02:50:05 +0000 (UTC) Received: from jira-lw-us.apache.org (unknown [207.244.88.139]) by mailrelay1-us-west.apache.org (ASF Mail Server at mailrelay1-us-west.apache.org) with ESMTP id 63A66E030C for ; Mon, 13 Mar 2017 02:50:05 +0000 (UTC) Received: from jira-lw-us.apache.org (localhost [127.0.0.1]) by jira-lw-us.apache.org (ASF Mail Server at jira-lw-us.apache.org) with ESMTP id 135D8243A6 for ; Mon, 13 Mar 2017 02:50:05 +0000 (UTC) Date: Mon, 13 Mar 2017 02:50:05 +0000 (UTC) From: "ASF subversion and git services (JIRA)" To: commits@airflow.incubator.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Commented] (AIRFLOW-910) Parallelize dag runs in backfills MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 archived-at: Mon, 13 Mar 2017 02:50:09 -0000 [ https://issues.apache.org/jira/browse/AIRFLOW-910?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15906795#comment-15906795 ] ASF subversion and git services commented on AIRFLOW-910: --------------------------------------------------------- Commit dcc8ede5c1a2f6819b151dd5ce839f0a0917313a in incubator-airflow's branch refs/heads/v1-8-test from [~bolke] [ https://git-wip-us.apache.org/repos/asf?p=incubator-airflow.git;h=dcc8ede ] [AIRFLOW-910] Use parallel task execution for backfills The refactor to use dag runs in backfills caused a regression in task execution performance as dag runs were executed sequentially. Next to that, the backfills were non deterministic due to the random execution of tasks, causing root tasks being added to the non ready list too soon. This updates the backfill logic as follows: * Parallelize execution of tasks * Use a leave first execution model * Replace state updates from the executor by task based only Closes #2107 from bolkedebruin/AIRFLOW-910 > Parallelize dag runs in backfills > --------------------------------- > > Key: AIRFLOW-910 > URL: https://issues.apache.org/jira/browse/AIRFLOW-910 > Project: Apache Airflow > Issue Type: Sub-task > Components: backfill > Affects Versions: 1.8.0rc4 > Reporter: Bolke de Bruin > Assignee: Bolke de Bruin > Priority: Blocker > Fix For: 1.8.0 > > > Currently dag runs are executed sequentially while backfilling. This is a regression and slows down the processing off tasks. > [~aoen] -- This message was sent by Atlassian JIRA (v6.3.15#6346)