airflow-commits mailing list archives

From "Apache Spark (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (AIRFLOW-1117) Increase the default value of min_file_process_interval
Date Sun, 02 Sep 2018 18:07:03 GMT

    [ https://issues.apache.org/jira/browse/AIRFLOW-1117?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16601578#comment-16601578 ]

Apache Spark commented on AIRFLOW-1117:
---------------------------------------

User 'mhousley' has created a pull request for this issue:
https://github.com/apache/incubator-airflow/pull/2825

> Increase the default value of min_file_process_interval
> -------------------------------------------------------
>
>                 Key: AIRFLOW-1117
>                 URL: https://issues.apache.org/jira/browse/AIRFLOW-1117
>             Project: Apache Airflow
>          Issue Type: Wish
>          Components: scheduler
>    Affects Versions: 1.8.0
>            Reporter: Keisuke Nishida
>            Priority: Minor
>         Attachments: screenshot-1.png
>
>
> I observed high CPU usage after upgrading Airflow from 1.7.1.3 to 1.8.0.
> I found that Airflow was loading DAG files repeatedly, which consumed most of the CPU time in my Airflow instance. I realized that Airflow 1.8 introduced a new configuration variable, {{min_file_process_interval}}, with a default value of 0. This means Airflow reloads DAG files one after another without any interval.
> Can you increase the default value of {{min_file_process_interval}} to at least the value of {{scheduler_heartbeat_sec}}, which is 5 seconds?
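
The request above compares two settings. As a rough sketch only, the Python snippet below reads both values from airflow.cfg-style text and reports whether {{min_file_process_interval}} is at least {{scheduler_heartbeat_sec}}. It assumes both options live in the [scheduler] section. The helper name check_intervals and the EXAMPLE text are hypothetical and not part of Airflow. The fallback values mirror the 1.8.0 defaults stated in the description (0 and 5).

```python
import configparser


def check_intervals(cfg_text):
    """Hypothetical helper (not part of Airflow).

    Returns (min_file_process_interval, scheduler_heartbeat_sec, ok),
    where ok is True when the first value is at least the second.
    """
    # Disable interpolation so '%' characters elsewhere in the file
    # are not treated as substitution markers.
    parser = configparser.ConfigParser(interpolation=None)
    parser.read_string(cfg_text)
    # Fallbacks are the defaults quoted in the issue description,
    # used when the option (or the whole section) is missing.
    min_interval = parser.getfloat(
        "scheduler", "min_file_process_interval", fallback=0
    )
    heartbeat = parser.getfloat(
        "scheduler", "scheduler_heartbeat_sec", fallback=5
    )
    return min_interval, heartbeat, min_interval >= heartbeat


# Example configuration text with the value requested in the issue.
EXAMPLE = """
[scheduler]
scheduler_heartbeat_sec = 5
min_file_process_interval = 5
"""

if __name__ == "__main__":
    print(check_intervals(EXAMPLE))
```

With the settings left at the defaults described in the issue, the last element of the result would be False, which matches the situation the reporter is asking to change.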



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
