From: Alex Van Boxel
Date: Tue, 31 Jan 2017 16:25:06 +0000
Subject: Re: Airflow 1.8.0 BETA 5
To: dev@airflow.incubator.apache.org

I'll try to identify the core problem

On Tue, Jan 31, 2017, 16:43 Bolke de Bruin wrote:

> Hey Alex
>
> Can you provide some info on the scheduler paths thing. I don't have/see
> that issue. Do you mean cli paths or by cfg? Jira would be nice in any case.
>
> I don't think the dag processor respects cli parameters.
>
> Bolke
>
> Sent from my iPhone
>
> > On 31 Jan 2017, at 15:10, Alex Van Boxel wrote:
> >
> > It's quite hard to share my complete dags. I don't have this locally, but I
> > have it in my production environment where I use Celery. I rolled back to
> > beta 4 to make it work again.
> >
> > Also @bolke the scheduler logs don't respect the log path.
> >
> > On Tue, Jan 31, 2017 at 1:02 AM Dan Davydov wrote:
> >
> >> @Alex
> >> I'm not able to reproduce locally (assuming the two python files are in the
> >> same folder or on your PYTHONPATH). I don't see that import error
> >> anyways.
> >>
> >> Just in case, what is your complete DAG definition? Is anyone else able to
> >> repro?
> >>
> >>> On Mon, Jan 30, 2017 at 3:09 PM, Alex Van Boxel wrote:
> >>>
> >>> Well this means none of my DAGs work anymore:
> >>>
> >>> you just can't do this anymore:
> >>>
> >>> file bqschema.py with
> >>>
> >>> def marketing_segment():
> >>>     return [
> >>>         {"name": "user_id", "type": "integer", "mode": "nullable"},
> >>>         {"name": "bucket_date", "type": "timestamp", "mode": "nullable"},
> >>>         {"name": "segment_main", "type": "string", "mode": "nullable"},
> >>>         {"name": "segment_sub", "type": "integer", "mode": "nullable"},
> >>>
> >>>
> >>> In marketing_segmentation.py:
> >>>
> >>>
> >>> import bqschema
> >>>
> >>> Gives an error:
> >>>
> >>> Traceback (most recent call last):
> >>>   File "/usr/local/lib/python2.7/site-packages/airflow-1.8.0b5+apache.incubating-py2.7.egg/airflow/models.py", line 264, in process_file
> >>>     m = imp.load_source(mod_name, filepath)
> >>>   File "/home/airflow/dags/marketing_segmentation.py", line 17, in <module>
> >>>     import bqschema
> >>> ImportError: No module named bqschema
> >>>
> >>> *I don't think this is incorrect?!*
> >>>
> >>>
> >>>
> >>> On Mon, Jan 30, 2017 at 11:46 PM Dan Davydov wrote:
> >>>
> >>>> The latest commit
> >>>> fixed a regression since 1.7 that files with parsing
> >>>> errors no longer showed up on the UI.
> >>>>
> >>>> On Mon, Jan 30, 2017 at 2:42 PM, Alex Van Boxel wrote:
> >>>>
> >>>>> Just installed beta 5 on our dev environment and it lit up like a Christmas
> >>>>> tree. I got a screen full of import errors. I see that the latest commit
> >>>>> did something with import errors... is it correct?!
> >>>>>
> >>>>> On Sun, Jan 29, 2017 at 4:37 PM Bolke de Bruin wrote:
> >>>>>
> >>>>>> Hey Boris
> >>>>>>
> >>>>>> The scheduler is a bit more aggressive and can use multiple processors, so
> >>>>>> higher CPU usage is actually a good thing.
> >>>>>>
> >>>>>> In case it is really out of hand, look at the new scheduler options and
> >>>>>> heartbeat options (see PR for updating.md, not in the beta yet).
> >>>>>>
> >>>>>> Bolke
> >>>>>>
> >>>>>> Sent from my iPhone
> >>>>>>
> >>>>>>> On 29 Jan 2017, at 15:35, Boris Tyukin wrote:
> >>>>>>>
> >>>>>>> I am not sure if it is my config or something, but it looks like after the
> >>>>>>> upgrade and start of the scheduler, airflow would totally hose the CPU. The
> >>>>>>> reason is two new examples that start running right away - latest only and
> >>>>>>> latest with trigger. Once I pause them, CPU goes back to idle. Is this
> >>>>>>> because dags are now not paused by default like they were before?
> >>>>>>>
> >>>>>>> As I mentioned before, I also had to upgrade MySQL to 5.7 - if someone
> >>>>>>> needs step by step instructions, make sure to follow all steps precisely
> >>>>>>> here for the in-place upgrade or you will have a heck of a time (like me).
> >>>>>>>
> >>>>>>> https://dev.mysql.com/doc/refman/5.7/en/upgrading.html#upgrade-procedure-inplace
> >>>>>>>
> >>>>>>> BTW the official Oracle repository for Oracle Linux only has MySQL 5.6 - for
> >>>>>>> 5.7 you have to use the MySQL community repo.
> >>>>>>>
> >>>>>>>> On Sat, Jan 28, 2017 at 10:07 AM, Bolke de Bruin <bdbruin@gmail.com> wrote:
> >>>>>>>>
> >>>>>>>> Hi All,
> >>>>>>>>
> >>>>>>>> I have made the FIFTH beta of Airflow 1.8.0 available at:
> >>>>>>>> https://dist.apache.org/repos/dist/dev/incubator/airflow/ , public keys
> >>>>>>>> are available at https://dist.apache.org/repos/dist/release/incubator/airflow/ .
> >>>>>>>> It is tagged with a local version “apache.incubating” so it allows
> >>>>>>>> upgrading from earlier releases.
> >>>>>>>>
> >>>>>>>> Issues fixed:
> >>>>>>>> * Parsing errors not showing up in UI fixing a regression**
> >>>>>>>> * Scheduler would terminate immediately if no dag files present
> >>>>>>>>
> >>>>>>>> ** As this touches the scheduler logic I thought it warranted another beta.
> >>>>>>>>
> >>>>>>>> This should be the last beta in my opinion and we can prepare changelog,
> >>>>>>>> upgrade notes and release notes for the RC (Feb 2).
> >>>>>>>>
> >>>>>>>> Cheers
> >>>>>>>> Bolke
> >>>>>>
> >>>>> --
> >>>>> _/
> >>>>> _/ Alex Van Boxel
> >>>>>
> >>>>
> >>> --
> >>> _/
> >>> _/ Alex Van Boxel
> >>>
> >>
> > --
> > _/
> > _/ Alex Van Boxel
>
--
_/
_/ Alex Van Boxel
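
For anyone reading the quoted traceback later: the thread does not settle the root cause, so the following is only a sketch of the general Python behaviour that can produce this kind of ImportError, not a description of what Airflow 1.8.0b5 does internally. Loading a file by path does not by itself put that file's folder on sys.path, so a sibling import such as `import bqschema` only resolves if the folder is on sys.path (or PYTHONPATH). The helper names `load_by_path` and `demo` are made up for illustration, and `importlib.util` stands in here for the Python 2 `imp.load_source` call shown in the traceback.

```python
import importlib
import importlib.util
import os
import sys
import tempfile


def load_by_path(mod_name, filepath):
    """Load a Python file by path (hypothetical stand-in for imp.load_source)."""
    spec = importlib.util.spec_from_file_location(mod_name, filepath)
    module = importlib.util.module_from_spec(spec)
    spec.loader.exec_module(module)
    return module


def demo():
    """Return (error_name_without_path, loaded_ok_with_path) for a two-file folder."""
    with tempfile.TemporaryDirectory() as dags:
        # Two files side by side, mirroring the layout described in the thread.
        with open(os.path.join(dags, "bqschema.py"), "w") as f:
            f.write("def marketing_segment():\n    return [{'name': 'user_id'}]\n")
        dag_file = os.path.join(dags, "marketing_segmentation.py")
        with open(dag_file, "w") as f:
            f.write("import bqschema\nSCHEMA = bqschema.marketing_segment()\n")

        # 1) Folder is not on sys.path: the sibling import cannot be resolved.
        try:
            load_by_path("dag_without_path", dag_file)
            error = None
        except ImportError as exc:
            error = type(exc).__name__

        # 2) Folder is on sys.path: the same file now loads.
        sys.path.insert(0, dags)
        importlib.invalidate_caches()
        try:
            module = load_by_path("dag_with_path", dag_file)
            loaded = module.SCHEMA == [{"name": "user_id"}]
        finally:
            # Undo the global changes so the demo leaves no trace.
            sys.path.remove(dags)
            sys.modules.pop("bqschema", None)
        return error, loaded


if __name__ == "__main__":
    print(demo())
```

If that is what is happening here, checking whether the DAG folder is on sys.path in the process that parses the DAG files would be a reasonable first thing to look at; that is an assumption, not something confirmed in this thread.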