Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id 5FF74200C15 for ; Wed, 8 Feb 2017 13:33:44 +0100 (CET) Received: by cust-asf.ponee.io (Postfix) id 5E731160B5A; Wed, 8 Feb 2017 12:33:44 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id 5B2CF160B4E for ; Wed, 8 Feb 2017 13:33:43 +0100 (CET) Received: (qmail 5457 invoked by uid 500); 8 Feb 2017 12:33:42 -0000 Mailing-List: contact dev-help@airflow.incubator.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@airflow.incubator.apache.org Delivered-To: mailing list dev@airflow.incubator.apache.org Received: (qmail 5445 invoked by uid 99); 8 Feb 2017 12:33:42 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd1-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 08 Feb 2017 12:33:42 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd1-us-west.apache.org (ASF Mail Server at spamd1-us-west.apache.org) with ESMTP id 7E9E8C074C for ; Wed, 8 Feb 2017 12:33:41 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd1-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 2.649 X-Spam-Level: ** X-Spam-Status: No, score=2.649 tagged_above=-999 required=6.31 tests=[DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, HTML_MESSAGE=2, KAM_LOTSOFHASH=0.25, RCVD_IN_MSPIKE_H2=-0.001, RCVD_IN_SORBS_SPAM=0.5, SPF_PASS=-0.001, WEIRD_PORT=0.001] autolearn=disabled Authentication-Results: spamd1-us-west.apache.org (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com Received: from mx1-lw-us.apache.org ([10.40.0.8]) by localhost (spamd1-us-west.apache.org [10.40.0.7]) (amavisd-new, port 10024) with ESMTP id d0Ci2qd4YeIY for ; Wed, 8 Feb 2017 12:33:39 +0000 (UTC) Received: from mail-wr0-f181.google.com (mail-wr0-f181.google.com [209.85.128.181]) by mx1-lw-us.apache.org (ASF Mail Server at mx1-lw-us.apache.org) with ESMTPS id 77C595F1BA for ; Wed, 8 Feb 2017 12:33:38 +0000 (UTC) Received: by mail-wr0-f181.google.com with SMTP id i10so59749147wrb.0 for ; Wed, 08 Feb 2017 04:33:38 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=from:message-id:mime-version:subject:date:in-reply-to:cc:to :references; bh=vPIJAxjWDBMniFRyGCqi8rviyWE0qK2/S+M29l3Oups=; b=pkjZSDJiTvfX6wdlBBaxOK8ArVT2M/EzHeiwAvxf7laeIU39JYoItqnseiCvsTYtmZ G8nvuRDWmslnRc/5pFNm+AwmEqSHEXugbAJtv3+jX9IIA0nKYyWlpxjX6l4N7a3SbIFo ovAjKq+13n1LkGP4DQG2iqjbCu5pLHo/YJqljCPY55z4qfEHoVvuZ84gCp0RmkoXvkbc 3wqAmPlxFWXHzkoDtOHfUp0+m/F4Mx308NgUIJnR2+Cb+9Fn7GJMzdcJCYzUhFJax/pf 5ihvDbjTB+Fwn+L/cyQvVYTXEpLWUk2+iLPRe5mgNEfLLG9Bp8tSKDLSu3WuixggMd3h RFMQ== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:from:message-id:mime-version:subject:date :in-reply-to:cc:to:references; bh=vPIJAxjWDBMniFRyGCqi8rviyWE0qK2/S+M29l3Oups=; b=Ui0qpLSXGSI+7Bw4AbiU/Lb7l77nwdB9a0mAyC9xXgI97sWVD+XYaZDElYlKkmXQN7 pVCH7+wqRSw1AX9bNhK+UwyC4cH7AtLzsO3pxdJVpZtjxOfXu0BFzgArnEIpF1Re4BOj O1fP/Jx+0ox9kFNQ6AMXYKgH6o5/8N9SzBt0+Lx5UqcfUd0rjmLIXlIOv1hqDpX2n1Xs RqYkQY1u75s/4DcDhazqsxWRflGogvWs60Zy+B9Tc26rdNF0Hm6KPNVy8vXTsQfx78N3 T/09hQHUNMEC4UWYQWEpevJ8CRWS35vTV/FqHdCKBweuwCv7brksxkQE/v1Pw8p/qTph bpdw== X-Gm-Message-State: AIkVDXJcZO7aZdX01pskjgHVv/tknlUIgNDvgdF998xA8EwNPgdA3m0mk//B5dOXg2gRog== X-Received: by 10.223.175.36 with SMTP id z33mr18579874wrc.25.1486557216542; Wed, 08 Feb 2017 04:33:36 -0800 (PST) Received: from [10.254.254.2] (89.20.160.55.static.ef-service.nl. [89.20.160.55]) by smtp.gmail.com with ESMTPSA id l140sm3131351wmg.12.2017.02.08.04.33.34 (version=TLS1_2 cipher=ECDHE-RSA-AES128-GCM-SHA256 bits=128/128); Wed, 08 Feb 2017 04:33:35 -0800 (PST) From: Bolke de Bruin Message-Id: <0BB6FA0E-94CC-4076-B66F-C0094F1758F4@gmail.com> Content-Type: multipart/alternative; boundary="Apple-Mail=_78EBE853-9101-4089-B92C-893BBD6A8891" Mime-Version: 1.0 (Mac OS X Mail 10.2 \(3259\)) Subject: Re: Airflow 1.8.0 Release Candidate 1 Date: Wed, 8 Feb 2017 13:33:01 +0100 In-Reply-To: Cc: dev@airflow.incubator.apache.org To: Alex Van Boxel References: <1BA62FD0-851A-43BB-965E-87B680E247B6@gmail.com> <2DE49F95-BDD1-44F2-AC6A-495B755828D2@gmail.com> X-Mailer: Apple Mail (2.3259) archived-at: Wed, 08 Feb 2017 12:33:44 -0000 --Apple-Mail=_78EBE853-9101-4089-B92C-893BBD6A8891 Content-Transfer-Encoding: quoted-printable Content-Type: text/plain; charset=utf-8 Alex, Do you have anything more to go on? I don=E2=80=99t mind reverting the = patch, however it code part seems unrelated to what you described and = the issue wasn=E2=80=99t reproducible. I would really like to see more = logging and maybe a test in a clean environment plus debugging. = Preferable I would like to make RC 2 available today and immediately = raise a vote as the *current* changes are really small, are confined to = contrib and have been tested by the people using it. But I am holding off for now due to your concern. Cheers Bolke > On 7 Feb 2017, at 20:56, Bolke de Bruin wrote: >=20 > How do you start the scheduler Alex? What are the command line = parameters? What are the logs when it doesn=E2=80=99t work? >=20 > Bolke >=20 >=20 >=20 >> On 7 Feb 2017, at 18:52, Alex Van Boxel > wrote: >>=20 >> Hey Feng, >>=20 >> The upgrades are all automated (including the workers/web/scheduler). = And I tripple checked, I now am test running RC1 just with the your line = reverted (and look ok) >>=20 >> Could you do me a favour and add a test dag where you do a local = import. Example: >>=20 >> bqschema.py >> def ranking(): >> return [ >> {"name": "bucket_date", "type": "timestamp", "mode": = "nullable"}, >> {"name": "rank", "type": "integer", "mode": "nullable"}, >> {"name": "audience_preference", "type": "float", "mode": = "nullable"}, >> {"name": "audience_likelihood_share", "type": "float", = "mode": "nullable"} >> ] >>=20 >> dag.py >> import bqschema >> ... >> all in the same dag folder. We use it to define out BigQuery schema's = into a seperate file. >>=20 >>=20 >> On Tue, Feb 7, 2017 at 6:37 PM Feng Lu > wrote: >> Hi Alex- >>=20 >> Please see the attached screenshots of my local testing using = celeryexecutor (on k8s as well).=20 >> All look good and the workflow is successfully completed. >>=20 >> Curious did you also update the worker image?=20 >> Sorry for the confusion, happy to debug more if you could share with = me your k8s setup.=20 >>=20 >> Feng >>=20 >> On Tue, Feb 7, 2017 at 8:37 AM, Feng Lu > wrote: >> When num_runs is not explicitly specified, the default is set to -1 = to match the expectation of SchedulerJob here: >> >> =E2=80=8B >> Doing so also matches the type of num_runs ('int' in this case). >> The scheduler will run non-stop as a result regardless whether dag = files are present (since the num_runs default is now -1: unlimited).=20 >>=20 >> Based on what Alex described, the import error doesn't look like = directly related to this change.=20 >> Maybe this one? = https://github.com/apache/incubator-airflow/commit/67cbb966410226c1489bb73= 0af3af45330fc51b9 = >>=20 >> I am still in the middle of running some quick test using celery = executor, will update the thread once it's done.=20 >>=20 >>=20 >> On Tue, Feb 7, 2017 at 6:56 AM, Bolke de Bruin > wrote: >> Hey Alex, >>=20 >> Thanks for tracking it down. Can you elaborate want went wrong with = celery? The lines below do not particularly relate to Celery directly, = so I wonder why we are not seeing it with LocalExecutor? >>=20 >> Cheers >> Bolke >>=20 >> > On 7 Feb 2017, at 15:51, Alex Van Boxel > wrote: >> > >> > I have to give the RC1 a *-1*. I spend hours, or better days to get = the RC >> > running with Celery on our test environment, till I finally found = the >> > commit that killed it: >> > >> > e7f6212cae82c3a3a0bc17bbcbc70646f67d02eb >> > [AIRFLOW-813] Fix unterminated unit tests in SchedulerJobTest >> > Closes #2032 from fenglu-g/master >> > >> > I was always looking at the wrong this, because the commit only = changes a >> > single default parameter from *None to -1* >> > >> > I do have the impression I'm the only one running with Celery. Are = other >> > people running with it? >> > >> > *I propose* *reverting the commit*. Feng, can you elaborate on this = change? >> > >> > Change the default back no *None* in cli.py got it finally working: >> > >> > 'num_runs': Arg( >> > ("-n", "--num_runs"), >> > default=3DNone, type=3Dint, >> > help=3D"Set the number of runs to execute before exiting"), >> > >> > Thanks. >> > >> > On Tue, Feb 7, 2017 at 3:49 AM siddharth anand > wrote: >> > >> > I did get 1.8.0 installed and running at Agari. >> > >> > I did run into 2 problems. >> > 1. Most of our DAGs broke due the way Operators are now imported. >> > = https://github.com/apache/incubator-airflow/blob/master/UPDATING.md#deprec= ated-features = >> > >> > According to the documentation, these deprecations would only cause = an >> > issue in 2.0. However, I needed to fix them now. >> > >> > So, I needed to change "from airflow.operators import = PythonOperator" to >> > from "from airflow.operators.python_operator import = PythonOperator". Am I >> > missing something? >> > >> > 2. I ran into a migration problem that seems to have cleared itself = up. I >> > did notice that some dags do not have data in their "DAG Runs" = column on >> > the overview page computed. I am looking into that issue presently. >> > = https://www.dropbox.com/s/cn058mtu3vcv8sq/Screenshot%202017-02-06%2018.45.= 07.png?dl=3D0 = >> > >> > -s >> > >> > On Mon, Feb 6, 2017 at 4:30 PM, Dan Davydov .invalid> >> > wrote: >> > >> >> Bolke, attached is the patch for the cgroups fix. Let me know = which >> >> branches you would like me to merge it to. If anyone has = complaints about >> >> the patch let me know (but it does not touch the core of airflow, = only the >> >> new cgroups task runner). >> >> >> >> On Mon, Feb 6, 2017 at 4:24 PM, siddharth anand > wrote: >> >> >> >>> Actually, I see the error is further down.. >> >>> >> >>> File >> >>> = "/usr/local/lib/python2.7/dist-packages/sqlalchemy/engine/default.py", >> >>> line >> >>> 469, in do_execute >> >>> >> >>> cursor.execute(statement, parameters) >> >>> >> >>> sqlalchemy.exc.IntegrityError: (psycopg2.IntegrityError) null = value in >> >>> column "dag_id" violates not-null constraint >> >>> >> >>> DETAIL: Failing row contains (null, running, 1, f). >> >>> >> >>> [SQL: 'INSERT INTO dag_stats (state, count, dirty) VALUES = (%(state)s, >> >>> %(count)s, %(dirty)s)'] [parameters: {'count': 1L, 'state': = u'running', >> >>> 'dirty': False}] >> >>> >> >>> It looks like an autoincrement is missing for this table. >> >>> >> >>> >> >>> I'm running `SQLAlchemy=3D=3D1.1.4` - I see our setup.py = specifies any >> > version >> >>> greater than 0.9.8 >> >>> >> >>> -s >> >>> >> >>> >> >>> >> >>> On Mon, Feb 6, 2017 at 4:11 PM, siddharth anand = > >> >>> wrote: >> >>> >> >>>> I tried upgrading to 1.8.0rc1 from 1.7.1.3 via pip install >> >>>> https://dist.apache.org/repos/dist/dev/incubator/airflow/ = >> >>>> airflow-1.8.0rc1+apache.incubating.tar.gz and then running = airflow >> >>>> upgradedb didn't quite work. First, I thought it completed >> > successfully, >> >>>> then saw errors some tables were indeed missing. I ran it again = and >> >>>> encountered the following exception : >> >>>> >> >>>> DB: postgresql://app_cousteau@db-cousteau.ep.stage.agari.com:543 = >> >>> 2/airflow >> >>>> >> >>>> [2017-02-07 00:03:20,309] {db.py:284} INFO - Creating tables >> >>>> >> >>>> INFO [alembic.runtime.migration] Context impl PostgresqlImpl. >> >>>> >> >>>> INFO [alembic.runtime.migration] Will assume transactional DDL. >> >>>> >> >>>> INFO [alembic.runtime.migration] Running upgrade 2e82aab8ef20 = -> >> >>>> 211e584da130, add TI state index >> >>>> >> >>>> INFO [alembic.runtime.migration] Running upgrade 211e584da130 = -> >> >>>> 64de9cddf6c9, add task fails journal table >> >>>> >> >>>> INFO [alembic.runtime.migration] Running upgrade 64de9cddf6c9 = -> >> >>>> f2ca10b85618, add dag_stats table >> >>>> >> >>>> INFO [alembic.runtime.migration] Running upgrade f2ca10b85618 = -> >> >>>> 4addfa1236f1, Add fractional seconds to mysql tables >> >>>> >> >>>> INFO [alembic.runtime.migration] Running upgrade 4addfa1236f1 = -> >> >>>> 8504051e801b, xcom dag task indices >> >>>> >> >>>> INFO [alembic.runtime.migration] Running upgrade 8504051e801b = -> >> >>>> 5e7d17757c7a, add pid field to TaskInstance >> >>>> >> >>>> INFO [alembic.runtime.migration] Running upgrade 5e7d17757c7a = -> >> >>>> 127d2bf2dfa7, Add dag_id/state index on dag_run table >> >>>> >> >>>> = /usr/local/lib/python2.7/dist-packages/sqlalchemy/sql/crud.py:692: >> >>>> SAWarning: Column 'dag_stats.dag_id' is marked as a member of = the >> >>> primary >> >>>> key for table 'dag_stats', but has no Python-side or server-side >> > default >> >>>> generator indicated, nor does it indicate 'autoincrement=3DTrue' = or >> >>>> 'nullable=3DTrue', and no explicit value is passed. Primary key = columns >> >>>> typically may not store NULL. Note that as of SQLAlchemy 1.1, >> >>>> 'autoincrement=3DTrue' must be indicated explicitly for = composite (e.g. >> >>>> multicolumn) primary keys if AUTO_INCREMENT/SERIAL/IDENTITY = behavior is >> >>>> expected for one of the columns in the primary key. CREATE TABLE >> >>> statements >> >>>> are impacted by this change as well on most backends. >> >>>> >> >>> >> >> >> >> >> > >> > -- >> > _/ >> > _/ Alex Van Boxel >>=20 >>=20 >> --=20 >> _/ >> _/ Alex Van Boxel >=20 --Apple-Mail=_78EBE853-9101-4089-B92C-893BBD6A8891--