Return-Path: X-Original-To: apmail-spark-dev-archive@minotaur.apache.org Delivered-To: apmail-spark-dev-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id D199517394 for ; Thu, 2 Apr 2015 16:02:39 +0000 (UTC) Received: (qmail 18089 invoked by uid 500); 2 Apr 2015 16:02:38 -0000 Delivered-To: apmail-spark-dev-archive@spark.apache.org Received: (qmail 18007 invoked by uid 500); 2 Apr 2015 16:02:38 -0000 Mailing-List: contact dev-help@spark.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Delivered-To: mailing list dev@spark.apache.org Received: (qmail 17996 invoked by uid 99); 2 Apr 2015 16:02:38 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 02 Apr 2015 16:02:38 +0000 X-ASF-Spam-Status: No, hits=1.5 required=10.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_LOW X-Spam-Check-By: apache.org Received-SPF: unknown (athena.apache.org: error in processing during lookup of sknapp@berkeley.edu) Received: from [209.85.217.172] (HELO mail-lb0-f172.google.com) (209.85.217.172) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 02 Apr 2015 16:02:34 +0000 Received: by lbbzk7 with SMTP id zk7so46480113lbb.0 for ; Thu, 02 Apr 2015 08:59:58 -0700 (PDT) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20130820; h=x-gm-message-state:mime-version:in-reply-to:references:from:date :message-id:subject:to:cc:content-type; bh=Ana8H5ani/CwyggPVrT7lmRSn3iRiE86ELX/Q17BpWg=; b=Ksdp8cGAajwGh0woowLbyfdPxehN6n4mJXwN5M1ablrd9UxwsvTKeXi/HEFkxqW/eF 4Iv+ftdDL4XzumH3ZB+FMoImWio64vmypG1ylMx0mNcCugVPfURLAHpi1NO+9nm/NUoE avpBuj6SYrCR2l/ZTw0W37UlXpZy+mNuAuAom0Wo9c5wRZceNADr6Fy0bdJbF+XuIUmQ rDm0k2IUcHHjUt71YWb3JUfOYQJ+hqL+cwgeOttvmNhtixJpHZ3g5Mb6MLAkuLPPrNQu 2xa0UizxOpQeqDetqSIyzeWzOSqK7YgoMYq4bogBjreonP0etsnecI1xfPUDM10e6z9G n8HA== X-Gm-Message-State: ALoCoQmaSxj9lVeUTSOafK/ZWTC2CmScbWgD/uWex7++gdztNNcCGlItEiTlk+1+vtNERUBkKNYv X-Received: by 10.152.27.225 with SMTP id w1mr36675463lag.77.1427990397584; Thu, 02 Apr 2015 08:59:57 -0700 (PDT) MIME-Version: 1.0 Received: by 10.114.2.74 with HTTP; Thu, 2 Apr 2015 08:59:37 -0700 (PDT) In-Reply-To: References: <3EEE3E33-B165-40ED-891B-13139D544DD7@hortonworks.com> From: shane knapp Date: Thu, 2 Apr 2015 08:59:37 -0700 Message-ID: Subject: Re: Unit test logs in Jenkins? To: Nicholas Chammas Cc: Steve Loughran , Apache Spark Dev Content-Type: multipart/alternative; boundary=089e0158c4586414960512bfece2 X-Virus-Checked: Checked by ClamAV on apache.org --089e0158c4586414960512bfece2 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable i agree with all of this. but can we please break up the tests and make them shorter? :) On Thu, Apr 2, 2015 at 8:54 AM, Nicholas Chammas wrote: > This is secondary to Marcelo=E2=80=99s question, but I wanted to comment = on this: > > Its main limitation is more cultural than technical: you need to get peop= le > to care about intermittent test runs, otherwise you can end up with > failures that nobody keeps on top of > > This is a problem that plagues Spark as well, but there *is* a technical > solution. > > The solution is simple: *All* the builds that we care about run for *ever= y* > proposed change. If *any* build fails, the change doesn=E2=80=99t make it= into the > repository. > > Spark already has a pull request builder that tests and reports back on > PRs. Committers don=E2=80=99t merge in PRs when this builder reports that= it failed > some tests. That=E2=80=99s a good thing. > > The problem is that there are several other builds that we run on a fixed > interval, independent of the pull request builder. These builds test > different configurations, dependency versions, and environments than what > the PR builder covers. If one of those builds fails, it fails on its own > little island, with no-one to hear it scream. The build failure is detach= ed > from the PR that caused it to fail. > > What should happen is that the whole matrix of stuff we care to test gets > run for every PR. No PR goes in if any build we care about fails for that > PR, and every build we care about runs for every commit of every PR. > > Really, this is just an extension of the basic idea of the PR builder. It > doesn=E2=80=99t make much sense to test stuff *after* it has been committ= ed and > potentially broken things. And it becomes exponentially more difficult to > find and fix a problem the longer it has been festering in the repo. It= =E2=80=99s > best to keep such problems out in the first place. > > With some more work on our CI infrastructure, I think this can be done. > Maybe even later this year. > > Nick > > On Thu, Apr 2, 2015 at 6:02 AM Steve Loughran stevel@hortonworks.com > wrote: > > > > > On 2 Apr 2015, at 06:31, Patrick Wendell wrote: > > > > > > Hey Marcelo, > > > > > > Great question. Right now, some of the more active developers have an > > > account that allows them to log into this cluster to inspect logs (we > > > copy the logs from each run to a node on that cluster). The > > > infrastructure is maintained by the AMPLab. > > > > > > I will put you in touch the someone there who can get you an account. > > > > > > This is a short term solution. The longer term solution is to have > > > these scp'd regularly to an S3 bucket or somewhere people can get > > > access to them, but that's not ready yet. > > > > > > - Patrick > > > > > >> > > > > > > ASF Jenkins is always there to play with; committers/PMC members should > > just need to file a BUILD JIRA to get access. > > > > Its main limitation is more cultural than technical: you need to get > > people to care about intermittent test runs, otherwise you can end up > with > > failures that nobody keeps on top of > > https://builds.apache.org/view/H-L/view/Hadoop/ > > > > Someone really needs to own the "keep the builds working" problem -and > > have the ability to somehow kick others into fixing things. The latter = is > > pretty hard cross-organisation > > > > > > >> That would be really helpful to debug build failures. The scalatest > > >> output isn't all that helpful. > > >> > > > > Potentially an issue with the test runner, rather than the tests > > themselves. > > > > --------------------------------------------------------------------- > > To unsubscribe, e-mail: dev-unsubscribe@spark.apache.org > > For additional commands, e-mail: dev-help@spark.apache.org > > > > =E2=80=8B > --089e0158c4586414960512bfece2--