Return-Path: X-Original-To: apmail-hbase-dev-archive@www.apache.org Delivered-To: apmail-hbase-dev-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id E283018F97 for ; Wed, 2 Dec 2015 21:23:09 +0000 (UTC) Received: (qmail 49905 invoked by uid 500); 2 Dec 2015 21:23:09 -0000 Delivered-To: apmail-hbase-dev-archive@hbase.apache.org Received: (qmail 49825 invoked by uid 500); 2 Dec 2015 21:23:09 -0000 Mailing-List: contact dev-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@hbase.apache.org Delivered-To: mailing list dev@hbase.apache.org Received: (qmail 49812 invoked by uid 99); 2 Dec 2015 21:23:08 -0000 Received: from Unknown (HELO spamd1-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 02 Dec 2015 21:23:08 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd1-us-west.apache.org (ASF Mail Server at spamd1-us-west.apache.org) with ESMTP id 4D7FFC3FE1 for ; Wed, 2 Dec 2015 21:23:08 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd1-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 3.001 X-Spam-Level: *** X-Spam-Status: No, score=3.001 tagged_above=-999 required=6.31 tests=[DKIM_SIGNED=0.1, DKIM_VALID=-0.1, HEADER_FROM_DIFFERENT_DOMAINS=0.001, HTML_MESSAGE=3, SPF_PASS=-0.001, URIBL_BLOCKED=0.001] autolearn=disabled Authentication-Results: spamd1-us-west.apache.org (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com Received: from mx1-us-east.apache.org ([10.40.0.8]) by localhost (spamd1-us-west.apache.org [10.40.0.7]) (amavisd-new, port 10024) with ESMTP id PLJiWlctdt1u for ; Wed, 2 Dec 2015 21:22:54 +0000 (UTC) Received: from mail-io0-f175.google.com (mail-io0-f175.google.com [209.85.223.175]) by mx1-us-east.apache.org (ASF Mail Server at mx1-us-east.apache.org) with ESMTPS id CBBB4441C4 for ; Wed, 2 Dec 2015 21:22:53 +0000 (UTC) Received: by iouu10 with SMTP id u10so61594560iou.0 for ; Wed, 02 Dec 2015 13:22:53 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:sender:in-reply-to:references:date:message-id:subject :from:to:content-type; bh=rhOuur96fherT8tbbmjgmLSB+mgRlcKgQnC3+MrYVtM=; b=nkfV83Jf3TfnXLowQNpfGmSBLdN9DLLA/Im+7cRi+SzllFpYCyxAHK4OatRceIBJ9C NZOapMuZMS9uV4KtzBmvoOp93wGHnPBremrL5ye9TskrCgf73NHQkBwB/v4QwDHM+i76 1SdJfjhKBa/mwPkC7GfA4/Pd6Nx6ZTqC/YouNKuD8ww5XgYW0q7Zw4kyI5Cm7lAwqPYO qkK+B5wnT0eor7df5JfBZS1a4G4w4McQuYnF9IHVG3s60lourOpAFEnysoNg6VJJdaEi gQxVj1SJLK3tATWlIyd7z+t/N2wiipPO5nuxXFn2B67gh7yY/ptuNFz94OwoL+ZHaINq pA6g== MIME-Version: 1.0 X-Received: by 10.107.14.65 with SMTP id 62mr5629613ioo.57.1449091373411; Wed, 02 Dec 2015 13:22:53 -0800 (PST) Sender: saint.ack@gmail.com Received: by 10.64.167.134 with HTTP; Wed, 2 Dec 2015 13:22:53 -0800 (PST) In-Reply-To: References: Date: Wed, 2 Dec 2015 13:22:53 -0800 X-Google-Sender-Auth: lgDdPavdZmU9Lj5xiNxC_tezlXo Message-ID: Subject: Re: Testing and CI -- Apache Jenkins Builds (WAS -> Re: Testing) From: Stack To: HBase Dev List Content-Type: multipart/alternative; boundary=001a113fd05a8f344f0525f0e0a0 --001a113fd05a8f344f0525f0e0a0 Content-Type: text/plain; charset=UTF-8 As part of my continuing advocacy of builds.apache.org and that their results are now worthy of our trust and nurture, here are some highlights from the last few days of builds: + hadoopqa is now finding zombies before the patch is committed. HBASE-14888 showed "-1 core tests. The patch failed these unit tests:" but didn't have any failed tests listed (I'm trying to see if I can do anything about this...). Running our little ./dev-tools/findHangingTests.py against the consoleText, it showed a hanging test. Running locally, I see same hang. This is before the patch landed. + Our branch runs are now near totally zombie and flakey free -- still some work to do -- but a recent patch that seemed harmless was causing a reliable flake fail in the backport to branch-1* confirmed by local runs. The flakeyness was plain to see up in builds.apache.org. + In the last few days I've committed a patch that included javadoc warnings even though hadoopqa said the patch introduced javadoc issues (I missed it). This messed up life for folks subsequently as their patches now reported javadoc issues.... In short, I suggest that builds.apache.org is worth keeping an eye on, make sure you get a clean build out of hadoopqa before committing anything, and lets all work together to try and keep our builds blue: it'll save us all work in the long run. St.Ack On Tue, Nov 4, 2014 at 9:38 AM, Stack wrote: > Branch-1 and master have stabilized and now run mostly blue (give or take > the odd failure) [1][2]. Having a mostly blue branch-1 has helped us > identify at least one destabilizing commit in the last few days, maybe two; > this is as it should be (smile). > > Lets keep our builds blue. If you commit a patch, make sure subsequent > builds stay blue. You can subscribe to builds@hbase.apache.org to get > notice of failures if not already subscribed. > > Thanks, > St.Ack > > 1. https://builds.apache.org/view/H-L/view/HBase/job/HBase-1.0/ > 2. https://builds.apache.org/view/H-L/view/HBase/job/HBase-TRUNK/ > > > On Mon, Oct 13, 2014 at 4:41 PM, Stack wrote: > >> A few notes on testing. >> >> Too long to read, infra is more capable now and after some work, we are >> seeing branch-1 and trunk mostly running blue. Lets try and keep it this >> way going forward. >> >> Apache Infra has new, more capable hardware. >> >> A recent spurt of test fixing combined with more capable hardware seems >> to have gotten us to a new place; tests are mostly passing now on branch-1 >> and master. Lets try and keep it this way and start to trust our test runs >> again. Just a few flakies remain. Lets try and nail them. >> >> Our tests now run in parallel with other test suites where previous we >> ran alone. You can see this sometimes when our zombie detector reports >> tests from another project altogether as lingerers (To be fixed). Some of >> our tests are failing because a concurrent hbase run is undoing classes and >> data from under it. Also, lets fix. >> >> Our tests are brittle. It takes 75minutes for them to complete. Many are >> heavy-duty integration tests starting up multiple clusters and mapreduce >> all in the one JVM. It is a miracle they pass at all. Usually integration >> tests have been cast as unit tests because there was no where else for them >> to get an airing. We have the hbase-it suite now which would be a more apt >> place but until these are run on a regular basis in public for all to see, >> the fat integration tests disguised as unit tests will remain. A review of >> our current unit tests weeding the old cruft and the no longer relevant or >> duplicates would be a nice undertaking if someone is looking to contribute. >> >> Alex Newman has been working on making our tests work up on travis and >> circle-ci. That'll be sweet when it goes end-to-end. He also added in >> some "type" categorizations -- client, filter, mapreduce -- alongside our >> old "sizing" categorizations of small/medium/large. His thinking is that >> we can run these categorizations in parallel so we could run the total >> suite in about the time of the longest test, say 20-30minutes? We could >> even change Apache to run them this way. >> >> FYI, >> St.Ack >> >> >> >> >> >> >> > --001a113fd05a8f344f0525f0e0a0--