Return-Path: X-Original-To: apmail-hbase-dev-archive@www.apache.org Delivered-To: apmail-hbase-dev-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 70B711070F for ; Wed, 31 Jul 2013 23:44:20 +0000 (UTC) Received: (qmail 15893 invoked by uid 500); 31 Jul 2013 23:44:19 -0000 Delivered-To: apmail-hbase-dev-archive@hbase.apache.org Received: (qmail 15841 invoked by uid 500); 31 Jul 2013 23:44:19 -0000 Mailing-List: contact dev-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@hbase.apache.org Delivered-To: mailing list dev@hbase.apache.org Received: (qmail 15833 invoked by uid 99); 31 Jul 2013 23:44:19 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 31 Jul 2013 23:44:19 +0000 X-ASF-Spam-Status: No, hits=1.5 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of saint.ack@gmail.com designates 209.85.214.50 as permitted sender) Received: from [209.85.214.50] (HELO mail-bk0-f50.google.com) (209.85.214.50) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 31 Jul 2013 23:44:13 +0000 Received: by mail-bk0-f50.google.com with SMTP id ik8so458123bkc.9 for ; Wed, 31 Jul 2013 16:43:52 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:sender:in-reply-to:references:date :x-google-sender-auth:message-id:subject:from:to:content-type; bh=lrw0jNd+srXp71dRs/XFtW61rJYSsMjoyFZ/KWJAKIQ=; b=UNKhRTD5HOICo9XfnX7MINwHRTxj12bdvdStVce2ZzSH0I8ofeD9psFX3H70esL/HM Ibp9QOfeb4gBtoRqu/8bSQgGK+JUqe5M+zH0ovrbwGyitT3iaXjWVhtOwJavCKYHHAzl AxYRzV8o9bPOdrOHrMc6ZgWGedQoN3PzSgUDi8Wr6ittUxegBL+Ya00UN1vsF5s1q4Xd aqxLFVIYoFy27WZdRb0qVtPso4wmmo23/dRFyLTi27YkMUfOTF6BXYGWvH8E5wXQPais 9KfMeyay/zHLseKKBgWE3Nn7KY/airLM7vBJZQCGeDCVJh0UfcmoUNp4sCA1/3wnYAK5 DtVw== MIME-Version: 1.0 X-Received: by 10.204.54.206 with SMTP id r14mr10207127bkg.120.1375314232731; Wed, 31 Jul 2013 16:43:52 -0700 (PDT) Sender: saint.ack@gmail.com Received: by 10.204.39.16 with HTTP; Wed, 31 Jul 2013 16:43:52 -0700 (PDT) In-Reply-To: References: Date: Wed, 31 Jul 2013 16:43:52 -0700 X-Google-Sender-Auth: QMAbGxv2G4IYU8vttjS8n86w8V4 Message-ID: Subject: Re: Flaky Testcase Detector From: Stack To: HBase Dev List Content-Type: multipart/alternative; boundary=001a11c36e004c2b8404e2d74c20 X-Virus-Checked: Checked by ClamAV on apache.org --001a11c36e004c2b8404e2d74c20 Content-Type: text/plain; charset=UTF-8 Did this get checked in? Thanks, St.Ack On Wed, Mar 6, 2013 at 10:11 AM, Jeffrey Zhong wrote: > > I can check it into ./dev-tools unless someone objects. > > Thanks, > -Jeffrey > > On 3/5/13 10:43 PM, "Stack" wrote: > > >Nice. Should we check it into ./dev-tools? > >Thanks, > >St.Ack > > > > > >On Tue, Mar 5, 2013 at 7:12 PM, Andrew Purtell > >wrote: > > > >> This is handy. Thanks so much for sharing it. > >> > >> On Wednesday, March 6, 2013, Jeffrey Zhong wrote: > >> > >> > Hey, > >> > > >> > Recently I'm working on some hbase test case failures and I think it's > >> > useful if we can see a report of all failed test cases from most > >>recent > >> > runs. So we can easily see how flaky a test case is. I wrote a tool > >>some > >> > time back and below are some reports against different branches from > >> > today's > >> > run. You can get the tools' source at > >> > https://github.com/jeffreyz88/jenkins-tools. If we can run it daily > >>and > >> > send > >> > out an email then we can quickly notice what may break after recent > >> > check-ins. > >> > > >> > Notes: > >> > 1) " 873 874 875 876 877 878 879 880 881" are builds which has > >> > failed test cases in current or previous runs > >> > 2) 1 means "PASSED", 0 means "NOT RUN AT ALL", -1 means "FAILED" > >> > > >> > HBase-0.95 (from last 10 runs configurable) > >> > > >> > Failed Test Cases 21 22 23 24 25 27 > >> > ======================================================== > >> > > >> > > >> > >>org.apache.hadoop.hbase.catalog.testmetamigrationconvertingtopb.org.apach > >>e.h > >> > adoop.hbase.catalog.testmetamigrationconvertingtopb 0 0 0 > >>0 > >> > -1 > >> > -1 > >> > > >> > > >> > >>org.apache.hadoop.hbase.coprocessor.example.testbulkdeleteprotocol.testbu > >>lkd > >> > eletecolumn -1 0 0 0 0 0 > >> > > >> > > >> > >>org.apache.hadoop.hbase.coprocessor.example.testrowcountendpoint.org.apac > >>he. > >> > hadoop.hbase.coprocessor.example.testrowcountendpoint -1 0 0 > >> 0 > >> > 0 0 > >> > > >> > > >> > >>org.apache.hadoop.hbase.coprocessor.example.testzookeeperscanpolicyobserv > >>er. > >> > > >> > >>org.apache.hadoop.hbase.coprocessor.example.testzookeeperscanpolicyobserv > >>er > >> > -1 0 0 0 0 0 > >> > > >> > > >> > >>org.apache.hadoop.hbase.master.testmasterfailover.testmasterfailoverwithm > >>ock > >> > edritondeadrs 1 -1 -1 0 1 1 > >> > > >> > > >> > >>org.apache.hadoop.hbase.regionserver.testsplittransactiononcluster.testsh > >>oul > >> > > >> > >>dthrowioexceptionifstorefilesizeisemptyandshouldsuccessfullyexecuterollba > >>ck > >> > 1 1 1 1 1 -1 > >> > > >> > > >> > >>org.apache.hadoop.hbase.regionserver.testsplittransactiononcluster.testsh > >>utd > >> > ownfixupwhendaughterhassplit 1 1 1 -1 -1 -1 > >> > org.apache.hadoop.hbase.regionserver.wal.testhlog.testlogcleaning 0 > >> 1 > >> > 1 1 -1 0 > >> > > >> > > >> > >>org.apache.hadoop.hbase.replication.testmasterreplication.testcyclicrepli > >>cat > >> > ion 1 1 1 1 1 -1 > >> > > >> > > >> > >>org.apache.hadoop.hbase.replication.testreplicationqueuefailover.queuefai > >>lov > >> > er 1 -1 0 1 -1 0 > >> > > >> > > >> > >>org.apache.hadoop.hbase.replication.testreplicationqueuefailovercompresse > >>d.q > >> > ueuefailover 0 1 -1 0 -1 0 > >> > > >> > > >> > >>org.apache.hadoop.hbase.security.access.testaccesscontroller.org.apache.h > >>ado > >> > op.hbase.security.access.testaccesscontroller 0 -1 0 0 0 > >> 0 > >> > > >> > As you can see we have a few test cases don't run successfully at all > >>or > >> > recently. > >> > > >> > HBase-TRUNK (from last 10 runs) > >> > > >> > Failed Test Cases 3908 3909 3910 3912 3913 3914 3915 3916 > >> > ======================================================== > >> > > >> > > >> > >>org.apache.hadoop.hbase.catalog.testmetamigrationconvertingtopb.org.apach > >>e.h > >> > adoop.hbase.catalog.testmetamigrationconvertingtopb 0 -1 0 > >>0 > >> > 0 > >> > 0 -1 -1 > >> > > >> > > >> > >>org.apache.hadoop.hbase.client.testadmin.testcloseregionwhenservernameise > >>mpt > >> > y 1 1 1 1 1 1 -1 0 > >> > org.apache.hadoop.hbase.client.testscannertimeout.test3686a 1 1 > >> -1 > >> > 0 1 1 1 1 > >> > > >> > > >> > >>org.apache.hadoop.hbase.client.testsnapshotcloneindependence.testofflines > >>nap > >> > shotregionoperationsindependent 0 1 -1 0 1 1 1 > >>1 > >> > > >> > > >> > >>org.apache.hadoop.hbase.client.testsnapshotcloneindependence.testonlinesn > >>aps > >> > hotregionoperationsindependent 1 1 -1 0 1 1 1 1 > >> > > >> > > >> > >>org.apache.hadoop.hbase.master.testassignmentmanageroncluster.testmovereg > >>ion > >> > 1 1 -1 0 1 1 1 1 > >> > > >> > > >> > >>org.apache.hadoop.hbase.master.testdistributedlogsplitting.testdelayeddel > >>ete > >> > onfailure 1 1 -1 0 1 1 1 1 > >> > > >> > > >> > >>org.apache.hadoop.hbase.master.testmasterfailover.testmasterfailoverwithm > >>ock > >> > edritondeadrs 1 1 1 1 1 -1 0 1 > >> > org.apache.hadoop.hbase.master.testtablelockmanager.testdelete 1 > >> 1 > >> > -1 0 1 1 1 1 > >> > > >> > > >> > >>org.apache.hadoop.hbase.procedure.testzkprocedurecontrollers.testsimplezk > >>coh > >> > ortmembercontroller 1 1 1 1 1 1 -1 0 > >> > > >> > > >> > >>org.apache.hadoop.hbase.regionserver.testsplittransactiononcluster.testex > >>ist > >> > ingznodeblockssplitandwerollback 1 1 1 1 1 -1 0 > >> 1 > >> > > >> > > >> > >>org.apache.hadoop.hbase.regionserver.testsplittransactiononcluster.testsh > >>oul > >> > > >> > >>dthrowioexceptionifstorefilesizeisemptyandshouldsuccessfullyexecuterollba > >>ck > >> > 1 1 -1 0 1 1 1 1 > >> > > >> > > >> > >>org.apache.hadoop.hbase.regionserver.testsplittransactiononcluster.testsh > >>utd > >> > ownfixupwhendaughterhassplit -1 -1 -1 0 -1 0 -1 0 > >> > org.apache.hadoop.hbase.regionserver.wal.testhlog.testlogcleaning 0 > >> -1 > >> > -1 0 1 1 1 1 > >> > > >> > > >> > >>org.apache.hadoop.hbase.replication.testmultislavereplication.testmultisl > >>ave > >> > replication 1 1 -1 0 1 1 1 1 > >> > > >> > > >> > >>org.apache.hadoop.hbase.replication.testreplicationqueuefailover.queuefai > >>lov > >> > er 1 1 1 -1 0 -1 0 1 > >> > > >> > > >> > >>org.apache.hadoop.hbase.replication.testreplicationqueuefailovercompresse > >>d.q > >> > ueuefailover 1 1 1 -1 0 -1 0 1 > >> > > >> > > >> > >>org.apache.hadoop.hbase.security.access.testaccesscontrolfilter.testquali > >>fie > >> > raccess -1 0 1 -1 0 1 1 1 > >> > org.apache.hadoop.hbase.util.testhbasefsck.testdegenerateregions 1 > >> -1 > >> > 0 1 1 1 1 1 > >> > > >> > > >> > >>org.apache.hadoop.hbase.util.testhbasefsck.testfixassignmentsandnohdfsche > >>cki > >> > ng 1 -1 0 1 1 1 1 1 > >> > org.apache.hadoop.hbase.util.testhbasefsck.testfixbytable 1 -1 > >> 0 > >> > 1 1 1 1 1 > >> > org.apache.hadoop.hbase.util.testhbasefsck.testhbckthreadpooling 1 > >> -1 > >> > 0 1 1 1 1 1 > >> > org.apache.hadoop.hbase.util.testhbasefsck.testlingeringreferencefile > >> 1 > >> > -1 0 1 1 1 1 1 > >> > org.apache.hadoop.hbase.util.testhbasefsck.testmissingfirstregion 1 > >> -1 > >> > 0 1 1 1 1 1 > >> > org.apache.hadoop.hbase.util.testhbasefsck.testmissinglastregion 1 > >> -1 > >> > 0 1 1 1 1 1 > >> > org.apache.hadoop.hbase.util.testhbasefsck.testnotinhdfs 1 -1 > >>0 > >> > 1 > >> > 1 1 1 1 > >> > org.apache.hadoop.hbase.util.testhbasefsck.testnotinmetaordeployedhole > >> 1 > >> > -1 0 1 1 1 1 1 > >> > org.apache.hadoop.hbase.util.testhbasefsck.testoverlapandorphan 1 > >> -1 > >> > 0 1 1 1 1 1 > >> > org.apache.hadoop.hbase.util.testhbasefsck.testregionhole 1 -1 > >> 0 > >> > 1 1 1 1 1 > >> > > >>org.apache.hadoop.hbase.util.testhbasefsck.testregionshouldnotbedeployed > >> > 1 -1 0 1 1 1 1 1 > >> > org.apache.hadoop.hbase.util.testhbasefsck.testsidelineoverlapregion > >> 1 > >> > -1 0 1 1 1 1 1 > >> > org.apache.hadoop.hbase.util.testhbasefsck.testsplitdaughtersnotinmeta > >> 1 > >> > -1 0 1 1 1 1 1 > >> > > >> > More test cases are not in good shape. > >> > > >> > HBase-0.94(from last 10 runs) > >> > > >> > Failed Test Cases 873 874 875 876 877 878 879 880 > >> 881 > >> > ======================================================== > >> > org.apache.hadoop.hbase.backup.testhfilearchiving.testcleaningrace > >>-1 > >> > 0 > >> > 1 1 1 1 1 1 1 > >> > org.apache.hadoop.hbase.catalog.testmetareadereditor.testgetregion > >>1 > >> > 1 > >> > -1 0 1 1 1 1 1 > >> > org.apache.hadoop.hbase.catalog.testmetareadereditor.testretrying 1 > >> 1 > >> > -1 0 1 1 1 1 1 > >> > > >>org.apache.hadoop.hbase.catalog.testmetareadereditor.testscanmetafortable > >> > 1 1 -1 0 1 1 1 1 1 > >> > org.apache.hadoop.hbase.catalog.testmetareadereditor.testtableexists > >> 1 > >> > 1 -1 0 1 1 1 1 1 > >> > > >> > > >> > >>org.apache.hadoop.hbase.client.testmetamigrationremovinghtd.org.apache.ha > >>doo > >> > p.hbase.client.testmetamigrationremovinghtd 0 -1 0 0 0 > >> 0 > >> > 0 0 0 > >> > > >>org.apache.hadoop.hbase.client.testmultiparallel.testflushcommitsnoabort > >> > 1 1 1 1 1 -1 0 1 1 > >> > > >> > >>org.apache.hadoop.hbase.client.testmultiparallel.testflushcommitswithabor > >>t > >> > 1 1 1 1 1 -1 0 1 1 > >> > org.apache.hadoop.hbase.client.testshell.testrunshelltests 1 1 > >> 1 > >> > 1 1 1 -1 0 1 > >> > > >> > > >> > >>org.apache.hadoop.hbase.master.testmasterfailover.testmasterfailoverwithm > >>ock > >> > edritondeadrs 1 1 -1 -1 0 1 1 1 1 > >> > > >> > > >> > >>org.apache.hadoop.hbase.regionserver.testatomicoperation.testmultirowmuta > >>tio > >> > nmultithreads 1 1 1 -1 0 1 1 1 1 > >> > > >> > > >> > >>org.apache.hadoop.hbase.regionserver.testsplittransactiononcluster.testsh > >>utd > >> > ownfixupwhendaughterhassplit 1 -1 0 1 1 1 1 1 > >> 1 > >> > > >> > As you can see 0.94 is stable and testmetamigrationremovinghtd hasn't > >> been > >> > run for a while either being removed or due to other failures. Some > >>test > >> > cases aren't stable but not too bad. > >> > > >> > Thanks, > >> > -Jeffrey > >> > > >> > > >> > > >> > > >> > >> -- > >> Best regards, > >> > >> - Andy > >> > >> Problems worthy of attack prove their worth by hitting back. - Piet Hein > >> (via Tom White) > >> > > > --001a11c36e004c2b8404e2d74c20--