Return-Path: X-Original-To: apmail-hbase-issues-archive@www.apache.org Delivered-To: apmail-hbase-issues-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id EC22A18A69 for ; Tue, 9 Feb 2016 22:39:24 +0000 (UTC) Received: (qmail 78555 invoked by uid 500); 9 Feb 2016 22:24:22 -0000 Delivered-To: apmail-hbase-issues-archive@hbase.apache.org Received: (qmail 78201 invoked by uid 500); 9 Feb 2016 22:24:21 -0000 Mailing-List: contact issues-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Delivered-To: mailing list issues@hbase.apache.org Received: (qmail 75827 invoked by uid 99); 9 Feb 2016 22:17:20 -0000 Received: from arcas.apache.org (HELO arcas) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 09 Feb 2016 22:17:20 +0000 Received: from arcas.apache.org (localhost [127.0.0.1]) by arcas (Postfix) with ESMTP id 8AB582C1F64 for ; Tue, 9 Feb 2016 22:17:18 +0000 (UTC) Date: Tue, 9 Feb 2016 22:17:18 +0000 (UTC) From: "Hadoop QA (JIRA)" To: issues@hbase.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Commented] (HBASE-15219) Canary tool does not return non-zero exit when one of region stuck state MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/HBASE-15219?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15139873#comment-15139873 ] Hadoop QA commented on HBASE-15219: ----------------------------------- | (x) *{color:red}-1 overall{color}* | \\ \\ || Vote || Subsystem || Runtime || Comment || | {color:blue}0{color} | {color:blue} reexec {color} | {color:blue} 0m 0s {color} | {color:blue} Docker mode activated. {color} | | {color:green}+1{color} | {color:green} hbaseanti {color} | {color:green} 0m 0s {color} | {color:green} Patch does not have any anti-patterns. {color} | | {color:green}+1{color} | {color:green} @author {color} | {color:green} 0m 0s {color} | {color:green} The patch does not contain any @author tags. {color} | | {color:red}-1{color} | {color:red} test4tests {color} | {color:red} 0m 0s {color} | {color:red} The patch doesn't appear to include any new or modified tests. Please justify why no new tests are needed for this patch. Also please list what manual steps were performed to verify this patch. {color} | | {color:green}+1{color} | {color:green} mvninstall {color} | {color:green} 2m 26s {color} | {color:green} master passed {color} | | {color:green}+1{color} | {color:green} compile {color} | {color:green} 2m 43s {color} | {color:green} master passed with JDK v1.8.0_72 {color} | | {color:green}+1{color} | {color:green} compile {color} | {color:green} 2m 42s {color} | {color:green} master passed with JDK v1.7.0_95 {color} | | {color:green}+1{color} | {color:green} checkstyle {color} | {color:green} 15m 17s {color} | {color:green} master passed {color} | | {color:green}+1{color} | {color:green} mvneclipse {color} | {color:green} 1m 14s {color} | {color:green} master passed {color} | | {color:red}-1{color} | {color:red} findbugs {color} | {color:red} 7m 49s {color} | {color:red} branch/. no findbugs output file (./target/findbugsXml.xml) {color} | | {color:green}+1{color} | {color:green} javadoc {color} | {color:green} 2m 20s {color} | {color:green} master passed with JDK v1.8.0_72 {color} | | {color:green}+1{color} | {color:green} javadoc {color} | {color:green} 2m 49s {color} | {color:green} master passed with JDK v1.7.0_95 {color} | | {color:green}+1{color} | {color:green} mvninstall {color} | {color:green} 3m 11s {color} | {color:green} the patch passed {color} | | {color:green}+1{color} | {color:green} compile {color} | {color:green} 2m 43s {color} | {color:green} the patch passed with JDK v1.8.0_72 {color} | | {color:green}+1{color} | {color:green} javac {color} | {color:green} 2m 43s {color} | {color:green} the patch passed {color} | | {color:green}+1{color} | {color:green} compile {color} | {color:green} 2m 40s {color} | {color:green} the patch passed with JDK v1.7.0_95 {color} | | {color:green}+1{color} | {color:green} javac {color} | {color:green} 2m 40s {color} | {color:green} the patch passed {color} | | {color:green}+1{color} | {color:green} checkstyle {color} | {color:green} 14m 48s {color} | {color:green} the patch passed {color} | | {color:green}+1{color} | {color:green} mvneclipse {color} | {color:green} 1m 13s {color} | {color:green} the patch passed {color} | | {color:green}+1{color} | {color:green} whitespace {color} | {color:green} 0m 0s {color} | {color:green} Patch has no whitespace issues. {color} | | {color:green}+1{color} | {color:green} hadoopcheck {color} | {color:green} 21m 50s {color} | {color:green} Patch does not cause any errors with Hadoop 2.4.0 2.4.1 2.5.0 2.5.1 2.5.2 2.6.1 2.6.2 2.6.3 2.7.1. {color} | | {color:red}-1{color} | {color:red} findbugs {color} | {color:red} 7m 50s {color} | {color:red} patch/. no findbugs output file (./target/findbugsXml.xml) {color} | | {color:red}-1{color} | {color:red} findbugs {color} | {color:red} 2m 3s {color} | {color:red} hbase-server introduced 9 new FindBugs issues. {color} | | {color:green}+1{color} | {color:green} javadoc {color} | {color:green} 2m 22s {color} | {color:green} the patch passed with JDK v1.8.0_72 {color} | | {color:green}+1{color} | {color:green} javadoc {color} | {color:green} 2m 48s {color} | {color:green} the patch passed with JDK v1.7.0_95 {color} | | {color:red}-1{color} | {color:red} unit {color} | {color:red} 104m 55s {color} | {color:red} root in the patch failed with JDK v1.8.0_72. {color} | | {color:red}-1{color} | {color:red} unit {color} | {color:red} 97m 36s {color} | {color:red} hbase-server in the patch failed with JDK v1.8.0_72. {color} | | {color:red}-1{color} | {color:red} unit {color} | {color:red} 100m 40s {color} | {color:red} root in the patch failed with JDK v1.7.0_95. {color} | | {color:red}-1{color} | {color:red} unit {color} | {color:red} 92m 55s {color} | {color:red} hbase-server in the patch failed with JDK v1.7.0_95. {color} | | {color:green}+1{color} | {color:green} asflicense {color} | {color:green} 0m 32s {color} | {color:green} Patch does not generate ASF License warnings. {color} | | {color:black}{color} | {color:black} {color} | {color:black} 497m 45s {color} | {color:black} {color} | \\ \\ || Reason || Tests || | FindBugs | module:hbase-server | | | Read of unwritten public or protected field readFailureCount in org.apache.hadoop.hbase.tool.Canary$RegionServerStdOutSink.publishReadFailure(String, String) At Canary.java:protected field readFailureCount in org.apache.hadoop.hbase.tool.Canary$RegionServerStdOutSink.publishReadFailure(String, String) At Canary.java:[line 170] | | | Read of unwritten public or protected field readFailureCount in org.apache.hadoop.hbase.tool.Canary$StdOutSink.getReadFailureCount() At Canary.java:protected field readFailureCount in org.apache.hadoop.hbase.tool.Canary$StdOutSink.getReadFailureCount() At Canary.java:[line 119] | | | Read of unwritten public or protected field writeFailureCount in org.apache.hadoop.hbase.tool.Canary$StdOutSink.getWriteFailureCount() At Canary.java:protected field writeFailureCount in org.apache.hadoop.hbase.tool.Canary$StdOutSink.getWriteFailureCount() At Canary.java:[line 143] | | | Read of unwritten public or protected field readFailureCount in org.apache.hadoop.hbase.tool.Canary$StdOutSink.publishReadFailure(HRegionInfo, Exception) At Canary.java:protected field readFailureCount in org.apache.hadoop.hbase.tool.Canary$StdOutSink.publishReadFailure(HRegionInfo, Exception) At Canary.java:[line 124] | | | Read of unwritten public or protected field readFailureCount in org.apache.hadoop.hbase.tool.Canary$StdOutSink.publishReadFailure(HRegionInfo, HColumnDescriptor, Exception) At Canary.java:protected field readFailureCount in org.apache.hadoop.hbase.tool.Canary$StdOutSink.publishReadFailure(HRegionInfo, HColumnDescriptor, Exception) At Canary.java:[line 130] | | | Read of unwritten public or protected field writeFailureCount in org.apache.hadoop.hbase.tool.Canary$StdOutSink.publishWriteFailure(HRegionInfo, Exception) At Canary.java:protected field writeFailureCount in org.apache.hadoop.hbase.tool.Canary$StdOutSink.publishWriteFailure(HRegionInfo, Exception) At Canary.java:[line 148] | | | Read of unwritten public or protected field writeFailureCount in org.apache.hadoop.hbase.tool.Canary$StdOutSink.publishWriteFailure(HRegionInfo, HColumnDescriptor, Exception) At Canary.java:protected field writeFailureCount in org.apache.hadoop.hbase.tool.Canary$StdOutSink.publishWriteFailure(HRegionInfo, HColumnDescriptor, Exception) At Canary.java:[line 154] | | | Unwritten public or protected field:org.apache.hadoop.hbase.tool.Canary$StdOutSink.readFailureCount At Canary.java:[line 170] | | | Unwritten public or protected field:org.apache.hadoop.hbase.tool.Canary$StdOutSink.writeFailureCount At Canary.java:[line 143] | | JDK v1.8.0_72 Failed junit tests | hadoop.hbase.replication.multiwal.TestReplicationKillMasterRSCompressedWithMultipleWAL | | | hadoop.hbase.replication.multiwal.TestReplicationKillMasterRSCompressedWithMultipleWAL | | JDK v1.8.0_72 Timed out junit tests | org.apache.hadoop.hbase.snapshot.TestFlushSnapshotFromClient | | JDK v1.7.0_95 Timed out junit tests | org.apache.hadoop.hbase.regionserver.TestHRegion | \\ \\ || Subsystem || Report/Notes || | Docker | Client=1.9.1 Server=1.9.1 Image:yetus/hbase:date2016-02-09 | | JIRA Patch URL | https://issues.apache.org/jira/secure/attachment/12787069/HBASE-15219.v6.patch | | JIRA Issue | HBASE-15219 | | Optional Tests | asflicense javac javadoc unit findbugs hadoopcheck hbaseanti checkstyle compile | | uname | Linux be128618bfda 3.13.0-36-lowlatency #63-Ubuntu SMP PREEMPT Wed Sep 3 21:56:12 UTC 2014 x86_64 x86_64 x86_64 GNU/Linux | | Build tool | maven | | Personality | /home/jenkins/jenkins-slave/workspace/PreCommit-HBASE-Build/component/dev-support/hbase-personality.sh | | git revision | master / 7bb68b9 | | findbugs | v3.0.0 | | findbugs | https://builds.apache.org/job/PreCommit-HBASE-Build/494/artifact/patchprocess/new-findbugs-hbase-server.html | | unit | https://builds.apache.org/job/PreCommit-HBASE-Build/494/artifact/patchprocess/patch-unit-root-jdk1.8.0_72.txt | | unit | https://builds.apache.org/job/PreCommit-HBASE-Build/494/artifact/patchprocess/patch-unit-hbase-server-jdk1.8.0_72.txt | | unit | https://builds.apache.org/job/PreCommit-HBASE-Build/494/artifact/patchprocess/patch-unit-root-jdk1.7.0_95.txt | | unit | https://builds.apache.org/job/PreCommit-HBASE-Build/494/artifact/patchprocess/patch-unit-hbase-server-jdk1.7.0_95.txt | | unit test logs | https://builds.apache.org/job/PreCommit-HBASE-Build/494/artifact/patchprocess/patch-unit-root-jdk1.8.0_72.txt https://builds.apache.org/job/PreCommit-HBASE-Build/494/artifact/patchprocess/patch-unit-hbase-server-jdk1.8.0_72.txt https://builds.apache.org/job/PreCommit-HBASE-Build/494/artifact/patchprocess/patch-unit-hbase-server-jdk1.7.0_95.txt | | JDK v1.7.0_95 Test Results | https://builds.apache.org/job/PreCommit-HBASE-Build/494/testReport/ | | modules | C: . hbase-server U: . | | Max memory used | 428MB | | Powered by | Apache Yetus 0.1.0 http://yetus.apache.org | | Console output | https://builds.apache.org/job/PreCommit-HBASE-Build/494/console | This message was automatically generated. > Canary tool does not return non-zero exit when one of region stuck state > ------------------------------------------------------------------------- > > Key: HBASE-15219 > URL: https://issues.apache.org/jira/browse/HBASE-15219 > Project: HBase > Issue Type: Bug > Components: canary > Affects Versions: 0.98.16 > Reporter: Vishal Khandelwal > Assignee: Ted Yu > Priority: Critical > Fix For: 2.0.0, 1.3.0, 1.2.1, 1.1.4, 0.98.18 > > Attachments: HBASE-15219.v1.patch, HBASE-15219.v3.patch, HBASE-15219.v4.patch, HBASE-15219.v5.patch, HBASE-15219.v6.patch > > > {code} > 2016-02-05 12:24:18,571 ERROR [pool-2-thread-7] tool.Canary - read from region CAN_1,\x08\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00,1454667477865.00e77d07b8defe10704417fb99aa0418. column family 0 failed > org.apache.hadoop.hbase.client.RetriesExhaustedException: Failed after attempts=2, exceptions: > Fri Feb 05 12:24:15 GMT 2016, org.apache.hadoop.hbase.client.RpcRetryingCaller@54c9fea0, org.apache.hadoop.hbase.NotServingRegionException: org.apache.hadoop.hbase.NotServingRegionException: Region CAN_1,\x08\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00,1454667477865.00e77d07b8defe10704417fb99aa0418. is not online on isthbase02-dnds1-3-crd.eng.sfdc.net,60020,1454669984738 > at org.apache.hadoop.hbase.regionserver.HRegionServer.getRegionByEncodedName(HRegionServer.java:2852) > at org.apache.hadoop.hbase.regionserver.HRegionServer.getRegion(HRegionServer.java:4468) > at org.apache.hadoop.hbase.regionserver.HRegionServer.get(HRegionServer.java:2984) > at org.apache.hadoop.hbase.protobuf.generated.ClientProtos$ClientService$2.callBlockingMethod(ClientProtos.java:31186) > at org.apache.hadoop.hbase.ipc.RpcServer.call(RpcServer.java:2149) > at org.apache.hadoop.hbase.ipc.CallRunner.run(CallRunner.java:104) > at org.apache.hadoop.hbase.ipc.RpcExecutor.consumerLoop(RpcExecutor.java:133) > at org.apache.hadoop.hbase.ipc.RpcExecutor$1.run(RpcExecutor.java:108) > at java.lang.Thread.run(Thread.java:745) > -------- > -bash-4.1$ echo $? > 0 > {code} > Below code prints the error but it does sets/returns the exit code. Due to this tool can't be integrated with nagios or other alerting. > Ideally it should return error for failures. as pre the documentation: > > This tool will return non zero error codes to user for collaborating with other monitoring tools, such as Nagios. The error code definitions are: > private static final int USAGE_EXIT_CODE = 1; > private static final int INIT_ERROR_EXIT_CODE = 2; > private static final int TIMEOUT_ERROR_EXIT_CODE = 3; > private static final int ERROR_EXIT_CODE = 4; > > {code} > org.apache.hadoop.hbase.tool.Canary.RegionTask > public Void read() { > .... > try { > table = connection.getTable(region.getTable()); > tableDesc = table.getTableDescriptor(); > } catch (IOException e) { > LOG.debug("sniffRegion failed", e); > sink.publishReadFailure(region, e); > ... > return null; > } > {code} -- This message was sent by Atlassian JIRA (v6.3.4#6332)