Return-Path: X-Original-To: apmail-hadoop-common-issues-archive@minotaur.apache.org Delivered-To: apmail-hadoop-common-issues-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 9DD6117F4C for ; Wed, 25 Feb 2015 15:59:18 +0000 (UTC) Received: (qmail 82132 invoked by uid 500); 25 Feb 2015 15:59:05 -0000 Delivered-To: apmail-hadoop-common-issues-archive@hadoop.apache.org Received: (qmail 82089 invoked by uid 500); 25 Feb 2015 15:59:05 -0000 Mailing-List: contact common-issues-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: common-issues@hadoop.apache.org Delivered-To: mailing list common-issues@hadoop.apache.org Received: (qmail 82074 invoked by uid 99); 25 Feb 2015 15:59:05 -0000 Received: from arcas.apache.org (HELO arcas.apache.org) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 25 Feb 2015 15:59:05 +0000 Date: Wed, 25 Feb 2015 15:59:05 +0000 (UTC) From: "Tony Reix (JIRA)" To: common-issues@hadoop.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Commented] (HADOOP-11617) Several tests are not stable (on OpenJDK / x86_64 / RHEL 7.1) MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/HADOOP-11617?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14336661#comment-14336661 ] Tony Reix commented on HADOOP-11617: ------------------------------------ I have built a tool for summarizing the results. Out of 16 runs, I've found 30 unstable tests. 30 T F E S | NN/n crypto.key.kms.server.TestKMS 18 0-3 0-3 0 | 10/16 means: 18 tests, 0 to 3 failures and 0 to 3 errors, and 1 to 3 failures/errors appeared 10 times out of 16 (6 times with 0 failure/error). Unstable Tests out of n= 16 runs: 30 T F E S | NN/n -------------------------------------------------------------------------------------------------------------------------- crypto.key.kms.server.TestKMS 18 0-3 0-3 0 | 10/16 ha.TestZKFailoverControllerStress 3 0 0-1 0 | 3/16 hdfs.TestDataTransferKeepalive 4 0-1 0 0 | 2/16 hdfs.TestEncryptionZones 18 0 0-1 0 | 1/16 hdfs.TestHFlush 9 0 0-1 0 | 4/16 hdfs.qjournal.TestSecureNNWithQJM 2 0 0-1 0 | 1/16 hdfs.server.balancer.TestBalancer 22 0-1 0 0 | 2/16 hdfs.server.namenode.TestCacheDirectives 12 0-1 0 0 | 1/16 hdfs.server.namenode.ha.TestDNFencing 6 0-1 0 0 | 3/16 hdfs.util.TestByteArrayManager 0-3 | 15/16 hdfs.web.TestWebHdfsFileSystemContract 39 0-1 0 0 | 1/16 ipc.TestDecayRpcScheduler 9 0-2 0 0 | 14/16 mapred.TestSpecialCharactersInOutputPath 1 0 0-1 0 | 2/16 mapred.gridmix.TestRecordFactory 3 0-1 0 0 | 1/16 mapreduce.lib.db.TestDbClasses 4 0 0-1 0 | 1/16 mapreduce.v2.app.TestKill 5 0-1 0 0 | 5/16 metrics2.impl.TestMetricsSystemImpl 9 0-1 0 0 | 2/16 security.token.delegation.TestZKDelegationTokenSecretManager 3 0-1 0 0 | 1/16 tracing.TestTracing 5 0-1 0 0 | 1/16 yarn.applications.distributedshell.TestDistributedShell 11 0-1 0 0 | 4/16 yarn.client.TestResourceTrackerOnHA 0-1 0 0 0 | 14/16 yarn.client.api.impl.TestNMClient 2 0-2 0 0 | 15/16 yarn.server.nodemanager.containermanager.logaggregation.TestLogAggregationService 16 1-2 0 0 | 7/16 yarn.server.resourcemanager.TestApplicationMasterService 0-5 0 0 0 | 15/16 yarn.server.resourcemanager.TestRMRestart 46 0-3 0 0 | 9/16 yarn.server.resourcemanager.applicationsmanager.TestAMRestart 6 0-1 0-2 0 | 8/16 yarn.server.resourcemanager.rmapp.TestRMAppTransitions 42 0-1 0 0 | 4/16 yarn.server.resourcemanager.scheduler.capacity.TestCapacityScheduler 30 0-1 0 0 | 2/16 yarn.server.resourcemanager.scheduler.fair.TestSchedulingPolicy 2 0 0-1 0 | 1/16 yarn.server.resourcemanager.webapp.TestRMWebappAuthentication 0-2 0 0 0 | 15/16 > Several tests are not stable (on OpenJDK / x86_64 / RHEL 7.1) > ------------------------------------------------------------- > > Key: HADOOP-11617 > URL: https://issues.apache.org/jira/browse/HADOOP-11617 > Project: Hadoop Common > Issue Type: Bug > Environment: RHEL 7.1 / x86_64 > Reporter: Tony Reix > > I'm now porting Hadoop 2.4.1 on PPC64 HW. > In order to have an idea about the stability of the tests, I've run several times all Hadoop tests in a RHEL 7.1 / x86_64 / OpenJDK environment. > I've run it 12 times, for now, on the EXACT same environment. > Using some tool, I have extracted the list of tests that do not always produce the same results, listed below, showing some test unstability. > I 'm now doing the same for version 2.6.0 on OpenJDK/Ubuntu/x86_64. > Report gives: > test-name N: T F E S | > where: > - N is the number of the test log file > - and: > T: Tests run > F: Failure > E: Error > S: Skipped > As an example (see below for full data): > hdfs.server.datanode.TestDeleteBlockPool 0: 2 0 0 0 | > ........ > hdfs.server.datanode.TestDeleteBlockPool 4: 2 0 1 0 | > ........ > hdfs.server.datanode.TestDeleteBlockPool 11: 2 0 0 0 | > means that this test, out of 12 runs, had an error once, with 0 Failure and 0 Skipped test. -- This message was sent by Atlassian JIRA (v6.3.4#6332)