hadoop-common-issues mailing list archives

From "Tony Reix (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HADOOP-11617) Several tests are not stable (on OpenJDK / x86_64 / RHEL 7.1)
Date Wed, 25 Feb 2015 15:59:05 GMT

    [ https://issues.apache.org/jira/browse/HADOOP-11617?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14336661#comment-14336661 ]

Tony Reix commented on HADOOP-11617:
------------------------------------

I have built a tool for summarizing the results.
Out of 16 runs, I've found 30 unstable tests.

 30                                  T      F      E      S |   NN/n
crypto.key.kms.server.TestKMS       18    0-3    0-3      0 |  10/16
means:
18 tests run, with 0 to 3 failures and 0 to 3 errors per run; 1 to 3 failures/errors appeared
in 10 of the 16 runs (the other 6 runs had no failure or error).
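
The tool itself is not included in this message. As a rough illustration only, the kind of summary described above could be computed along these lines in Python. The function names and the input layout (one dict per run, mapping a test name to its T/F/E/S counts) are assumptions made for this sketch, and NN is counted here as the number of runs with at least one failure or error, which follows the TestKMS example; the rule the real tool applies is not stated.

```python
from collections import defaultdict


def fmt_range(values):
    """Render a column as 'lo-hi', or as a single number when it never varies."""
    lo, hi = min(values), max(values)
    return str(lo) if lo == hi else f"{lo}-{hi}"


def summarize(runs):
    """Summarize unstable tests.

    runs: list of dicts (hypothetical layout), one per run, mapping
          test name -> (tests_run, failures, errors, skipped).
    Returns {test name: (T, F, E, S, "NN/n")} for tests whose result
    is not identical in every run.
    """
    per_test = defaultdict(list)
    for run in runs:
        for name, counts in run.items():
            per_test[name].append(counts)

    n = len(runs)
    summary = {}
    for name, results in per_test.items():
        if len(set(results)) == 1:
            continue  # same T/F/E/S in every run: treated as stable
        t, f, e, s = (fmt_range(col) for col in zip(*results))
        # Assumption: NN = number of runs with at least one failure or error.
        bad = sum(1 for (_, fails, errs, _) in results if fails or errs)
        summary[name] = (t, f, e, s, f"{bad}/{n}")
    return summary
```

With three runs in which a test reports (18, 0, 0, 0), (18, 3, 0, 0) and (18, 0, 1, 0), this sketch yields T=18, F=0-3, E=0-1, S=0 and 2/3 for that test.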


Unstable tests out of n = 16 runs:
Test (30 unstable)                                                                    T      F      E      S |   NN/n
--------------------------------------------------------------------------------------------------------------------------
crypto.key.kms.server.TestKMS                                                        18    0-3    0-3      0 |  10/16
ha.TestZKFailoverControllerStress                                                     3      0    0-1      0 |   3/16
hdfs.TestDataTransferKeepalive                                                        4    0-1      0      0 |   2/16
hdfs.TestEncryptionZones                                                             18      0    0-1      0 |   1/16
hdfs.TestHFlush                                                                       9      0    0-1      0 |   4/16
hdfs.qjournal.TestSecureNNWithQJM                                                     2      0    0-1      0 |   1/16
hdfs.server.balancer.TestBalancer                                                    22    0-1      0      0 |   2/16
hdfs.server.namenode.TestCacheDirectives                                             12    0-1      0      0 |   1/16
hdfs.server.namenode.ha.TestDNFencing                                                 6    0-1      0      0 |   3/16
hdfs.util.TestByteArrayManager                                                      0-3                      |  15/16
hdfs.web.TestWebHdfsFileSystemContract                                               39    0-1      0      0 |   1/16
ipc.TestDecayRpcScheduler                                                             9    0-2      0      0 |  14/16
mapred.TestSpecialCharactersInOutputPath                                              1      0    0-1      0 |   2/16
mapred.gridmix.TestRecordFactory                                                      3    0-1      0      0 |   1/16
mapreduce.lib.db.TestDbClasses                                                        4      0    0-1      0 |   1/16
mapreduce.v2.app.TestKill                                                             5    0-1      0      0 |   5/16
metrics2.impl.TestMetricsSystemImpl                                                   9    0-1      0      0 |   2/16
security.token.delegation.TestZKDelegationTokenSecretManager                          3    0-1      0      0 |   1/16
tracing.TestTracing                                                                   5    0-1      0      0 |   1/16
yarn.applications.distributedshell.TestDistributedShell                              11    0-1      0      0 |   4/16
yarn.client.TestResourceTrackerOnHA                                                 0-1      0      0      0 |  14/16
yarn.client.api.impl.TestNMClient                                                     2    0-2      0      0 |  15/16
yarn.server.nodemanager.containermanager.logaggregation.TestLogAggregationService    16    1-2      0      0 |   7/16
yarn.server.resourcemanager.TestApplicationMasterService                            0-5      0      0      0 |  15/16
yarn.server.resourcemanager.TestRMRestart                                            46    0-3      0      0 |   9/16
yarn.server.resourcemanager.applicationsmanager.TestAMRestart                         6    0-1    0-2      0 |   8/16
yarn.server.resourcemanager.rmapp.TestRMAppTransitions                               42    0-1      0      0 |   4/16
yarn.server.resourcemanager.scheduler.capacity.TestCapacityScheduler                 30    0-1      0      0 |   2/16
yarn.server.resourcemanager.scheduler.fair.TestSchedulingPolicy                       2      0    0-1      0 |   1/16
yarn.server.resourcemanager.webapp.TestRMWebappAuthentication                       0-2      0      0      0 |  15/16

> Several tests are not stable (on OpenJDK / x86_64 / RHEL 7.1)
> -------------------------------------------------------------
>
>                 Key: HADOOP-11617
>                 URL: https://issues.apache.org/jira/browse/HADOOP-11617
>             Project: Hadoop Common
>          Issue Type: Bug
>         Environment: RHEL 7.1 / x86_64
>            Reporter: Tony Reix
>
> I'm now porting Hadoop 2.4.1 to PPC64 hardware.
> To get an idea of the stability of the tests, I've run all Hadoop tests several times
> in a RHEL 7.1 / x86_64 / OpenJDK environment.
> I've run them 12 times so far, in the EXACT same environment.
> Using a tool, I have extracted the list of tests that do not always produce the same
> results, listed below, showing some test instability.
> I'm now doing the same for version 2.6.0 on OpenJDK/Ubuntu/x86_64.
> Report gives:
> test-name N: T F E S |
> where:
> - N is the number of the test log file
> - and:
> T: Tests run
> F: Failure
> E: Error
> S: Skipped
> As an example (see below for full data):
> hdfs.server.datanode.TestDeleteBlockPool                      0:    2   0   0   0 |
> ........
> hdfs.server.datanode.TestDeleteBlockPool                      4:    2   0   1   0 |
> ........
> hdfs.server.datanode.TestDeleteBlockPool                     11:    2   0   0   0 |
> means that this test, out of 12 runs, had an error once, with 0 failures and 0 skipped
> tests.
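
For readers who want to process per-run lines in the "test-name N: T F E S |" layout quoted above, a minimal Python sketch follows. The regular expression and the `parse_line` name are assumptions made for illustration, not part of the reporter's tool, and the sketch expects the line without the leading "> " quote marker.

```python
import re

# Assumed layout: <test-name> <run number>: <T> <F> <E> <S> |
LINE_RE = re.compile(r"^\s*(\S+)\s+(\d+):\s+(\d+)\s+(\d+)\s+(\d+)\s+(\d+)\s*\|")


def parse_line(line):
    """Return (name, run, (T, F, E, S)) for a report line, or None if it does not match."""
    m = LINE_RE.match(line)
    if m is None:
        return None  # e.g. the "........" filler lines
    name = m.group(1)
    run, t, f, e, s = (int(x) for x in m.groups()[1:])
    return name, run, (t, f, e, s)
```

Applied to the run 4 line of the example, this returns the test name, the run number 4, and the counts (2, 0, 1, 0); filler lines such as "........" return None.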



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
