hbase-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Appy (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HBASE-19803) False positive for the HBASE-Find-Flaky-Tests job
Date Tue, 16 Jan 2018 19:52:00 GMT

    [ https://issues.apache.org/jira/browse/HBASE-19803?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16327681#comment-16327681
] 

Appy commented on HBASE-19803:
------------------------------

Oh, this looks promising: http://maven.apache.org/surefire/maven-surefire-plugin/faq.html#vm-termination
And we have quite a few System.exit in our code.

Since the failure is in hbase-server tests, just looking for System.exit calls in that module
and those on which it depends. Also, ignoring the calls from main() fns of tools. Here's list
of possible culprits:
- HMaster#InitializationMonitor#run()
...there may be others, but not obvious at first look.


> False positive for the HBASE-Find-Flaky-Tests job
> -------------------------------------------------
>
>                 Key: HBASE-19803
>                 URL: https://issues.apache.org/jira/browse/HBASE-19803
>             Project: HBase
>          Issue Type: Bug
>            Reporter: Duo Zhang
>            Priority: Major
>
> It reports two hangs for TestAsyncTableGetMultiThreaded, but I checked the surefire output
> https://builds.apache.org/job/HBASE-Flaky-Tests/24830/artifact/hbase-server/target/surefire-reports/org.apache.hadoop.hbase.client.TestAsyncTableGetMultiThreaded-output.txt
> This one was likely to be killed in the middle of the run within 20 seconds.
> https://builds.apache.org/job/HBASE-Flaky-Tests/24852/artifact/hbase-server/target/surefire-reports/org.apache.hadoop.hbase.client.TestAsyncTableGetMultiThreaded-output.txt
> This one was also killed within about 1 minutes.
> The test is declared as LargeTests so the time limit should be 10 minutes. It seems that
the jvm may crash during the mvn test run and then we will kill all the running tests and
then we may mark some of them as hang which leads to the false positive.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Mime
View raw message