hbase-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Yu Li (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (HBASE-14305) Deadlock observed in MVCC during Unit Testing
Date Wed, 26 Aug 2015 06:10:46 GMT

     [ https://issues.apache.org/jira/browse/HBASE-14305?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Yu Li updated HBASE-14305:
--------------------------
    Description: 
As titled, this failure is reported in a UT check by HadoopQA, below is part of the jstack
output:
{noformat}
"main" prio=10 tid=0x00007fb77000a800 nid=0x5004 in Object.wait() [0x00007fb778799000]
   java.lang.Thread.State: WAITING (on object monitor)
	at java.lang.Object.wait(Native Method)
	- waiting on <0x00000007ee9a5260> (a java.util.LinkedList)
	at org.apache.hadoop.hbase.regionserver.MultiVersionConcurrencyControl.waitForPreviousTransactionsComplete(MultiVersionConcurrencyControl.java:224)
	- locked <0x00000007ee9a5260> (a java.util.LinkedList)
	at org.apache.hadoop.hbase.regionserver.HRegion.internalPrepareFlushCache(HRegion.java:2254)
	at org.apache.hadoop.hbase.regionserver.HRegion.internalFlushcache(HRegion.java:2061)
	at org.apache.hadoop.hbase.regionserver.HRegion.internalFlushcache(HRegion.java:2026)
	at org.apache.hadoop.hbase.regionserver.HRegion.internalFlushcache(HRegion.java:2016)
	at org.apache.hadoop.hbase.regionserver.HRegion.doClose(HRegion.java:1423)
	at org.apache.hadoop.hbase.regionserver.HRegion.close(HRegion.java:1344)
	- locked <0x00000007ee9c85e8> (a java.lang.Object)
	at org.apache.hadoop.hbase.regionserver.HRegion.close(HRegion.java:1295)
	at org.apache.hadoop.hbase.HBaseTestingUtility.closeRegionAndWAL(HBaseTestingUtility.java:352)
	at org.apache.hadoop.hbase.regionserver.TestHRegion.testWritesWhileGetting(TestHRegion.java:3999)
	at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
{noformat}
It seems waiting on waitQueue never got notified and cause the case a zombie

Full jstack output please refer to [this link|https://builds.apache.org/job/PreCommit-HBASE-Build/15244//consoleFull]

  was:
As titled, this failure is reported in a UT check by HadoopQA, below is part of the jstack
output:
{noformat}
"main" prio=10 tid=0x00007fb77000a800 nid=0x5004 in Object.wait() [0x00007fb778799000]
   java.lang.Thread.State: WAITING (on object monitor)
	at java.lang.Object.wait(Native Method)
	- waiting on <0x00000007ee9a5260> (a java.util.LinkedList)
	at org.apache.hadoop.hbase.regionserver.MultiVersionConcurrencyControl.waitForPreviousTransactionsComplete(MultiVersionConcurrencyControl.java:224)
	- locked <0x00000007ee9a5260> (a java.util.LinkedList)
	at org.apache.hadoop.hbase.regionserver.HRegion.internalPrepareFlushCache(HRegion.java:2254)
	at org.apache.hadoop.hbase.regionserver.HRegion.internalFlushcache(HRegion.java:2061)
	at org.apache.hadoop.hbase.regionserver.HRegion.internalFlushcache(HRegion.java:2026)
	at org.apache.hadoop.hbase.regionserver.HRegion.internalFlushcache(HRegion.java:2016)
	at org.apache.hadoop.hbase.regionserver.HRegion.doClose(HRegion.java:1423)
	at org.apache.hadoop.hbase.regionserver.HRegion.close(HRegion.java:1344)
	- locked <0x00000007ee9c85e8> (a java.lang.Object)
	at org.apache.hadoop.hbase.regionserver.HRegion.close(HRegion.java:1295)
	at org.apache.hadoop.hbase.HBaseTestingUtility.closeRegionAndWAL(HBaseTestingUtility.java:352)
	at org.apache.hadoop.hbase.regionserver.TestHRegion.testWritesWhileGetting(TestHRegion.java:3999)
	at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
{noformat}
It seems we are waiting on the same waitQueue object after got its lock.

Full jstack output please refer to [this link|https://builds.apache.org/job/PreCommit-HBASE-Build/15244//consoleFull]


> Deadlock observed in MVCC during Unit Testing
> ---------------------------------------------
>
>                 Key: HBASE-14305
>                 URL: https://issues.apache.org/jira/browse/HBASE-14305
>             Project: HBase
>          Issue Type: Bug
>    Affects Versions: 2.0.0
>            Reporter: Yu Li
>
> As titled, this failure is reported in a UT check by HadoopQA, below is part of the jstack
output:
> {noformat}
> "main" prio=10 tid=0x00007fb77000a800 nid=0x5004 in Object.wait() [0x00007fb778799000]
>    java.lang.Thread.State: WAITING (on object monitor)
> 	at java.lang.Object.wait(Native Method)
> 	- waiting on <0x00000007ee9a5260> (a java.util.LinkedList)
> 	at org.apache.hadoop.hbase.regionserver.MultiVersionConcurrencyControl.waitForPreviousTransactionsComplete(MultiVersionConcurrencyControl.java:224)
> 	- locked <0x00000007ee9a5260> (a java.util.LinkedList)
> 	at org.apache.hadoop.hbase.regionserver.HRegion.internalPrepareFlushCache(HRegion.java:2254)
> 	at org.apache.hadoop.hbase.regionserver.HRegion.internalFlushcache(HRegion.java:2061)
> 	at org.apache.hadoop.hbase.regionserver.HRegion.internalFlushcache(HRegion.java:2026)
> 	at org.apache.hadoop.hbase.regionserver.HRegion.internalFlushcache(HRegion.java:2016)
> 	at org.apache.hadoop.hbase.regionserver.HRegion.doClose(HRegion.java:1423)
> 	at org.apache.hadoop.hbase.regionserver.HRegion.close(HRegion.java:1344)
> 	- locked <0x00000007ee9c85e8> (a java.lang.Object)
> 	at org.apache.hadoop.hbase.regionserver.HRegion.close(HRegion.java:1295)
> 	at org.apache.hadoop.hbase.HBaseTestingUtility.closeRegionAndWAL(HBaseTestingUtility.java:352)
> 	at org.apache.hadoop.hbase.regionserver.TestHRegion.testWritesWhileGetting(TestHRegion.java:3999)
> 	at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
> {noformat}
> It seems waiting on waitQueue never got notified and cause the case a zombie
> Full jstack output please refer to [this link|https://builds.apache.org/job/PreCommit-HBASE-Build/15244//consoleFull]



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message