hbase-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Rajesh Nishtala (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HBASE-13471) Deadlock closing a region
Date Tue, 21 Apr 2015 00:41:01 GMT

    [ https://issues.apache.org/jira/browse/HBASE-13471?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14504047#comment-14504047
] 

Rajesh Nishtala commented on HBASE-13471:
-----------------------------------------

In fairness I think there are two bugs here. (1) the client has a row / region mismatch under
some circumstances that are yet TBD and (2) when that occurs there's a possible infinite loop.
This addresses the later by propagating up the wrong region information to the client. With
this fix in we can hopefully find the cause of (1) with the extra debugging information that
results from the fix for (2).

> Deadlock closing a region
> -------------------------
>
>                 Key: HBASE-13471
>                 URL: https://issues.apache.org/jira/browse/HBASE-13471
>             Project: HBase
>          Issue Type: Bug
>            Reporter: Elliott Clark
>            Assignee: Rajesh Nishtala
>         Attachments: HBASE-13471.patch
>
>
> {code}
> Thread 4139 (regionserver/hbase412.example.com/10.158.6.53:60020-splits-1429003183537):
>   State: WAITING
>   Blocked count: 131
>   Waited count: 228
>   Waiting on java.util.concurrent.locks.ReentrantReadWriteLock$NonfairSync@50714dc3
>   Stack:
>     sun.misc.Unsafe.park(Native Method)
>     java.util.concurrent.locks.LockSupport.park(LockSupport.java:175)
>     java.util.concurrent.locks.AbstractQueuedSynchronizer.parkAndCheckInterrupt(AbstractQueuedSynchronizer.java:836)
>     java.util.concurrent.locks.AbstractQueuedSynchronizer.acquireQueued(AbstractQueuedSynchronizer.java:870)
>     java.util.concurrent.locks.AbstractQueuedSynchronizer.acquire(AbstractQueuedSynchronizer.java:1199)
>     java.util.concurrent.locks.ReentrantReadWriteLock$WriteLock.lock(ReentrantReadWriteLock.java:943)
>     org.apache.hadoop.hbase.regionserver.HRegion.doClose(HRegion.java:1371)
>     org.apache.hadoop.hbase.regionserver.HRegion.close(HRegion.java:1325)
>     org.apache.hadoop.hbase.regionserver.SplitTransactionImpl.stepsBeforePONR(SplitTransactionImpl.java:352)
>     org.apache.hadoop.hbase.regionserver.SplitTransactionImpl.createDaughters(SplitTransactionImpl.java:252)
>     org.apache.hadoop.hbase.regionserver.SplitTransactionImpl.execute(SplitTransactionImpl.java:509)
>     org.apache.hadoop.hbase.regionserver.SplitRequest.run(SplitRequest.java:84)
>     java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
>     java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
>     java.lang.Thread.run(Thread.java:745)
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message