hbase-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "binlijin (JIRA)" <j...@apache.org>
Subject [jira] [Comment Edited] (HBASE-16960) RegionServer hang when aborting
Date Fri, 28 Oct 2016 08:51:58 GMT

    [ https://issues.apache.org/jira/browse/HBASE-16960?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15614234#comment-15614234
] 

binlijin edited comment on HBASE-16960 at 10/28/16 8:51 AM:
------------------------------------------------------------

The problem can happen when:
(1)FSHLog#rollWriter throw exception and LogRoller call regionserver.abort 
(2)RingBufferEventHandler.onEvent process FSWALEntry throw DamagedWALException 
(3)RingBufferEventHandler.onEvent process safe point, set RingBufferEventHandler.exception=null
(4)RingBufferEventHandler.onEvent process SyncFuture (MemStoreFlusher.1  FSHLog.sync)
  endOfBatch=false
(5)RingBufferEventHandler.onEvent process FSWALEntry (ASYNC_WAL  FSHLog.append) throw
DamagedWALException
There is no other events, so the MemStoreFlusher.1  FSHLog.sync will hang.


was (Author: aoxiang):
The problem can happen when:
(1)FSHLog#rollWriter throw exception and LogRoller call regionserver.abort 
(2)RingBufferEventHandler.onEvent process FSWALEntry throw DamagedWALException 
(3)RingBufferEventHandler.onEvent process safe point, set RingBufferEventHandler.exception=null
(4)RingBufferEventHandler.onEvent process SyncFuture (MemStoreFlusher.1  FSHLog.sync)
  endOfBatch=false
(5)RingBufferEventHandler.onEvent process FSWALEntry (ASYNC_WAL  FSHLog.append)
There is no other events, so the MemStoreFlusher.1  FSHLog.sync will hang.

> RegionServer hang when aborting
> -------------------------------
>
>                 Key: HBASE-16960
>                 URL: https://issues.apache.org/jira/browse/HBASE-16960
>             Project: HBase
>          Issue Type: Bug
>            Reporter: binlijin
>         Attachments: RingBufferEventHandler.png, RingBufferEventHandler_exception.png,
SyncFuture.png, SyncFuture_exception.png, rs1081.jstack
>
>
> We see regionserver hang when aborting several times and cause all regions on this regionserver
out of service and then all affected applications stop works.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message