hbase-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Jeffrey Zhong (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HBASE-9480) Regions are unexpectedly made offline in certain failure conditions
Date Tue, 10 Sep 2013 22:57:52 GMT

    [ https://issues.apache.org/jira/browse/HBASE-9480?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13763658#comment-13763658
] 

Jeffrey Zhong commented on HBASE-9480:
--------------------------------------

{quote}
At least, we don't see the same problem as for the second one, do we?
{quote}
We saw it in one test case where SSH was aborted and later a region move request come in to
move the region to somewhere else. In reality this could happen though rarely. I'm fine to
leave it as it is or cross check dead servers being processed by SSH before issuing deletes.

                
> Regions are unexpectedly made offline in certain failure conditions
> -------------------------------------------------------------------
>
>                 Key: HBASE-9480
>                 URL: https://issues.apache.org/jira/browse/HBASE-9480
>             Project: HBase
>          Issue Type: Bug
>            Reporter: Devaraj Das
>            Priority: Critical
>             Fix For: 0.96.0
>
>         Attachments: 9480-1.txt
>
>
> Came across this issue (HBASE-9338 test):
> 1. Client issues a request to move a region from ServerA to ServerB
> 2. ServerA is compacting that region and doesn't close region immediately. In fact, it
takes a while to complete the request.
> 3. The master in the meantime, sends another close request.
> 4. ServerA sends it a NotServingRegionException
> 5. Master handles the exception, deletes the znode, and invokes regionOffline for the
said region.
> 6. ServerA fails to operate on ZK in the CloseRegionHandler since the node is deleted.
> The region is permanently offline.
> There are potentially other situations where when a RegionServer is offline and the client
asks for a region move off from that server, the master makes the region offline.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Mime
View raw message