hbase-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Andrew Purtell (JIRA)" <j...@apache.org>
Subject [jira] Commented: (HBASE-932) Regionserver restart
Date Thu, 04 Feb 2010 04:50:28 GMT

    [ https://issues.apache.org/jira/browse/HBASE-932?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12829444#action_12829444
] 

Andrew Purtell commented on HBASE-932:
--------------------------------------

Reconsider HRS restart strategy as part of handling transient filesystem problems. Should
happen only after trying a "safe mode" first, as part of a ride over HDFS restart capability.
If can't leave safe mode after some configured interval elapses, then consider how to shut
down.

Can't shut down if unfinished writes to WAL are still pending. 


> Regionserver restart
> --------------------
>
>                 Key: HBASE-932
>                 URL: https://issues.apache.org/jira/browse/HBASE-932
>             Project: Hadoop HBase
>          Issue Type: Improvement
>            Reporter: stack
>             Fix For: 0.21.0
>
>
> If we drop a flush or we fail close a write-ahead log, we currently shutdown the regionserver
(we fail because of hdfs usually).  Rather than shut themselves down, how about they restart?
 The restart at least in the HBASE-930 might fix the issue shaking DFSClient so it gets sense
again.  Even is HDFS is bad, it'll come around eventually.  The HRS restarting itself plus
HBASE-926 fix will make for fast recovery.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message