hbase-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Jeffrey Zhong (JIRA)" <j...@apache.org>
Subject [jira] [Created] (HBASE-10085) Some regions aren't re-assigned after a mater restarts
Date Thu, 05 Dec 2013 01:21:35 GMT
Jeffrey Zhong created HBASE-10085:
-------------------------------------

             Summary: Some regions aren't re-assigned after a mater restarts
                 Key: HBASE-10085
                 URL: https://issues.apache.org/jira/browse/HBASE-10085
             Project: HBase
          Issue Type: Bug
          Components: Region Assignment
    Affects Versions: 0.96.0
            Reporter: Jeffrey Zhong
            Assignee: Jeffrey Zhong
             Fix For: 0.98.0, 0.96.1


We see this issue happened in a cluster restart:

1) when shutdown a cluster, some regions are in offline state because no Region servers are
available(stop RS and then Master)
2) When the cluster restarts, the offlined regions are forced to be offline again and SSH
skip re-assigning them by function AM.processServerShutdown as shown below.

{code}
2013-12-03 10:41:56,686 INFO  [master:h2-ubuntu12-sec-1386048659-hbase-8:60000] master.AssignmentManager:
Processing 873dbd8c269f44d0aefb0f66c5b53537 in state: M_ZK_REGION_OFFLINE
2013-12-03 10:41:56,686 DEBUG [master:h2-ubuntu12-sec-1386048659-hbase-8:60000] master.AssignmentManager:
RIT 873dbd8c269f44d0aefb0f66c5b53537 in state=M_ZK_REGION_OFFLINE was on deadserver; forcing
offline
...
2013-12-03 10:41:56,739 DEBUG [AM.-pool1-t8] master.AssignmentManager: Force region state
offline {873dbd8c269f44d0aefb0f66c5b53537 state=OFFLINE, ts=1386067316737, server=h2-ubuntu12-sec-1386048659-hbase-6.cs1cloud.internal,60020,1386066968696}
...
2013-12-03 10:41:57,223 WARN  [MASTER_SERVER_OPERATIONS-h2-ubuntu12-sec-1386048659-hbase-8:60000-3]
master.RegionStates: THIS SHOULD NOT HAPPEN: unexpected {873dbd8c269f44d0aefb0f66c5b53537
state=OFFLINE, ts=1386067316737, server=h2-ubuntu12-sec-1386048659-hbase-6.cs1cloud.internal,60020,1386066968696}

{code}



--
This message was sent by Atlassian JIRA
(v6.1#6144)

Mime
View raw message