hbase-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "ramkrishna.s.vasudevan (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HBASE-4124) ZK restarted while assigning a region, new active HM re-assign it but the RS warned 'already online on this server'.
Date Tue, 23 Aug 2011 04:43:29 GMT

    [ https://issues.apache.org/jira/browse/HBASE-4124?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13089261#comment-13089261
] 

ramkrishna.s.vasudevan commented on HBASE-4124:
-----------------------------------------------

@Gao
{bq}
step 3: startup master again .

As per the scenario you have described when the master restarted the RS has it opened the
region? I think the scenario here is RS is also dead.
If so the assignment manager will try assigning it to a new RS.  Do you think any problem
here? 
If the RS is alive then the znode status will be OPENED state and the processRIT will take
care of clearing the node as it is already opened.  Could be be more clear on the state of
RS after you killed the master and also on the state of znode in zookeeper for that region.


> ZK restarted while assigning a region, new active HM re-assign it but the RS warned 'already
online on this server'.
> --------------------------------------------------------------------------------------------------------------------
>
>                 Key: HBASE-4124
>                 URL: https://issues.apache.org/jira/browse/HBASE-4124
>             Project: HBase
>          Issue Type: Bug
>          Components: master
>            Reporter: fulin wang
>            Assignee: gaojinchao
>             Fix For: 0.90.5
>
>         Attachments: HBASE-4124_Branch90V1_trial.patch, HBASE-4124_Branch90V2.patch,
log.txt
>
>   Original Estimate: 0.4h
>  Remaining Estimate: 0.4h
>
> ZK restarted while assigning a region, new active HM re-assign it but the RS warned 'already
online on this server'.
> Issue:
> The RS failed besause of 'already online on this server' and return; The HM can not receive
the message and report 'Regions in transition timed out'.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Mime
View raw message