hbase-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Anoop Sam John (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HBASE-7034) Bad version, failed OPENING to OPENED but master thinks it is open anyways
Date Wed, 16 Jan 2013 02:15:11 GMT

    [ https://issues.apache.org/jira/browse/HBASE-7034?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13554663#comment-13554663
] 

Anoop Sam John commented on HBASE-7034:
---------------------------------------

bq.I feel that we should check for the data part whether it was set correctly. The id part
is any way going to be the same i feel.

Yes Ram. I am also thinking so. This was not final patch any way. Was thinking that I can
do this in final patch. This was for the feedback from you guys. :)  As I read from the code
the intent was to check this process id against the id in the zoo data. This is to check while
this process try to change the data in the zookeeper another process changed that already
or not. In such a case this process will give up this op. That is the whole reason why we
add the id stuff in the data.  :)

The old check was any way wrong which compare the id against data.

Yes Ted I can add the test case in the final patch.
                
> Bad version, failed OPENING to OPENED but master thinks it is open anyways
> --------------------------------------------------------------------------
>
>                 Key: HBASE-7034
>                 URL: https://issues.apache.org/jira/browse/HBASE-7034
>             Project: HBase
>          Issue Type: Bug
>          Components: Region Assignment
>    Affects Versions: 0.94.2
>            Reporter: stack
>            Assignee: Anoop Sam John
>         Attachments: HBASE-7034_94.patch, TestRecoverableZooKeeper.java
>
>
> I have this in RS log:
> {code}
> 2012-10-22 02:21:50,698 ERROR org.apache.hadoop.hbase.regionserver.handler.OpenRegionHandler:
Failed transitioning node b9,\xEE\xAE\x9BiQO\x89]+a\xE0\x7F\xB7'X?,1349052737638.9af7cfc9b15910a0b3d714bf40a3248f.
from OPENING to OPENED -- closing region
> org.apache.zookeeper.KeeperException$BadVersionException: KeeperErrorCode = BadVersion
for /hbase/unassigned/9af7cfc9b15910a0b3d714bf40a3248f
> {code}
> Master says this (it is bulk assigning):
> {code}
> ....
> 2012-10-22 02:21:40,673 DEBUG org.apache.hadoop.hbase.zookeeper.ZKUtil: master:10302-0xb3a862e57a503ba
Set watcher on existing znode /hbase/unassigned/9af7cfc9b15910a0b3d714bf40a3248f
> ...
> then this
> ....
> 2012-10-22 02:23:47,089 DEBUG org.apache.hadoop.hbase.zookeeper.ZKUtil: master:10302-0xb3a862e57a503ba
Set watcher on existing znode /hbase/unassigned/9af7cfc9b15910a0b3d714bf40a3248f
> ....
> 2012-10-22 02:24:34,176 DEBUG org.apache.hadoop.hbase.zookeeper.ZKUtil: master:10302-0xb3a862e57a503ba
Retrieved 112 byte(s) of data from znode /hbase/unassigned/9af7cfc9b15910a0b3d714bf40a3248f
and set watcher; region=b9,\xEE\xAE\x9BiQO\x89]+a\xE0\x7F\xB7'X?,1349052737638.9af7cfc9b15910a0b3d714bf40a3248f.,
origin=sv4r17s44,10304,1350872216778, state=RS_ZK_REGION_OPENED
> etc.
> {code}
> Disagreement as to what is going on here.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Mime
View raw message