hbase-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Jerry He (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HBASE-11341) ZKProcedureCoordinatorRpcs should respond only to members
Date Sat, 14 Jun 2014 03:07:01 GMT

    [ https://issues.apache.org/jira/browse/HBASE-11341?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14031430#comment-14031430
] 

Jerry He commented on HBASE-11341:
----------------------------------

This enhancement is a good fix for the problem!

I have not seen that strange warning message previously on live cluster, including 0.94, 0.96,
and 0.98.

In theory, it seems that it should not happen in the procedure framework. 
The master procedure coordinator creates /hbase/<proc>/reached/<instance>,  then
watch for create events on /hbase/<proc>/reached/<instance>/<member>.
It does not listen for create event on /hbase/<proc>/reached/<instance>.

On the other hand, region server member listens for create event on /hbase/<proc>/reached/<instance>.
 Then creates /hbase/<proc>/reached/<instance>/<member>

My guess is still that the master and the region server sides are mixed up now.

Does it sound right?


> ZKProcedureCoordinatorRpcs should respond only to members
> ---------------------------------------------------------
>
>                 Key: HBASE-11341
>                 URL: https://issues.apache.org/jira/browse/HBASE-11341
>             Project: HBase
>          Issue Type: Bug
>    Affects Versions: 0.99.0, 0.94.20, 0.98.3
>            Reporter: Matteo Bertozzi
>            Assignee: Matteo Bertozzi
>            Priority: Minor
>             Fix For: 0.99.0, 0.94.21, 0.98.4
>
>         Attachments: HBASE-11341-v0.patch
>
>
> ZKProcedureCoordinatorRpcs nodeCreated() responds to events of every znode under the
procedure and not only members.
> {code}
> INFO: Received created event: /hbase/<proc>/reached/<instance>
> INFO: Received created event: /hbase/<proc>/reached/<instance>/<member>
> {code}
> the result is a warning (no other side effects)
> {code}
> WARN [main-EventThread] procedure.ProcedureCoordinator: Member '<instance>' is
trying to release an unknown procedure 'reached
> {code}
> [~apurtell] posted in HBASE-10926 the steps to reproduce it:
> {noformat}
> $ cd ./src/hbase
> $ git checkout master
> $ mvn -DskipTests clean install
> {noformat}
> In one console:
> {noformat}
> $ ./bin/hbase master start
> {noformat}
> In another console:
> {noformat}
> $ ./bin/hbase org.apache.hadoop.hbase.util.LoadTestTool -num_keys 100000 \
>     -read 10:100 -write 1:100:10
> {noformat}
> In a third console:
> {noformat}
> $ ./bin/hbase shell
> hbase> while true do ; flush 'cluster_test' ; sleep 10 ; end
> {noformat}



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Mime
View raw message