hbase-dev mailing list archives

From "stack (JIRA)" <j...@apache.org>
Subject [jira] Resolved: (HBASE-1440) master won't go down because joined on a rootscanner that is waiting for ever
Date Tue, 19 May 2009 20:41:45 GMT

     [ https://issues.apache.org/jira/browse/HBASE-1440?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

stack resolved HBASE-1440.
--------------------------

    Resolution: Fixed

Committed below:

{code}
Index: src/java/org/apache/hadoop/hbase/master/RegionManager.java
===================================================================
--- src/java/org/apache/hadoop/hbase/master/RegionManager.java  (revision 776447)
+++ src/java/org/apache/hadoop/hbase/master/RegionManager.java  (working copy)
@@ -978,7 +978,8 @@
         // regionServerReport message from the HRegionServer that has been
         // allocated the ROOT region below.
         try {
-          rootRegionLocation.wait();
+          // Cycle rather than hold here in case master is closed meantime.
+          rootRegionLocation.wait(this.master.threadWakeFrequency);
         } catch (InterruptedException e) {
           // continue
         }
{code}
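
For readers who want to see the pattern in isolation, here is a minimal, self-contained sketch of the idea behind the patch: wait with a timeout inside a loop that re-checks a shutdown flag, so the waiting thread cannot outlive a shutdown. The class `TimedWaitSketch`, its fields, and the `shutdown()` method are hypothetical stand-ins, not HBase code. The loop condition is also an assumption, since the diff context above does not show the code surrounding the `wait` call.

```java
import java.util.concurrent.atomic.AtomicBoolean;
import java.util.concurrent.atomic.AtomicReference;

// Hypothetical stand-in for the master-side state; not the actual HBase classes.
final class TimedWaitSketch {
    private final AtomicReference<String> rootRegionLocation = new AtomicReference<>();
    private final AtomicBoolean closed = new AtomicBoolean(false);
    private final long wakeFrequencyMs; // plays the role of threadWakeFrequency

    TimedWaitSketch(long wakeFrequencyMs) {
        this.wakeFrequencyMs = wakeFrequencyMs;
    }

    /** Returns the location once it is set, or null if shutdown was requested first. */
    String waitForRootRegionLocation() {
        synchronized (rootRegionLocation) {
            // Assumed loop: re-check both conditions every time the wait returns.
            while (rootRegionLocation.get() == null && !closed.get()) {
                try {
                    // Cycle rather than hold, so a shutdown is noticed within one period.
                    rootRegionLocation.wait(wakeFrequencyMs);
                } catch (InterruptedException e) {
                    // continue, as in the patch; the loop condition is re-checked
                }
            }
            return rootRegionLocation.get();
        }
    }

    /** Publishes the location and wakes any waiter immediately. */
    void setRootRegionLocation(String location) {
        synchronized (rootRegionLocation) {
            rootRegionLocation.set(location);
            rootRegionLocation.notifyAll();
        }
    }

    /** Sets the flag without notifying; the waiter only sees it because it cycles. */
    void shutdown() {
        closed.set(true);
    }
}
```

With an untimed `wait()`, a thread joined on the waiter would block until something called `notifyAll()`. With the timed variant, the waiter wakes at most `wakeFrequencyMs` after `shutdown()` and exits the loop, so a `join()` on it can complete.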

> master won't go down because joined on a rootscanner that is waiting for ever
> -----------------------------------------------------------------------------
>
>                 Key: HBASE-1440
>                 URL: https://issues.apache.org/jira/browse/HBASE-1440
>             Project: Hadoop HBase
>          Issue Type: Bug
>            Reporter: stack
>             Fix For: 0.20.0
>
>
> The wait below depends on an open event reaching the master, which won't happen if we're shutting down.
> {code}
> "RegionManager.rootScanner" daemon prio=10 tid=0x00007fdc98197c00 nid=0x7538 in Object.wait() [0x0000000040e7f000..0x0000000040e7fa80]
>    java.lang.Thread.State: WAITING (on object monitor)
>     at java.lang.Object.wait(Native Method)
>     at java.lang.Object.wait(Object.java:485)
>     at org.apache.hadoop.hbase.master.RegionManager.waitForRootRegionLocation(RegionManager.java:981)
>     - locked <0x00007fdcad0cacd0> (a java.util.concurrent.atomic.AtomicReference)
>     at org.apache.hadoop.hbase.master.HMaster.waitForRootRegionLocation(HMaster.java:362)
>     at org.apache.hadoop.hbase.master.RootScanner.scanRoot(RootScanner.java:45)
>     at org.apache.hadoop.hbase.master.RootScanner.maintenanceScan(RootScanner.java:79)
>     at org.apache.hadoop.hbase.master.BaseScanner.chore(BaseScanner.java:135)
>     at org.apache.hadoop.hbase.Chore.run(Chore.java:68)
> {code}

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

