hbase-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "stack (JIRA)" <j...@apache.org>
Subject [jira] Commented: (HBASE-776) Master not reassigning .META. from failed/failing regionserver
Date Sat, 26 Jul 2008 19:44:32 GMT

    [ https://issues.apache.org/jira/browse/HBASE-776?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12617208#action_12617208

stack commented on HBASE-776:

This exception of yours Andrew seems pretty easy to manufacture.  I see it here in a little
test I'm running.  Kept getting the exception over and over for 30 mins now.

2008-07-26 19:09:22,111 WARN org.apache.hadoop.hbase.master.BaseScanner: Scan one META region:
{regionname: .META.,,1, startKey: <>, server: XX.XX.XX.XX:60020}
java.net.SocketTimeoutException: timed out waiting for rpc response
        at org.apache.hadoop.ipc.Client.call(Client.java:559)
        at org.apache.hadoop.hbase.ipc.HbaseRPC$Invoker.invoke(HbaseRPC.java:230)
        at $Proxy2.openScanner(Unknown Source)
        at org.apache.hadoop.hbase.master.BaseScanner.scanRegion(BaseScanner.java:159)
        at org.apache.hadoop.hbase.master.MetaScanner.scanOneMetaRegion(MetaScanner.java:69)
        at org.apache.hadoop.hbase.master.MetaScanner.maintenanceScan(MetaScanner.java:124)
        at org.apache.hadoop.hbase.master.BaseScanner.chore(BaseScanner.java:139)
        at org.apache.hadoop.hbase.Chore.run(Chore.java:63)

> Master not reassigning .META. from failed/failing regionserver
> --------------------------------------------------------------
>                 Key: HBASE-776
>                 URL: https://issues.apache.org/jira/browse/HBASE-776
>             Project: Hadoop HBase
>          Issue Type: Bug
>          Components: master
>    Affects Versions: 0.2.0
>         Environment: CentOS x86_64, JDK 1.6, Hadoop 0.17.1, HBase 0.2.0, r679585, Fri
Jul 25 16:47:26 UTC 2008
>            Reporter: Andrew Purtell
>         Attachments: hbase-hadoop-master-sjdc-atr-dc-1.log, hbase-hadoop-regionserver-sjdc-atr-dc-13.log,
> In our environment sometimes the regionserver carrying META is also assigned to the 'content'
table, into which objects retrieved from Internet crawling is stored. For unclear reason the
regionserver occasionally goes "deaf" (seperate issue) and when this happens META is no longer
available. The master then never reassigns META, so the whole cluster is down from this point
and does not recover. Logs attached.

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.

View raw message