Return-Path: Delivered-To: apmail-hbase-issues-archive@www.apache.org Received: (qmail 15159 invoked from network); 23 Mar 2011 01:47:07 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 23 Mar 2011 01:47:07 -0000 Received: (qmail 70952 invoked by uid 500); 23 Mar 2011 01:47:06 -0000 Delivered-To: apmail-hbase-issues-archive@hbase.apache.org Received: (qmail 70898 invoked by uid 500); 23 Mar 2011 01:47:06 -0000 Mailing-List: contact issues-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Delivered-To: mailing list issues@hbase.apache.org Received: (qmail 70878 invoked by uid 99); 23 Mar 2011 01:47:06 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 23 Mar 2011 01:47:06 +0000 X-ASF-Spam-Status: No, hits=-2000.0 required=5.0 tests=ALL_TRUSTED,T_RP_MATCHES_RCVD X-Spam-Check-By: apache.org Received: from [140.211.11.116] (HELO hel.zones.apache.org) (140.211.11.116) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 23 Mar 2011 01:47:04 +0000 Received: from hel.zones.apache.org (hel.zones.apache.org [140.211.11.116]) by hel.zones.apache.org (Postfix) with ESMTP id 7776042F94 for ; Wed, 23 Mar 2011 01:46:06 +0000 (UTC) Date: Wed, 23 Mar 2011 01:46:06 +0000 (UTC) From: "Hudson (JIRA)" To: issues@hbase.apache.org Message-ID: <1922935649.5209.1300844766486.JavaMail.tomcat@hel.zones.apache.org> In-Reply-To: <1771506091.4442.1300821605667.JavaMail.tomcat@hel.zones.apache.org> Subject: [jira] [Commented] (HBASE-3687) Bulk assign on startup should handle a ServerNotRunningException MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 X-Virus-Checked: Checked by ClamAV on apache.org [ https://issues.apache.org/jira/browse/HBASE-3687?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13009960#comment-13009960 ] Hudson commented on HBASE-3687: ------------------------------- Integrated in HBase-TRUNK #1803 (See [https://hudson.apache.org/hudson/job/HBase-TRUNK/1803/]) HBASE-3687 Bulk assign on startup should handle a ServerNotRunningException; FIX PROB. FOUND BY TED YU > Bulk assign on startup should handle a ServerNotRunningException > ---------------------------------------------------------------- > > Key: HBASE-3687 > URL: https://issues.apache.org/jira/browse/HBASE-3687 > Project: HBase > Issue Type: Bug > Reporter: stack > Assignee: stack > Fix For: 0.90.2 > > Attachments: 3687.txt > > > On startup, we do bulk assign. At the moment, if any problem during bulk assign, we consider startup failed and expectation is that you need to retry (We need to make this better but that is not what this issue is about). One exception that we should handle is the case where a RS is slow coming up and its rpc is not yet up listening. In this case it will throw: ServerNotRunningException. We should retry at least this one exception during bulk assign. > We had this happen to us starting up a prod cluster. -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira