From: "Jim Kellerman (JIRA)"
To: hbase-dev@hadoop.apache.org
Reply-To: hbase-dev@hadoop.apache.org
Date: Mon, 26 Jan 2009 18:50:59 -0800 (PST)
Subject: [jira] Commented: (HBASE-1156) Improve lease handling

    [ https://issues.apache.org/jira/browse/HBASE-1156?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12667568#action_12667568 ]

Jim Kellerman commented on HBASE-1156:
--------------------------------------

If
we included the start code in the check, then the restarted server could start serving regions immediately, without its regions being detected as having been hosted by the dead server.

> Improve lease handling
> ----------------------
>
>                 Key: HBASE-1156
>                 URL: https://issues.apache.org/jira/browse/HBASE-1156
>             Project: Hadoop HBase
>          Issue Type: Improvement
>          Components: master, regionserver
>    Affects Versions: 0.19.0
>            Reporter: Jim Kellerman
>            Assignee: Jim Kellerman
>             Fix For: 0.20.0
>
>
> Currently, if a region server crashes and then restarts, it cannot be given work until its lease times out. This is because a lease is identified only by ipaddress:portnumber. If leases were also identified by the start code, the server could be given work immediately, because its log file includes the start code and will not interfere with the recovery of the log from its previous incarnation.
> Additionally, we wait in a master server thread for the server to leave the dead servers list, because dead servers are not identified by their start code either. Waiting ties up that master server thread (possibly for quite some time). Rather than waiting, we should throw an exception, since the region server already knows how to deal with an exception thrown from a regionServerStartup call.
> Finally, there is a bit of code cleanup that needs to be done in the region server when it receives a MSG_CALL_SERVER_STARTUP response from the master. It should not set up the HLog until reportForDuty completes successfully (which is what it does on the initial reportForDuty call).
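
To make the lease-identity point above concrete, here is a minimal sketch. It is not the HBase implementation: the class and method names (LeaseSketch, ServerIdentity, DeadServerLeases, blockedByAddress, blockedByIdentity) and the address and start code values are all hypothetical, chosen only for illustration. It compares a lookup keyed on ipaddress:portnumber alone with one that also includes the start code.

```java
import java.util.HashSet;
import java.util.Objects;
import java.util.Set;

public class LeaseSketch {

    /** Hypothetical server identity: ipaddress:portnumber plus start code. */
    static final class ServerIdentity {
        final String hostAndPort;
        final long startCode;

        ServerIdentity(String hostAndPort, long startCode) {
            this.hostAndPort = hostAndPort;
            this.startCode = startCode;
        }

        @Override
        public boolean equals(Object o) {
            if (this == o) return true;
            if (!(o instanceof ServerIdentity)) return false;
            ServerIdentity other = (ServerIdentity) o;
            return startCode == other.startCode
                && hostAndPort.equals(other.hostAndPort);
        }

        @Override
        public int hashCode() {
            return Objects.hash(hostAndPort, startCode);
        }
    }

    /** Hypothetical record of crashed servers whose leases have not yet expired. */
    static final class DeadServerLeases {
        private final Set<String> byAddress = new HashSet<>();
        private final Set<ServerIdentity> byIdentity = new HashSet<>();

        void markDead(ServerIdentity server) {
            byAddress.add(server.hostAndPort);
            byIdentity.add(server);
        }

        /** Address-only check: a restarted server collides with its old lease. */
        boolean blockedByAddress(ServerIdentity server) {
            return byAddress.contains(server.hostAndPort);
        }

        /** Check that includes the start code: a new incarnation does not collide. */
        boolean blockedByIdentity(ServerIdentity server) {
            return byIdentity.contains(server);
        }
    }

    public static void main(String[] args) {
        DeadServerLeases leases = new DeadServerLeases();
        // Same address and port, different start codes (illustrative values).
        ServerIdentity crashed = new ServerIdentity("10.0.0.5:60020", 1000L);
        ServerIdentity restarted = new ServerIdentity("10.0.0.5:60020", 2000L);

        leases.markDead(crashed);

        System.out.println(leases.blockedByAddress(restarted));  // prints "true"
        System.out.println(leases.blockedByIdentity(restarted)); // prints "false"
    }
}
```

With the address-only key, the restarted server is indistinguishable from its crashed incarnation and would have to wait for the old lease to time out. With the start code in the key, the two incarnations are distinct entries.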