From: "Todd Lipcon (JIRA)"
To: issues@hbase.apache.org
Date: Tue, 30 Aug 2011 02:46:38 +0000 (UTC)
Message-ID: <1483042827.5939.1314672398376.JavaMail.tomcat@hel.zones.apache.org>
In-Reply-To: <490703039.2303.1314598238646.JavaMail.tomcat@hel.zones.apache.org>
Subject: [jira] [Updated] (HBASE-4275) RS should communicate fatal "aborts" back to the master

     [ https://issues.apache.org/jira/browse/HBASE-4275?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Todd Lipcon updated HBASE-4275:
-------------------------------
    Attachment: 0005-HBASE-4275.-Report-fatals-to-HMaster.patch

Here's a patch which adds the new RPC. The master logs it locally and also keeps a ring-buffer of such log messages, constrained to 1MB by default.

I've tested this on a cluster in coordination with HBASE-4292 (which displays the ring buffer) and it's very useful for debugging.

The RS-abort behavior should probably get its own unit test before commit (or add some assertions to existing tests).

> RS should communicate fatal "aborts" back to the master
> -------------------------------------------------------
>
>                 Key: HBASE-4275
>                 URL: https://issues.apache.org/jira/browse/HBASE-4275
>             Project: HBase
>          Issue Type: Improvement
>          Components: master, regionserver
>    Affects Versions: 0.92.0
>            Reporter: Todd Lipcon
>         Attachments: 0005-HBASE-4275.-Report-fatals-to-HMaster.patch
>
>
> When a region server aborts, it should attempt to send an RPC to the master that contains (a) the reason for aborting, and (b) the last several KB of log messages, if available. This should help a lot in debugging.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira
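
For readers without the patch at hand, the size-constrained ring buffer mentioned in the comment could look roughly like the sketch below. This is only an illustration, not the code from the attached patch: the class name FatalMessageBuffer, its methods, and the choice to measure size in UTF-8 bytes are all assumptions made for the example. The only detail taken from the comment is the 1MB default cap.

```java
import java.nio.charset.StandardCharsets;
import java.util.ArrayDeque;
import java.util.ArrayList;
import java.util.List;

/** Hypothetical sketch of a byte-bounded message buffer; not the class from the patch. */
final class FatalMessageBuffer {
    // 1MB default, as described in the comment; the constant name is an assumption.
    static final long DEFAULT_MAX_BYTES = 1024L * 1024L;

    private final long maxBytes;
    private final ArrayDeque<String> messages = new ArrayDeque<>();
    private long usedBytes = 0;

    FatalMessageBuffer() {
        this(DEFAULT_MAX_BYTES);
    }

    FatalMessageBuffer(long maxBytes) {
        if (maxBytes <= 0) {
            throw new IllegalArgumentException("maxBytes must be positive");
        }
        this.maxBytes = maxBytes;
    }

    /** Appends a message, evicting the oldest ones once the cap is exceeded. */
    synchronized void add(String message) {
        messages.addLast(message);
        usedBytes += sizeOf(message);
        // Always keep the newest message, even if it alone exceeds the cap
        // (an assumption of this sketch, so the latest abort is never lost).
        while (usedBytes > maxBytes && messages.size() > 1) {
            usedBytes -= sizeOf(messages.removeFirst());
        }
    }

    /** Returns a copy of the retained messages, oldest first. */
    synchronized List<String> snapshot() {
        return new ArrayList<>(messages);
    }

    synchronized long sizeInBytes() {
        return usedBytes;
    }

    // Size is measured in UTF-8 bytes here; a real implementation may count differently.
    private static long sizeOf(String s) {
        return s.getBytes(StandardCharsets.UTF_8).length;
    }
}
```

In this sketch the master-side handler for the new RPC would call add() with each reported abort, and a status page such as the one in HBASE-4292 could render snapshot(). Evicting whole messages rather than truncating bytes keeps every retained entry readable.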