Date: Mon, 19 Aug 2013 07:30:54 +0000 (UTC)
From: "Matteo Bertozzi (JIRA)"
To: issues@hbase.apache.org
Subject: [jira] [Commented] (HBASE-8760) possible loss of data in snapshot taken after region split


    [ https://issues.apache.org/jira/browse/HBASE-8760?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13743598#comment-13743598 ]

Matteo Bertozzi commented on HBASE-8760:
----------------------------------------

{quote}We should not include the offline regions' ServerName in the online snapshot procedure. Otherwise the snapshot procedure will timeout{quote}

Yeah, that's right. The addendum looks good to me. Thanks.
I'll do a final test with more machines, but looks like v8+addendum will be the good one.

> possible loss of data in snapshot taken after region split
> ----------------------------------------------------------
>
>                 Key: HBASE-8760
>                 URL: https://issues.apache.org/jira/browse/HBASE-8760
>             Project: HBase
>          Issue Type: Bug
>          Components: snapshots
>    Affects Versions: 0.94.8, 0.95.1
>            Reporter: Jerry He
>             Fix For: 0.98.0, 0.94.12, 0.96.0
>
>         Attachments: HBase-8760-0.94.8.patch, HBase-8760-0.94.8-v1.patch, HBASE-8760-0.94-v4.patch, HBASE-8760-0.94-v5.patch, HBASE-8760-0.94-v6.patch, HBASE-8760-0.94-v7.patch, HBASE-8760-0.94-v8-addendum.patch, HBASE-8760-0.94-v8.patch, HBASE-8760-thz-v0.patch, HBASE-8760-trunk-v8.patch, HBASE-8760-v4.patch, v4-patch-testing-0.94.zip, v4-patch-testing-0.95.2.zip
>
>
> Right after a region split, but before the daughter regions are compacted, we have two daughter regions containing Reference files to the parent hfiles.
> If we take a snapshot at that moment, the snapshot will succeed, but it will only contain the daughter Reference files. Since there is no hold on the parent hfiles, the HFile Cleaner will delete them soon afterwards, once the daughter regions no longer need them.
> At a minimum, we need to keep these parent hfiles from being deleted.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
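
The failure mode in the issue description can be illustrated with a small toy model. This is only a sketch and not HBase code: `StoreFile`, `files_held_by_snapshot`, and `cleaner_may_delete` are hypothetical names invented for illustration, and the attached patches may well solve the problem differently. It assumes a cleaner that deletes a parent hfile once no live Reference file and no snapshot still holds it.

```python
from dataclasses import dataclass
from typing import Iterable, Optional, Set, Tuple


@dataclass(frozen=True)
class StoreFile:
    """Hypothetical model of a store file: a real hfile, or a Reference
    file in a daughter region that points at a parent hfile."""
    region: str
    name: str
    parent: Optional["StoreFile"] = None  # set only for Reference files


def files_held_by_snapshot(
    daughter_files: Iterable[StoreFile], follow_references: bool
) -> Set[Tuple[str, str]]:
    """Return the (region, name) pairs a snapshot keeps a hold on.

    With follow_references=False, only the daughter Reference files are
    recorded, which is the situation described in the issue. With
    follow_references=True, the referenced parent hfiles are recorded too.
    """
    held: Set[Tuple[str, str]] = set()
    for f in daughter_files:
        held.add((f.region, f.name))
        if follow_references and f.parent is not None:
            held.add((f.parent.region, f.parent.name))
    return held


def cleaner_may_delete(
    hfile: StoreFile,
    live_references: Iterable[StoreFile],
    snapshot_holds: Set[Tuple[str, str]],
) -> bool:
    """Assumed cleaner rule: a parent hfile is deletable when no live
    Reference file points at it and no snapshot holds it."""
    still_referenced = any(r.parent == hfile for r in live_references)
    return not still_referenced and (hfile.region, hfile.name) not in snapshot_holds


if __name__ == "__main__":
    parent = StoreFile("parent-region", "hfile-1")
    refs = [
        StoreFile("daughter-a", "hfile-1.parent-region", parent),
        StoreFile("daughter-b", "hfile-1.parent-region", parent),
    ]
    naive = files_held_by_snapshot(refs, follow_references=False)
    safe = files_held_by_snapshot(refs, follow_references=True)

    # After the daughters are compacted, their Reference files are gone.
    print("naive snapshot, parent deletable:", cleaner_may_delete(parent, [], naive))
    print("safe snapshot, parent deletable:", cleaner_may_delete(parent, [], safe))
```

In this model the naive snapshot leaves the parent hfile deletable as soon as the daughters are compacted, while a snapshot that also records the referenced parent hfiles keeps it alive, which is the "hold on the parent hfiles" the description asks for.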