Return-Path: X-Original-To: apmail-hbase-issues-archive@www.apache.org Delivered-To: apmail-hbase-issues-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id C4BA518591 for ; Thu, 10 Dec 2015 07:36:11 +0000 (UTC) Received: (qmail 90056 invoked by uid 500); 10 Dec 2015 07:36:11 -0000 Delivered-To: apmail-hbase-issues-archive@hbase.apache.org Received: (qmail 89979 invoked by uid 500); 10 Dec 2015 07:36:11 -0000 Mailing-List: contact issues-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Delivered-To: mailing list issues@hbase.apache.org Received: (qmail 89754 invoked by uid 99); 10 Dec 2015 07:36:11 -0000 Received: from arcas.apache.org (HELO arcas) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 10 Dec 2015 07:36:11 +0000 Received: from arcas.apache.org (localhost [127.0.0.1]) by arcas (Postfix) with ESMTP id 15ACC2C1F57 for ; Thu, 10 Dec 2015 07:36:11 +0000 (UTC) Date: Thu, 10 Dec 2015 07:36:11 +0000 (UTC) From: "Heng Chen (JIRA)" To: issues@hbase.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Commented] (HBASE-14949) Skip duplicate entries when replay WAL. MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/HBASE-14949?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15050250#comment-15050250 ] Heng Chen commented on HBASE-14949: ----------------------------------- I check current logic and found that we need to do nothing...... It has already skip the duplicate entries during split WAL into recovery region edits. And WAL named by timestamp when it is generated, so there is no need to use another format name. relates code {code: title=WALSplitter#splitLogFile} 352 if (lastFlushedSequenceId >= entry.getKey().getLogSeqNum()) { 353 editsSkipped++; 354 continue; 355 } {code} I think we can invalid this issue. > Skip duplicate entries when replay WAL. > --------------------------------------- > > Key: HBASE-14949 > URL: https://issues.apache.org/jira/browse/HBASE-14949 > Project: HBase > Issue Type: Sub-task > Reporter: Heng Chen > Attachments: HBASE-14949.patch > > > As HBASE-14004 design, there will be duplicate entries in different WAL. It happens when one hflush failed, we will close old WAL with 'acked hflushed' length, then open a new WAL and write the unacked hlushed entries into it. > So there maybe some overlap between old WAL and new WAL. > We should skip the duplicate entries when replay. I think it has no harm to current logic, maybe we do it first. -- This message was sent by Atlassian JIRA (v6.3.4#6332)