Return-Path: Delivered-To: apmail-hbase-issues-archive@www.apache.org Received: (qmail 96906 invoked from network); 18 Feb 2011 05:13:38 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 18 Feb 2011 05:13:38 -0000 Received: (qmail 30888 invoked by uid 500); 18 Feb 2011 05:13:38 -0000 Delivered-To: apmail-hbase-issues-archive@hbase.apache.org Received: (qmail 30777 invoked by uid 500); 18 Feb 2011 05:13:36 -0000 Mailing-List: contact issues-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Delivered-To: mailing list issues@hbase.apache.org Received: (qmail 30767 invoked by uid 99); 18 Feb 2011 05:13:35 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 18 Feb 2011 05:13:35 +0000 X-ASF-Spam-Status: No, hits=-1996.4 required=5.0 tests=ALL_TRUSTED,FS_REPLICA,T_RP_MATCHES_RCVD X-Spam-Check-By: apache.org Received: from [140.211.11.116] (HELO hel.zones.apache.org) (140.211.11.116) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 18 Feb 2011 05:13:33 +0000 Received: from hel.zones.apache.org (hel.zones.apache.org [140.211.11.116]) by hel.zones.apache.org (Postfix) with ESMTP id 69A361AC627 for ; Fri, 18 Feb 2011 05:13:12 +0000 (UTC) Date: Fri, 18 Feb 2011 05:13:12 +0000 (UTC) From: "Todd Lipcon (JIRA)" To: issues@hbase.apache.org Message-ID: <509798148.1064.1298005992428.JavaMail.tomcat@hel.zones.apache.org> In-Reply-To: <1265762513.2972.1297190817469.JavaMail.tomcat@hel.zones.apache.org> Subject: [jira] Updated: (HBASE-3515) [replication] ReplicationSource can miss a log after RS comes out of GC MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 X-Virus-Checked: Checked by ClamAV on apache.org [ https://issues.apache.org/jira/browse/HBASE-3515?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Todd Lipcon updated HBASE-3515: ------------------------------- Fix Version/s: (was: 0.90.1) 0.90.2 > [replication] ReplicationSource can miss a log after RS comes out of GC > ----------------------------------------------------------------------- > > Key: HBASE-3515 > URL: https://issues.apache.org/jira/browse/HBASE-3515 > Project: HBase > Issue Type: Bug > Affects Versions: 0.90.0 > Reporter: Jean-Daniel Cryans > Assignee: Jean-Daniel Cryans > Priority: Critical > Fix For: 0.90.2 > > Attachments: HBASE-3515.patch > > > This is from Hudson build 1738, if a log is about to be rolled and the ZK connection is already closed then the replication code will fail at adding the new log in ZK but the log will still be rolled and it's possible that some edits will make it in. > From the log: > {quote} > 2011-02-08 10:21:20,618 FATAL [RegionServer:0;vesta.apache.org,46117,1297160399378.logRoller] regionserver.HRegionServer(1383): > ABORTING region server serverName=vesta.apache.org,46117,1297160399378, load=(requests=1525, regions=12, > usedHeap=273, maxHeap=1244): Failed add log to list > org.apache.zookeeper.KeeperException$ConnectionLossException: KeeperErrorCode = ConnectionLoss for > /1/replication/rs/vesta.apache.org,46117,1297160399378/2/vesta.apache.org%3A46117.1297160480509 > ... > 2011-02-08 10:21:22,444 DEBUG [MASTER_META_SERVER_OPERATIONS-vesta.apache.org:56008-0] wal.HLogSplitter(258): > Splitting hlog 8 of 8: hdfs://localhost:55474/user/hudson/.logs/vesta.apache.org,46117,1297160399378/vesta.apache.org%3A46117.1297160480509, length=0 > 2011-02-08 10:21:22,862 DEBUG [MASTER_META_SERVER_OPERATIONS-vesta.apache.org:56008-0] wal.HLogSplitter(436): > Pushed=31 entries from hdfs://localhost:55474/user/hudson/.logs/vesta.apache.org,46117,1297160399378/vesta.apache.org%3A46117.1297160480509 > {quote} > The easiest thing to do would be let the exception out and cancel the log roll. -- This message is automatically generated by JIRA. - For more information on JIRA, see: http://www.atlassian.com/software/jira