Return-Path: X-Original-To: apmail-hbase-issues-archive@www.apache.org Delivered-To: apmail-hbase-issues-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id E89C6F612 for ; Wed, 29 May 2013 20:50:21 +0000 (UTC) Received: (qmail 87107 invoked by uid 500); 29 May 2013 20:50:20 -0000 Delivered-To: apmail-hbase-issues-archive@hbase.apache.org Received: (qmail 87044 invoked by uid 500); 29 May 2013 20:50:20 -0000 Mailing-List: contact issues-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Delivered-To: mailing list issues@hbase.apache.org Received: (qmail 86985 invoked by uid 99); 29 May 2013 20:50:20 -0000 Received: from arcas.apache.org (HELO arcas.apache.org) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 29 May 2013 20:50:20 +0000 Date: Wed, 29 May 2013 20:50:20 +0000 (UTC) From: "Ted Yu (JIRA)" To: issues@hbase.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Created] (HBASE-8646) Intermittent TestIOFencing#testFencingAroundCompaction failure due to region getting stuck in compaction MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 Ted Yu created HBASE-8646: ----------------------------- Summary: Intermittent TestIOFencing#testFencingAroundCompaction failure due to region getting stuck in compaction Key: HBASE-8646 URL: https://issues.apache.org/jira/browse/HBASE-8646 Project: HBase Issue Type: Test Reporter: Ted Yu >From http://54.241.6.143/job/HBase-TRUNK/org.apache.hbase$hbase-server/348/testReport/junit/org.apache.hadoop.hbase/TestIOFencing/testFencingAroundCompaction/ (the underlying region is tabletest,,1369855507443.c251a1d71e75fed8e490db63419edcf1.): {code} 2013-05-29 19:25:20,363 DEBUG [pool-1-thread-1] catalog.CatalogTracker(208): Stopping catalog tracker org.apache.hadoop.hbase.catalog.CatalogTracker@6280d069 2013-05-29 19:25:20,366 INFO [pool-1-thread-1] hbase.TestIOFencing(255): Waiting for compaction to be about to start 2013-05-29 19:25:20,367 DEBUG [pool-1-thread-1] hbase.TestIOFencing$CompactionBlockerRegion(107): waiting for compaction to block 2013-05-29 19:25:20,367 DEBUG [pool-1-thread-1] hbase.TestIOFencing$CompactionBlockerRegion(109): compaction block reached 2013-05-29 19:25:20,367 INFO [pool-1-thread-1] hbase.TestIOFencing(257): Starting a new server 2013-05-29 19:25:20,424 DEBUG [pool-1-thread-1] client.HConnectionManager(2811): regionserver/ip-10-197-74-184.us-west-1.compute.internal/10.197.74.184:0 HConnection server-to-server retries=100 ... 2013-05-29 19:25:20,861 INFO [pool-1-thread-1] hbase.TestIOFencing(260): Killing region server ZK lease ... 2013-05-29 19:25:21,030 DEBUG [RS_CLOSE_REGION-ip-10-197-74-184.us-west-1.compute.internal,37836,1369855503920-0] handler.CloseRegionHandler(125): Processing close of tabletest,,1369855507443.c251a1d71e75fed8e490db63419edcf1. 2013-05-29 19:25:21,031 DEBUG [RS_CLOSE_REGION-ip-10-197-74-184.us-west-1.compute.internal,37836,1369855503920-0] regionserver.HRegion(928): Closing tabletest,,1369855507443.c251a1d71e75fed8e490db63419edcf1.: disabling compactions & flushes 2013-05-29 19:25:21,031 DEBUG [RS_CLOSE_REGION-ip-10-197-74-184.us-west-1.compute.internal,37836,1369855503920-0] regionserver.HRegion(1022): waiting for 1 compactions to complete for region tabletest,,1369855507443.c251a1d71e75fed8e490db63419edcf1. ... 2013-05-29 19:25:27,037 INFO [pool-1-thread-1] hbase.TestIOFencing(265): Waiting for the new server to pick up the region tabletest,,1369855507443.c251a1d71e75fed8e490db63419edcf1. {code} The test started new region server. However, the region got stuck in: {code} public void waitForFlushesAndCompactions() { synchronized (writestate) { while (writestate.compacting > 0 || writestate.flushing) { LOG.debug("waiting for " + writestate.compacting + " compactions" + (writestate.flushing ? " & cache flush" : "") + " to complete for region " + this); try { writestate.wait(); {code} This led to the timeout: {code} assertTrue("Timed out waiting for new server to open region", System.currentTimeMillis() - startWaitTime < 60000); {code} -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira