Return-Path: X-Original-To: apmail-hbase-dev-archive@www.apache.org Delivered-To: apmail-hbase-dev-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 8A2B010C3F for ; Fri, 13 Dec 2013 09:16:30 +0000 (UTC) Received: (qmail 87292 invoked by uid 500); 13 Dec 2013 09:16:28 -0000 Delivered-To: apmail-hbase-dev-archive@hbase.apache.org Received: (qmail 87224 invoked by uid 500); 13 Dec 2013 09:16:25 -0000 Mailing-List: contact dev-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@hbase.apache.org Delivered-To: mailing list dev@hbase.apache.org Received: (qmail 87203 invoked by uid 99); 13 Dec 2013 09:16:23 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 13 Dec 2013 09:16:23 +0000 X-ASF-Spam-Status: No, hits=1.5 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of nkeywal@gmail.com designates 209.85.212.173 as permitted sender) Received: from [209.85.212.173] (HELO mail-wi0-f173.google.com) (209.85.212.173) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 13 Dec 2013 09:16:16 +0000 Received: by mail-wi0-f173.google.com with SMTP id hn9so773292wib.0 for ; Fri, 13 Dec 2013 01:15:55 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:from:date:message-id:subject:to :content-type; bh=5r7IAEkQlYTTfTSTKHUIyNPWczVewarDf+2ursHoZGQ=; b=B2brkyhVHgWXMc5npw9MKv1oIEVQ1m6sKTpqzKVK8GRLlfT5V3mRpHsRg2K++UdRG3 FMKSeBmJz6aKjP5UWMUdniczq4MbpzmddLS71Kt+pw8PU9uqGCV/YBAYOcQ0hS+yN1EI pAF135Wb/RJyJ841Yh76LZhAFXwANhaG+Axex8mNvNrcD1F19xXUbulD6QtXQcp/XpgA RXpBcZZ/M6R0Z2a6hB54hb6bEPgFFOVeJhA0K7IpraMNfo1noTZm+FbSoYahkKLcdacz czbTNXxDUZpFRH9RuD04VU/4o2jJixiMm0YEt/JiZqrgkuqzx5LiicYtVNciBZ/k8C7j L6Bw== X-Received: by 10.180.185.101 with SMTP id fb5mr1958067wic.11.1386926155822; Fri, 13 Dec 2013 01:15:55 -0800 (PST) MIME-Version: 1.0 Received: by 10.194.163.195 with HTTP; Fri, 13 Dec 2013 01:15:35 -0800 (PST) In-Reply-To: References: From: Nicolas Liochon Date: Fri, 13 Dec 2013 10:15:35 +0100 Message-ID: Subject: Re: Forked large unit tests seem slow to terminate To: dev@hbase.apache.org Content-Type: multipart/alternative; boundary=001a11c2448ed8fc8004ed66e8be X-Virus-Checked: Checked by ClamAV on apache.org --001a11c2448ed8fc8004ed66e8be Content-Type: text/plain; charset=ISO-8859-1 > I wonder if there is anything that can be done to wait for the forked process to actually terminate? As far as I know, no. We could be hacky, and, in the afterClass, start a deamon thread that will do a kill -9 after a few seconds. Or something similar, but that's very hacky imho. > Is this a Surefire bug? I don't think so, even if surefire has its share of bugs. The issue could be in HBase itself: surefire execute the "afterClass" method, this method does finish, but HBase then gets stuck in the shutdown hooks. On Fri, Dec 13, 2013 at 1:26 AM, Andrew Purtell wrote: > We eventually have no zombies, at least the zombie detector does not find > anything, but meanwhile on memory limited EC2 instances I'm seeing tests > killed by the OOM killer, very likely meaning that previous tests are still > in the process of shutting down and are hanging around in the process > table, consuming RAM, VMM, etc. I didn't see this before the switch to > Hadoop 2 as default. I'm not suggesting we switch back. I wonder if there > is anything that can be done to wait for the forked process to actually > terminate? Is this a Surefire bug? > > -- > Best regards, > > - Andy > > Problems worthy of attack prove their worth by hitting back. - Piet Hein > (via Tom White) > --001a11c2448ed8fc8004ed66e8be--