Return-Path: Delivered-To: apmail-hbase-user-archive@www.apache.org Received: (qmail 23287 invoked from network); 29 Jul 2010 20:17:36 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 29 Jul 2010 20:17:36 -0000 Received: (qmail 40929 invoked by uid 500); 29 Jul 2010 20:17:36 -0000 Delivered-To: apmail-hbase-user-archive@hbase.apache.org Received: (qmail 40872 invoked by uid 500); 29 Jul 2010 20:17:35 -0000 Mailing-List: contact user-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hbase.apache.org Delivered-To: mailing list user@hbase.apache.org Received: (qmail 40864 invoked by uid 99); 29 Jul 2010 20:17:35 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 29 Jul 2010 20:17:35 +0000 X-ASF-Spam-Status: No, hits=2.2 required=10.0 tests=FREEMAIL_FROM,HTML_MESSAGE,RCVD_IN_DNSWL_NONE,SPF_PASS,T_TO_NO_BRKTS_FREEMAIL X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of kuosenhao@gmail.com designates 209.85.213.169 as permitted sender) Received: from [209.85.213.169] (HELO mail-yx0-f169.google.com) (209.85.213.169) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 29 Jul 2010 20:17:28 +0000 Received: by yxs7 with SMTP id 7so351390yxs.14 for ; Thu, 29 Jul 2010 13:17:08 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:received:received:date:message-id :subject:from:to:content-type; bh=Z5mhl0KhFFI0+VcT7nZJkonq0jilPDIp/qGGj/uVHfw=; b=xfneEYIj5DOpiaC23zs370Flt2Gd9qnGVnI3FZBPhpkW4Xd4iqryi7QlEO6zDXBJJS XBKmw8/gojN5lVJJO3yNUZOtrw7zsrrQFS6uvizH+GxvtEZ4meMhWxWF6dUfoq3iMNBi 4k95qQ1mMBd5zb064hIpkM1m+YtM7MlZ6iLrE= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:date:message-id:subject:from:to:content-type; b=pM7ehAiTs6ZtaQyOaezwx6Z2jZ1ndDj8//sBHbipSF8LpjYfHm5YxUT+99jCgAwYPe ww1CYKGkGpEy8hRxdg+WpZ19CNmLfRJ+lDnxccsx2Mp19K/nFw5ATNaZt/86XxxkEDgj GtDOcgMmP8dPrh+LiOUS1rJUOr4FIlAakncYQ= MIME-Version: 1.0 Received: by 10.101.69.16 with SMTP id w16mr732932ank.141.1280434627715; Thu, 29 Jul 2010 13:17:07 -0700 (PDT) Received: by 10.231.199.82 with HTTP; Thu, 29 Jul 2010 13:17:07 -0700 (PDT) Date: Thu, 29 Jul 2010 13:17:07 -0700 Message-ID: Subject: GC [ParNew...] took 299 secs causing region server to die From: Steve Kuo To: user@hbase.apache.org Content-Type: multipart/alternative; boundary=0016368e258f24a4d9048c8c6b3b --0016368e258f24a4d9048c8c6b3b Content-Type: text/plain; charset=ISO-8859-1 I kept running into the stop-the-world GC during batch import of data into hbase. The configuration of a node in the 8-node cluster is as follows. * 4-core * 64-bit JVM * 8 GB of memory * CDH2 for hadoop and 0.20.5 for hbase * TT: 128 MB * DN: 128 MB * 2 Mappers at 512 MB each * 2 Reducer at 512 MB each * 1 regionserver at 4096 MB The import job was a mapper only job so that only TT, DN, 2 mappers and regionserver were running. Below is the JMX output for the dead regionserver. Time: 2010-07-29 12:25:47 Used: 224,949 kbytes Committed: 670,728 kbytes Max: 4,185,792 kbytes GC time: 5 minutes on ParNew (2,126 collections) 0.000 seconds on ConcurrentMarkSweep (0 collections) Clearly the regionserver was spent all GC time on ParNew, which was not surprising as I was imported tons of data. But I could not figure out why the same GC that usually take way less than a second, took 299 secs at line 3. Any enlightenment is greatly appreciated. I will change ParNew to 6M as documented in Performance Tuning page and gave it another shot. 010-07-28T12:06:57.249-0700: 2406.986: [GC 2406.986: [ParNew: 17786K->755K(19136K), 0.0015410 secs] 348288K->331394K(620416K) icms_dc=27 , 0.0016330 secs] [Times: user=0.00 sys=0.00, real=0.00 secs] 2010-07-28T12:06:57.268-0700: 2407.004: [GC 2407.004: [ParNew: 17580K->761K(19136K), 0.0016710 secs] 348154K->331343K(620416K) icms_dc=27 , 0.0017610 secs] [Times: user=0.00 sys=0.00, real=0.00 secs] 2010-07-28T12:06:57.288-0700: 2407.024: [GC 2407.088: [ParNew: 17564K->757K(19136K), 299.1513910 secs] 348081K->331283K(620416K) icms_dc=27 , 299.1515120 secs] [Times: user=0.17 sys=0.04, real=299.23 secs] 2010-07-28T12:11:56.558-0700: 2706.294: [GC 2706.294: [ParNew: 17735K->925K(19136K), 0.0094600 secs] 348197K->331458K(620416K) icms_dc=27 , 0.0095670 secs] [Times: user=0.01 sys=0.00, real=0.01 secs] 2010-07-28T12:11:56.606-0700: 2706.343: [GC 2706.343: [ParNew: 17940K->932K(19136K), 0.0085750 secs] 348473K->331474K(620416K) icms_dc=27 , 0.0086710 secs] [Times: user=0.01 sys=0.00, real=0.01 secs] --0016368e258f24a4d9048c8c6b3b--