Message-ID: <5532016.1216108771834.JavaMail.jira@brutus>
Date: Tue, 15 Jul 2008 00:59:31 -0700 (PDT)
From: "LN (JIRA)"
To: hbase-dev@hadoop.apache.org
Subject: [jira] Created: (HBASE-745) scaling of one regionserver, improving memory and cpu usage

scaling of one regionserver, improving memory and cpu usage
-----------------------------------------------------------

Key: HBASE-745
URL: https://issues.apache.org/jira/browse/HBASE-745
Project: Hadoop HBase
Issue Type: Improvement
Components: regionserver
Affects Versions: 0.1.3
Environment: hadoop 0.17.1
Reporter: LN
Priority: Minor

After weeks of testing HBase 0.1.3 with Hadoop (0.16.4 and 0.17.1), I found that a lot of work remains before a single regionserver can handle about 100 GB of data, or even more. I'd like to share my opinions here with stack and the other developers.

First, the easiest way to improve the scalability of a regionserver is to upgrade the hardware: use a 64-bit OS, give the regionserver process 8 GB of memory, and speed up disk I/O.

Besides hardware, the following are the software bottlenecks I found in the regionserver:

1. As the data grows, compaction eats more CPU (and I/O) time. The total compaction time is basically linear in the total data size, and even worse, sometimes quadratic in that size.
2. Memory and socket connection usage depend on the opened MapFiles; see HADOOP-2341 and HBASE-24.

I will explain the above in comments later.

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
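
To make the linear-versus-quadratic remark in point 1 concrete, here is a minimal sketch of a toy cost model. It is an assumption made for illustration only, not HBase's actual compaction policy: data arrives in equal-sized flushes, and we count the bytes a compaction has to rewrite. The class name, method names, and the 64 MB flush size are all hypothetical.

```java
public class CompactionCostSketch {

    // Toy model A (assumption): after every flush, all data stored so far
    // is merged into one file, so each merge rewrites everything on disk.
    // Total rewritten bytes = f * (1 + 2 + ... + n), which grows
    // quadratically with the total data size n * f.
    static long bytesRewrittenFullMerge(int flushes, long flushSize) {
        long total = 0;
        long stored = 0;
        for (int i = 0; i < flushes; i++) {
            stored += flushSize; // a new flush lands on disk
            total += stored;     // the full merge rewrites all stored bytes
        }
        return total;
    }

    // Toy model B (assumption): every byte is rewritten a fixed number of
    // times over its lifetime, so the total cost is linear in the data size.
    static long bytesRewrittenFixedPasses(int flushes, long flushSize, int passes) {
        return (long) flushes * flushSize * passes;
    }

    public static void main(String[] args) {
        long flushSize = 64L * 1024 * 1024; // hypothetical 64 MB per flush
        for (int flushes : new int[] {100, 200, 400}) {
            System.out.printf("flushes=%d fullMerge=%d fixedPasses=%d%n",
                    flushes,
                    bytesRewrittenFullMerge(flushes, flushSize),
                    bytesRewrittenFixedPasses(flushes, flushSize, 3));
        }
    }
}
```

In this model, doubling the number of flushes roughly quadruples the work under full merges but only doubles it when each byte is rewritten a fixed number of times. A real regionserver would fall somewhere between the two, depending on how often all files of a store are merged together.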