Date: Tue, 25 Mar 2014 11:07:14 +0000 (UTC)
From: "zhaojianbo (JIRA)"
To: dev@hbase.apache.org
Subject: [jira] [Created] (HBASE-10827) Making HBase use multiple ethernet cards will improve the performance

zhaojianbo created HBASE-10827:
----------------------------------

             Summary: Making HBase use multiple ethernet cards will improve the performance
                 Key: HBASE-10827
                 URL: https://issues.apache.org/jira/browse/HBASE-10827
             Project: HBase
          Issue Type: New Feature
    Affects Versions: 0.99.0
            Reporter: zhaojianbo


In our online cluster, a machine usually has multiple Ethernet cards, one for the outer network and one for the inner network. The current version of HBase cannot use all of them, which wastes the network bandwidth of one Ethernet card. If HBase used multiple Ethernet cards concurrently, its performance would improve.

So I did the work and tested a simple scenario: 8 clients scan the same region's data from a different machine that has two Ethernet cards (the regionserver machine also has two Ethernet cards).

The environment is:
* An HBase cluster with a master, a regionserver, and a ZooKeeper runs on one machine.
* An HDFS cluster with a NameNode, a DataNode, and a secondary NameNode also runs on the same machine.
* The 8 clients run on a different machine.
* All data is local.
* The data size is 22 GB.

I measured the performance before and after the optimization. The results are:
||clients||time before optimization||time after optimization||
| 8 | 1628.11s | 1242.45s |

The patch is uploaded. What I did is the following:
# Created a new RPC, getAllServerAddress, which obtains all the addresses of a regionserver.
# The client calls the RPC to obtain the addresses, chooses one of them randomly, validates it, and uses it as the regionLocation address.
# Added a cache, serverAddressMap, to avoid redundant RPCs.
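
The three steps listed in this message (fetch all addresses, pick one at random and validate it, cache the result) might look roughly like the sketch below. This is an illustration only, not the code in the uploaded patch. The class name MultiNicAddressSelector, the choose method, and the isUsable check are hypothetical; the getAllServerAddress RPC is modeled as a plain function, and serverAddressMap is modeled as a ConcurrentHashMap. Retrying the remaining addresses when validation fails, and falling back to the default address, are assumptions that the description does not confirm.

```java
import java.net.InetSocketAddress;
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.Random;
import java.util.concurrent.ConcurrentHashMap;
import java.util.function.Function;
import java.util.function.Predicate;

/** Hypothetical client-side helper; it does not reproduce the HBASE-10827 patch. */
final class MultiNicAddressSelector {
    // Cache named after serverAddressMap in the description: server name -> all known addresses.
    private final Map<String, List<InetSocketAddress>> serverAddressMap = new ConcurrentHashMap<>();
    // Stand-in for the getAllServerAddress RPC (assumed shape: server name in, addresses out).
    private final Function<String, List<InetSocketAddress>> getAllServerAddress;
    // Stand-in for address validation, e.g. "reachable from this client" (assumption).
    private final Predicate<InetSocketAddress> isUsable;
    private final Random random;

    MultiNicAddressSelector(Function<String, List<InetSocketAddress>> getAllServerAddress,
                            Predicate<InetSocketAddress> isUsable,
                            Random random) {
        this.getAllServerAddress = getAllServerAddress;
        this.isUsable = isUsable;
        this.random = random;
    }

    /** Returns a randomly chosen usable address, or defaultAddress if none validates. */
    InetSocketAddress choose(String serverName, InetSocketAddress defaultAddress) {
        // The RPC stand-in runs only on a cache miss, which avoids redundant calls.
        List<InetSocketAddress> all =
            serverAddressMap.computeIfAbsent(serverName, getAllServerAddress);
        if (all == null) {
            return defaultAddress;
        }
        // Work on a copy so the cached list is never modified.
        List<InetSocketAddress> candidates = new ArrayList<>(all);
        while (!candidates.isEmpty()) {
            InetSocketAddress picked = candidates.remove(random.nextInt(candidates.size()));
            if (isUsable.test(picked)) {
                return picked;
            }
        }
        return defaultAddress;
    }
}
```

A caller would construct one selector per client connection pool and pass the address it already knows as defaultAddress, so that behavior is unchanged when the regionserver reports only a single address.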