From: "Liu, Raymond"
To: "user@hadoop.apache.org"
Subject: why my test result on dfs short circuit read is slower?
Date: Sat, 16 Feb 2013 03:10:14 +0000

Hi,

I tried to use short-circuit read to improve the MR scan performance of my HBase cluster.

I have the following settings in hdfs-site.xml:

dfs.client.read.shortcircuit set to true
dfs.block.local-path-access.user set to the MR job runner

The cluster is 1+4 nodes, and each data node has 16 CPUs and 4 HDDs. All HBase tables have been major compacted, so all data is local.

I had hoped that short-circuit read would improve performance. Instead, the test result shows that with short-circuit read enabled, performance actually dropped by 10-15%. For example, scanning a 50G table takes around 100s instead of 90s.

My Hadoop version is 1.1.1. Any idea on this?

Thx!

Best Regards,
Raymond Liu
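
For reference, the two settings described above would look roughly like this in hdfs-site.xml. This is only a sketch of the configuration as described, not the actual file from the cluster: the user name "mapred" is a hypothetical placeholder for whichever account runs the MR jobs.

```xml
<!-- Sketch of the hdfs-site.xml entries described in this message (Hadoop 1.1.1). -->
<configuration>
  <!-- Client-side switch for short-circuit local reads. -->
  <property>
    <name>dfs.client.read.shortcircuit</name>
    <value>true</value>
  </property>
  <!-- Account allowed to read block files directly.
       "mapred" is an assumed placeholder; use the real MR job runner account. -->
  <property>
    <name>dfs.block.local-path-access.user</name>
    <value>mapred</value>
  </property>
</configuration>
```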