Return-Path: X-Original-To: apmail-hadoop-hdfs-dev-archive@minotaur.apache.org Delivered-To: apmail-hadoop-hdfs-dev-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 2AD1ED3E9 for ; Tue, 26 Feb 2013 21:48:15 +0000 (UTC) Received: (qmail 3090 invoked by uid 500); 26 Feb 2013 21:48:14 -0000 Delivered-To: apmail-hadoop-hdfs-dev-archive@hadoop.apache.org Received: (qmail 3025 invoked by uid 500); 26 Feb 2013 21:48:14 -0000 Mailing-List: contact hdfs-dev-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: hdfs-dev@hadoop.apache.org Delivered-To: mailing list hdfs-dev@hadoop.apache.org Received: (qmail 3016 invoked by uid 99); 26 Feb 2013 21:48:14 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 26 Feb 2013 21:48:14 +0000 X-ASF-Spam-Status: No, hits=-0.7 required=5.0 tests=RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of bikas@hortonworks.com designates 209.85.219.53 as permitted sender) Received: from [209.85.219.53] (HELO mail-oa0-f53.google.com) (209.85.219.53) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 26 Feb 2013 21:48:07 +0000 Received: by mail-oa0-f53.google.com with SMTP id m1so6231504oag.40 for ; Tue, 26 Feb 2013 13:47:47 -0800 (PST) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=20120113; h=x-received:from:references:in-reply-to:mime-version:x-mailer :thread-index:date:message-id:subject:to:content-type :x-gm-message-state; bh=i4j5OER51oD+F88LyPc73uLKVWYW3p+nL5uPPbvlXUg=; b=B9kLYnchPtruuN7phjvXToaE5ljc0D1ybCDeXGOdR4r+UhR9n7r3L+ON1Y4vT4unJL 4cwx1AC7XfeQWt21XLq49OXdPKjrGKB+xK/e5Lk4RJs17rX7mSJXv1UULYZthWJ7hqX1 g20bXqti0SBlWFhMJShQutqju6s/egMt3qmHMuvLZRzna+WhftS1sE+P48CnvKx9euMj vwNjsJiRcDvD6Mz7hKjrg5jCfshDUl8PwWALllmMT8PWtwJ3Tv11yRYJKha6sNFqqAMf 7f1TdB44H6npLhRFvnyYDc/a+vV3T99qIrp4Lj8w28M+A6/yO044ibpeZ+Mmo5vU2Jmv hvcQ== X-Received: by 10.182.107.66 with SMTP id ha2mr13281923obb.43.1361915267032; Tue, 26 Feb 2013 13:47:47 -0800 (PST) From: Bikas Saha References: In-Reply-To: MIME-Version: 1.0 X-Mailer: Microsoft Outlook 14.0 Thread-Index: AQGu4Uy1la7aFCUEKpF5Z8jlJObYCJjLDXNw Date: Tue, 26 Feb 2013 13:47:09 -0800 Message-ID: Subject: RE: VOTE: HDFS-347 merge To: hdfs-dev@hadoop.apache.org Content-Type: text/plain; charset=ISO-8859-1 X-Gm-Message-State: ALoCoQmbd6b9+MwNGsRyeav3eCf51Ruz1SlDujTbvB8Ciho+28Fov2RJdM05pO9HtIGsv8/lyHFk X-Virus-Checked: Checked by ClamAV on apache.org Hi, In my opinion, this feature of short circuit reads (HDFS-347 or HDFS-2246) is not a desirable feature for HDFS. We should be working towards removing this feature instead of enhancing it and making it popular. Maybe short-circuit reads were something that HBase needed for performance at a point in time when HDFS performance was slow. But after all the improvements that have been made, is it still unacceptably slow to read from HDFS? Is there more good engineering that we can do to close that gap? Close it for all HDFS users and not just the ones who use short-circuit reads? Which brings me to the question - Who is the target audience for this feature? From what I see, anyone who potentially wants to use it == everyone. Now if everyone starts using short circuit reads what happens to the performance problem that we are trying to solve? Will performance still be better then? This is especially important in the context of YARN where we don't control the apps that run on the shared grid. What problem are we trying to solve here? If we want better HDFS performance and QOS for services then we want to give as much control over the disk to HDFS rather than take it away. Short circuit reads leave a gaping hole towards that end and making short circuit reads better and easier to use makes that hole larger. I am sorry for replying late and also because my response might be missing historical perspectives that I am not aware of. Bikas -----Original Message----- From: rarecactus@gmail.com [mailto:rarecactus@gmail.com] On Behalf Of Colin McCabe Sent: Sunday, February 17, 2013 1:49 PM To: hdfs-dev@hadoop.apache.org Subject: VOTE: HDFS-347 merge Hi all, I would like to merge the HDFS-347 branch back to trunk. It's been under intensive review and testing for several months. The branch adds a lot of new unit tests, and passes Jenkins as of 2/15 [1] We have tested HDFS-347 with both random and sequential workloads. The short-circuit case is substantially faster [2], and overall performance looks very good. This is especially encouraging given that the initial goal of this work was to make security compatible with short-circuit local reads, rather than to optimize the short-circuit code path. We've also stress-tested HDFS-347 on a number of clusters. This iniial VOTE is to merge only into trunk. Just as we have done with our other recent merges, we will consider merging into branch-2 after the code has been in trunk for few weeks. Please cast your vote by EOD Sunday 2/24. best, Colin McCabe [1] https://issues.apache.org/jira/browse/HDFS-347?focusedCommentId=13579704&p age=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comme nt-13579704 [2] https://issues.apache.org/jira/browse/HDFS-347?focusedCommentId=13551755&p age=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comme nt-13551755