Return-Path: X-Original-To: apmail-hadoop-hdfs-dev-archive@minotaur.apache.org Delivered-To: apmail-hadoop-hdfs-dev-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id A8522929B for ; Fri, 24 Feb 2012 02:09:08 +0000 (UTC) Received: (qmail 51728 invoked by uid 500); 24 Feb 2012 02:09:08 -0000 Delivered-To: apmail-hadoop-hdfs-dev-archive@hadoop.apache.org Received: (qmail 51645 invoked by uid 500); 24 Feb 2012 02:09:08 -0000 Mailing-List: contact hdfs-dev-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: hdfs-dev@hadoop.apache.org Delivered-To: mailing list hdfs-dev@hadoop.apache.org Received: (qmail 51637 invoked by uid 99); 24 Feb 2012 02:09:08 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 24 Feb 2012 02:09:08 +0000 X-ASF-Spam-Status: No, hits=-0.7 required=5.0 tests=RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of todd@cloudera.com designates 209.85.215.48 as permitted sender) Received: from [209.85.215.48] (HELO mail-lpp01m010-f48.google.com) (209.85.215.48) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 24 Feb 2012 02:09:03 +0000 Received: by lagw12 with SMTP id w12so2079427lag.35 for ; Thu, 23 Feb 2012 18:08:42 -0800 (PST) Received-SPF: pass (google.com: domain of todd@cloudera.com designates 10.112.42.72 as permitted sender) client-ip=10.112.42.72; Authentication-Results: mr.google.com; spf=pass (google.com: domain of todd@cloudera.com designates 10.112.42.72 as permitted sender) smtp.mail=todd@cloudera.com Received: from mr.google.com ([10.112.42.72]) by 10.112.42.72 with SMTP id m8mr124596lbl.10.1330049322208 (num_hops = 1); Thu, 23 Feb 2012 18:08:42 -0800 (PST) Received: by 10.112.42.72 with SMTP id m8mr103965lbl.10.1330049322111; Thu, 23 Feb 2012 18:08:42 -0800 (PST) MIME-Version: 1.0 Received: by 10.112.2.130 with HTTP; Thu, 23 Feb 2012 18:08:22 -0800 (PST) In-Reply-To: References: From: Todd Lipcon Date: Thu, 23 Feb 2012 18:08:22 -0800 Message-ID: Subject: Re: Merging the HA branch to trunk - Wednesday, February 29th To: hdfs-dev@hadoop.apache.org Content-Type: text/plain; charset=ISO-8859-1 X-Gm-Message-State: ALoCoQnZtM56hFGtFTV4k4QGSX6/cwXUPczwefdNeAuWN22F1O8TwVLaOFYpHHX8UIuY9eoaD16w Hi folks, I had the chance today to run some performance benchmarks on one of Cloudera's 100 node test clusters. I've posted the results to the JIRA, but the summary is that I think we need to do a bit more optimization of the persistBlocks edit log entries before we merge. That said, I think there are a couple easy wins that should bring performance back in line with trunk, which I expect to complete by early next week. Please continue reviewing the branch so that when the optimizations have been made, we can proceed with a merge. -Todd On Thu, Feb 23, 2012 at 10:27 AM, Todd Lipcon wrote: > On Thu, Feb 23, 2012 at 10:23 AM, Suresh Srinivas > wrote: >> I am not sure any of these issues are serious show stoppers for merging >> into trunk. >> Why not merge into trunk and fix some of these issues? >> >> The reason is, merging is non trivial with two branches changing >> independently. Given that >> Jitendra has posted a merge patch, why not do it earlier? Do we need heads >> up of a week. >> If merging must wait, should we consider creating a merge branch and >> committing the patch >> Jitendra has. This makes other merges more manageable. > > We already have a merge branch - the patch is easy to generate since > we have been merging in the trunk->HA direction daily since its > inception. > > I'm all for merging to trunk earlier if everyone's cool with it, but > we do need to start a vote. Shall I call one? > > -Todd > >> On Wed, Feb 22, 2012 at 6:24 PM, Aaron T. Myers wrote: >> >>> Hello HDFS devs, >>> >>> Work has largely stabilized on the HA-branch in the last few weeks. At this >>> point the HA NN project is nearly feature-complete for manual failover. >>> We've been running the full test suite nightly, and all automated tests >>> have been passing, except for one known test failure which should be fixed >>> shortly. >>> >>> I'd like to begin the process of merging this branch back to HDFS trunk. >>> There are still several outstanding sub-JIRAs under the HDFS-1623 and >>> HADOOP-7454 umbrella JIRAs, but most of these are either nice-to-haves or >>> relate to supporting automatic failover. Once the branch is merged to >>> trunk, work on these JIRAs can continue there. >>> >>> I've identified the following JIRAs which I think should be the only >>> remaining blockers for merging to trunk: >>> >>> HDFS-2904 - Client support for getting delegation tokens in an HA cluster >>> HDFS-2920 - Fix remaining TODOs in the code from HA. Mostly little cleanup >>> stuff. >>> HDFS-2958 - Sweep for remaining proxy construction which doesn't go through >>> failover path >>> HDFS-2979 - Balancer should use logical URI for creating failover proxy >>> (will fix the only current test failure) >>> >>> All of these JIRAs should be fixed in the next few days. >>> >>> I propose that, unless more blocker issues are discovered in the interim, >>> we merge this branch to trunk one week from today, i.e. Wednesday, February >>> 29th. During this time we will also execute the test plans described in the >>> test documents attached to HDFS-1623 to try to identify any regressions or >>> performance issues in the branch. If you plan to review the code changes or >>> the test plan, I ask that you please do so as soon as possible. >>> >>> Feedback is certainly welcome on this plan. >>> >>> Thanks a lot, >>> Aaron >>> >>> -- >>> Aaron T. Myers >>> Software Engineer, Cloudera >>> > > > > -- > Todd Lipcon > Software Engineer, Cloudera -- Todd Lipcon Software Engineer, Cloudera