Return-Path: X-Original-To: apmail-hadoop-common-dev-archive@www.apache.org Delivered-To: apmail-hadoop-common-dev-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id AD0F0D64A for ; Wed, 15 May 2013 23:18:30 +0000 (UTC) Received: (qmail 91579 invoked by uid 500); 15 May 2013 23:18:28 -0000 Delivered-To: apmail-hadoop-common-dev-archive@hadoop.apache.org Received: (qmail 91513 invoked by uid 500); 15 May 2013 23:18:28 -0000 Mailing-List: contact common-dev-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: common-dev@hadoop.apache.org Delivered-To: mailing list common-dev@hadoop.apache.org Received: (qmail 91504 invoked by uid 99); 15 May 2013 23:18:28 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 15 May 2013 23:18:28 +0000 X-ASF-Spam-Status: No, hits=0.0 required=5.0 tests=RCVD_IN_DNSWL_NONE X-Spam-Check-By: apache.org Received-SPF: error (nike.apache.org: local policy) Received: from [76.96.30.24] (HELO qmta02.emeryville.ca.mail.comcast.net) (76.96.30.24) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 15 May 2013 23:18:21 +0000 Received: from omta07.emeryville.ca.mail.comcast.net ([76.96.30.59]) by qmta02.emeryville.ca.mail.comcast.net with comcast id cNnf1l00A1GXsucA2PHfGD; Wed, 15 May 2013 23:17:39 +0000 Received: from boudnik.org ([24.4.185.157]) by omta07.emeryville.ca.mail.comcast.net with comcast id cPHe1l00d3QAh8g8UPHf2Z; Wed, 15 May 2013 23:17:39 +0000 Received: from mail.boudnik.org (localhost [127.0.0.1]) by boudnik.org (8.14.3/8.14.3/Debian-5+lenny1) with ESMTP id r4FNHcaD027626 for ; Wed, 15 May 2013 16:17:38 -0700 Received: (from cos@localhost) by mail.boudnik.org (8.14.3/8.14.3/Submit) id r4FNHbTp027625 for common-dev@hadoop.apache.org; Wed, 15 May 2013 16:17:37 -0700 X-Authentication-Warning: mail.boudnik.org: cos set sender to cos@apache.org using -f Date: Wed, 15 May 2013 16:17:37 -0700 From: Konstantin Boudnik To: common-dev@hadoop.apache.org Subject: Re: [VOTE] - Release 2.0.5-beta Message-ID: <20130515231737.GA27292@linspire.com> References: <48A037BF-5EE8-44EA-BE27-0BBB46DA0B5B@hortonworks.com> <548593E8-2791-4975-9710-D5D1EC51C516@apache.org> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Disposition: inline In-Reply-To: X-Organization: It's something of 'Cos User-Agent: Mutt/1.5.18 (2008-05-17) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=comcast.net; s=q20121106; t=1368659859; bh=2UmZXraAy2VNU6iEn6rtD+eTT9ovWxY5N1LoDRbkZY8=; h=Received:Received:Received:Received:Date:From:To:Subject: Message-ID:MIME-Version:Content-Type; b=MxW4eXmPIorGPxg5KFGNde2Le6Wp0YAM5rtc7pOCqDdeck/B9kiekGeRvb6HySSNt TxX0QFvwwSwg46jNFYIOnqARn766EThDE1FBk8YRT3PYsdwOdq2abQ/bJuCgDcQoJo 2eyzO2w6WHyRh0omxyKNPuFInyg1myA3rwMkSFMLlY20LzRIjdWpS3ADLzLYMxofwD VH6viN54RGcT6FqHCNyXlvn33bPP1HM2vZFh3EaqY3nOPmZnM8Fii/Ipqq9SbwTeRr 3a9chC3QhrdHZYgbAZyNdt53Fp2+B4UtnaUiaDaH6kkqwK/rgvel4RsGxDc1n9Tkc0 OLIlR+qqAmJaA== X-Virus-Checked: Checked by ClamAV on apache.org Indeed. I think the root of the issue is deeper. ASF software practices are great to deal with isolated, relatively contained projects like httpd, libreoffice, trac, etc. However, Hadoop based stack - essentially, software aimed at enterprises with bigger scale operations - is a different animal, that requires balancing of a huge number of moving parts and an unbroken flow of feedback up the stream. Anyone who have delivered any enterprise grade software system knows perfectly well how hard is that. However, in the environment where a release pushed out in the rush (essentially causing DOA issues downstream), these are got fixed in consequent releases. That ironically is likely to contain some other DOAs because an integration testing - and I mean real world integration system testing - is done by this small project, that is treated like a toy for adolescent kids. And there's no other real integration testing happening OPENLY on the full stack. Despite numerous claims, that is. Software comes with bugs - this is a somewhat expected phenomena. However, bug fixes shouldn't be mixed with new features, increasing entropy in the system. In other words, the development process should fan-in. A process with multiple consequent stable releases helps to achieve it; and compatibility issues would be addressed by working on the next major release. The model above leaves downstream with a choice of sticking to the 3.x or switching to 4.x and so on. Where's having permanent alpha tag is a convenient way to control software project that effectively became a vendor-controlled effort. And yes - this leads to fragmentation, makes no mistakes about it. Because no one can sit on the hands for a year and wait until a usable release with all great features will come about: lot of organizations just silently forking away to make their own environment suitable for production or sale; some of them might sporadically contribute something back. And of course - this is not the aim of Apache project to produce commercial grade platform. Cos On Wed, May 15, 2013 at 02:54PM, Roman Shaposhnik wrote: > On Wed, May 15, 2013 at 2:14 PM, Vinod Kumar Vavilapalli > wrote: > > Please list down all the issues that BigTop ran into *because of* new features. > > Whether the bug is *because of* new feature or not is a red herring > for my argument. Please lets drop this distinction. I never used it. > > > You continue to argue that new features are destabilizing 2.0.*, > > which I don't agree with at all. 2.0.3-alpha was the last time major > > features got merged in, and we found blockers irrespective of those. > > This is not my argument at all. I apologize if somehow I failed to > communicate it, but here's what my argument boils down to: > given *my* experience with Hadoop 2.0.x series and Bigtop > release every time I try a different release of Hadoop 2.0.x > I run into issues that scare me. They scare me because > they are so basic yet they make component like Sqoop > and Oozie (and I believe Giraph on one occasion) > pretty much DOA for YARN-base mapreduce implementation. > > In my mind, what that translates into is the fact that nobody > did *any* real testing of a particular downstream component > running on a given Hadoop 2.0.x release. Like I said -- > the issues so far make the components in question DOA. > Effectively the onion of issues remain unpeeled, so to speak. > > What I'm asking on this thread (and somehow nobody is willing > to give me a straight answer) is whether the Hadoop community > is willing to invest in peeling this onion of issues somewhat more > before declaring Hadoop 2.0.5 a beta release. > > Once again it is a binary question -- please give me an answer > of yes or no. > > > I am not arguing that new features *may* destabilize the branch, but you've repeatedly stated this as if that were a fact. > > Your list of issues is pretty complete (give or take a few that I didn't file > but Cos and others did). And I'd be the first one to agree that > it is not a large list of issues. What scares me is not its size, > but the fact how basic they are and how the block the *rest* > of the testing completely. > > To be extra clear -- what scares me about something like > MAPREDUCE-5240 is not whether it came as a result of > a merge or was sitting there since day one. What scares > me is that we've identified it last week and yet Sqoop 2 is > DOA in its presense. > > How many more issues like that one (regardless of how > they originated) are in branch-2? Wouldn't we want to > know before declaring Hadoop 2.0.5 beta? > > Now, knowing would require work -- that's what my > argument is all about. > > > Thanks, > Roman.