Return-Path: X-Original-To: apmail-hadoop-general-archive@minotaur.apache.org Delivered-To: apmail-hadoop-general-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id C9F5EDF75 for ; Sat, 2 Mar 2013 03:52:49 +0000 (UTC) Received: (qmail 16672 invoked by uid 500); 2 Mar 2013 03:52:46 -0000 Delivered-To: apmail-hadoop-general-archive@hadoop.apache.org Received: (qmail 16173 invoked by uid 500); 2 Mar 2013 03:52:44 -0000 Mailing-List: contact general-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: general@hadoop.apache.org Delivered-To: mailing list general@hadoop.apache.org Received: (qmail 16107 invoked by uid 99); 2 Mar 2013 03:52:41 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 02 Mar 2013 03:52:41 +0000 X-ASF-Spam-Status: No, hits=0.7 required=5.0 tests=RCVD_IN_DNSWL_NONE,SPF_NEUTRAL X-Spam-Check-By: apache.org Received-SPF: neutral (nike.apache.org: local policy) Received: from [76.96.27.243] (HELO qmta13.emeryville.ca.mail.comcast.net) (76.96.27.243) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 02 Mar 2013 03:52:32 +0000 Received: from omta22.emeryville.ca.mail.comcast.net ([76.96.30.89]) by qmta13.emeryville.ca.mail.comcast.net with comcast id 6SlT1l0031vN32cADTs8bi; Sat, 02 Mar 2013 03:52:08 +0000 Received: from boudnik.org ([24.4.185.157]) by omta22.emeryville.ca.mail.comcast.net with comcast id 6Ts71l00b3QAh8g8iTs87j; Sat, 02 Mar 2013 03:52:08 +0000 Received: from localhost (tpx.boudnik.org [192.168.102.148]) by boudnik.org (8.14.3/8.14.3/Debian-5+lenny1) with ESMTP id r223q7vQ024004 for ; Fri, 1 Mar 2013 19:52:07 -0800 Date: Fri, 1 Mar 2013 19:52:07 -0800 From: Konstantin Boudnik To: general@hadoop.apache.org Subject: Re: [DISCUSS] stabilizing Hadoop releases wrt. downstream Message-ID: <20130302035207.GH28322@tpx> Mail-Followup-To: general@hadoop.apache.org References: MIME-Version: 1.0 Content-Type: multipart/signed; micalg=pgp-sha256; protocol="application/pgp-signature"; boundary="xkXJwpr35CY/Lc3I" Content-Disposition: inline In-Reply-To: X-Organization: It's something of 'Cos X-PGP-Key: http://www.boudnik.org/~cos/pubkey.asc User-Agent: Mutt/1.5.21 (2010-09-15) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=comcast.net; s=q20121106; t=1362196328; bh=mTon5bw6BbNNkudbT61CO7oEJbudVe/sRs4MdZKBz7c=; h=Received:Received:Received:Date:From:To:Subject:Message-ID: MIME-Version:Content-Type; b=CrrVZBZiDUawH+oYlWyLldtzxmBz3mUvEDxOfPiLPlY1819Fpm5ozD01pDoEQMSwY /bXEPYfcQd2cpARMOzJD9G+FQOovj4ZstpbEJodDuqfB/7PEnhEKzRAzpunTh2dR+/ QdOjt/6kVUBS4UVMQKDwLsHkewFzk/jNXwy6xglVfNn1bymSGqFxg9nM4Vt6eOz8iP 8zcUTgPyiz1rUrtaeQqazVoJrXtAMYBtrMciBNJ0GQtXD6YRXJjzkpXeY2lHq7At6g FHLRKqDVsredyFm3F7JV3k0fDTJ9bv6cbsVuvqhSi7P3/DoOJIcYmpkt8eNSnOSPpD vdIFWCGokXeUw== X-Virus-Checked: Checked by ClamAV on apache.org --xkXJwpr35CY/Lc3I Content-Type: text/plain; charset=utf-8 Content-Disposition: inline Content-Transfer-Encoding: quoted-printable Luke, would you mind expanding the answer a little bit? I am, for once, very curi= ous how test-patch process would catch something like HADOOP-9299 ? Cos On Wed, Feb 27, 2013 at 11:03AM, Luke Lu wrote: > Sympathize with the sentiment... >=20 > > The good news, is that Bigtop's charter is in big part *exactly* about > providing you with this kind of feedback. > > What we can NOT do is submit patches for all the issues. >=20 > What you can do is to improve and maintain our test-patch process to do > sanity compatibility checks for downstream projects (that you care about), > so most breakage is caught before the patches are committed. >=20 > I'd be happy to help reviewing the patches to improve our test patch > process. >=20 > __Luke >=20 >=20 > On Tue, Feb 26, 2013 at 5:43 PM, Roman Shaposhnik w= rote: >=20 > > Hi! > > > > for the past couple of releases of Hadoop 2.X code line the issue > > of integration between Hadoop and its downstream projects has > > become quite a thorny issue. The poster child here is Oozie, where > > every release of Hadoop 2.X seems to be breaking the compatibility > > in various unpredictable ways. At times other components (such > > as HBase for example) also seem to be affected. > > > > Now, to be extremely clear -- I'm NOT talking about the *latest* version > > of Oozie working with the *latest* version of Hadoop, instead > > my observations come from running previous *stable* releases > > of Bigtop on top of Hadoop 2.X RCs. > > > > As many of you know Apache Bigtop aims at providing a single > > platform for integration of Hadoop and Hadoop ecosystem projects. > > As such we're uniquely positioned to track compatibility between > > different Hadoop releases with regards to the downstream components > > (things like Oozie, Pig, Hive, Mahout, etc.). Every single single RC > > we've been pretty diligent at trying to provide integration-level feedb= ack > > on the quality of the upcoming release, but it seems that our efforts > > don't quite suffice in Hadoop 2.X stabilizing. > > > > Of course, one could argue that while Hadoop 2.X code line was > > designated 'alpha' expecting much in the way of perfect integration > > and compatibility was NOT what the Hadoop community was > > focusing on. I can appreciate that view, but what I'm interested in > > is the future of Hadoop 2.X not its past. Hence, here's my question > > to all of you as a Hadoop community at large: > > > > Do you guys think that the project have reached a point where integrati= on > > and compatibility issues should be prioritized really high on the list > > of things that make or break each future release? > > > > The good news, is that Bigtop's charter is in big part *exactly* about > > providing you with this kind of feedback. We can easily tell you when > > Hadoop behavior, with regard to downstream components, changes > > between a previous stable release and the new RC (or even branch/trunk). > > What we can NOT do is submit patches for all the issues. We are simply > > too small a project and we need your help with that. > > > > I truly believe that we owe it to the downstream projects, and in the > > second half of this email I will try to convince you of that. > > > > We all know that integration projects are impossible to pull off > > unless there's a general consensus between all of the projects involved > > that they indeed need to work with each other. You can NOT force > > that notion, but you can always try to influence. This relationship > > goes both ways. > > > > Consider a question in front of the downstream communities > > of whether or not to adopt Hadoop 2.X as the basis. To answer > > that question each downstream project has to be reasonably > > sure that their concerns will NOT fall on deaf ears and that > > Hadoop developers are, essentially, 'ready' for them to pick > > up Hadoop 2.X. I would argue that so far the Hadoop community > > had gone out of its way to signal that 2.X codeline is NOT > > ready for the downstream. > > > > I would argue that moving forward this is a really unfortunate > > situation that may end up undermining the long term success > > of Hadoop 2.X if we don't start addressing the problem. Think > > about it -- 90% of unit tests that run downstream on Apache > > infrastructure are still exercising Hadoop 1.X underneath. > > In fact, if you were to forcefully make, lets say, HBase's > > unit tests run on top of Hadoop 2.X quite a few of them > > are going to fail. Hadoop community is, in effect, cutting > > itself off from the biggest source of feedback -- its downstream > > users. This in turn: > > > > * leaves Hadoop project in a perpetual state of broken > > windows syndrome. > > > > * leaves Apache Hadoop 2.X releases in a state considerably > > inferior to the releases *including* Apache Hadoop done by the > > vendors. The users have no choice but to alight themselves > > with vendor offerings if they wish to utilize latest Hadoop > > functionality. > > The artifact that is know as Apache Hadoop 2.X stopped being > > a viable choice thus fracturing the user community and reducing > > the benefits of a commonly deployed codebase. > > > > * leaves downstream projects of Hadoop in a jaded state where > > they legitimately get very discouraged and frustrated and eventual= ly > > give up thinking that -- well, we work with one release of Hadoop > > (the stable one Hadoop 1.X) and we shall wait for the Hadoop > > community to get their act together. > > > > In my view (shared by quite a few members of the Apache Bigtop) we > > can definitely do better than this if we all agree that the proposed > > first 'beta' release of Hadoop 2.0.4 is the right time for it to happen. > > > > It is about time Hadoop 2.X community wins back all those end users > > and downstream projects that got left behind during the alpha > > stabilization phase. > > > > Thanks, > > Roman. > > --xkXJwpr35CY/Lc3I Content-Type: application/pgp-signature; name="signature.asc" Content-Description: Digital signature -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.11 (GNU/Linux) iF4EAREIAAYFAlExd2cACgkQenyFlstYjhICOQEA16mNuZCj8PK0qB/5e07luGv8 33LpXHylDmarr7S0PdoA/jRcMOLgcRd27p6I7jeQzFJhx1A+QEEJouZ8N0sPbo1c =3Bgh -----END PGP SIGNATURE----- --xkXJwpr35CY/Lc3I--