Return-Path: X-Original-To: apmail-hadoop-general-archive@minotaur.apache.org Delivered-To: apmail-hadoop-general-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id DD72ADD16 for ; Wed, 29 Aug 2012 14:15:15 +0000 (UTC) Received: (qmail 8851 invoked by uid 500); 29 Aug 2012 14:15:14 -0000 Delivered-To: apmail-hadoop-general-archive@hadoop.apache.org Received: (qmail 8736 invoked by uid 500); 29 Aug 2012 14:15:13 -0000 Mailing-List: contact general-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: general@hadoop.apache.org Delivered-To: mailing list general@hadoop.apache.org Received: (qmail 8723 invoked by uid 99); 29 Aug 2012 14:15:13 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 29 Aug 2012 14:15:13 +0000 X-ASF-Spam-Status: No, hits=-2.3 required=5.0 tests=RCVD_IN_DNSWL_MED,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: local policy) Received: from [128.149.139.109] (HELO mail.jpl.nasa.gov) (128.149.139.109) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 29 Aug 2012 14:15:05 +0000 Received: from mail.jpl.nasa.gov (ap-ehub-sp01.jpl.nasa.gov [128.149.137.148]) by smtp.jpl.nasa.gov (Sentrion-MTA-4.2.2/Sentrion-MTA-4.2.2) with ESMTP id q7TEEhf4025960 (using TLSv1/SSLv3 with cipher AES128-SHA (128 bits) verified NO) for ; Wed, 29 Aug 2012 07:14:43 -0700 Received: from AP-EMBX-SP20.RES.AD.JPL ([169.254.8.34]) by ap-ehub-sp01.RES.AD.JPL ([169.254.3.211]) with mapi id 14.02.0298.004; Wed, 29 Aug 2012 07:14:43 -0700 From: "Mattmann, Chris A (388J)" To: "" Subject: Re: [DISCUSS] Spin out MR, HDFS and YARN as their own TLPs and disband Hadoop umbrella project Thread-Topic: [DISCUSS] Spin out MR, HDFS and YARN as their own TLPs and disband Hadoop umbrella project Thread-Index: AQHNhY6+cKAg7NaI+EWRM79fMSTKeJdwnQgAgACvYAA= Date: Wed, 29 Aug 2012 14:14:42 +0000 Message-ID: <7F5019B7-CFB1-4A12-958E-F6745875F39D@jpl.nasa.gov> References: <0C112F45-3130-4093-B0F2-2D1F02B8C84C@jpl.nasa.gov> In-Reply-To: Accept-Language: en-US Content-Language: en-US X-MS-Has-Attach: X-MS-TNEF-Correlator: x-originating-ip: [128.149.137.114] Content-Type: text/plain; charset="us-ascii" Content-ID: Content-Transfer-Encoding: quoted-printable MIME-Version: 1.0 X-Source-Sender: chris.a.mattmann@jpl.nasa.gov X-AUTH: Authorized Hi Alejandro, On Aug 28, 2012, at 8:50 PM, Alejandro Abdelnur wrote: > Chris, thanks for initiating the discussion. No probs! >=20 > IMO a pre-requisite to this is to figure out how we'll handle the followi= ng: >=20 To be honest, I don't think any of the below are prereqs. They are technica= l issues that can be dealt with post facto of just SVN copy'ing hadoop as it= =20 stands today per my SVN commands into each of the new TLPs and then using that as a starting point for doing the below, as part of the natural = evolution of the project code.=20 That being said, if I had to guess what the TLPs would do to address the be= low once they are created: > * Where does common stuff lives? This usually happens over time and depending on how often things release,=20 and other things cited else-threads, and else-discussions over the past yea= rs in Hadoop. You guys clearly have a good handle on things like this.=20 I would just encourage the subsequent TLPs to not worry about doing everyth= ing perfectly and to realize that if you start out with the same code base, you= can selectively and then iteratively just make things more clean, refactored, and the answe= r to questions like this will happen naturally during that evolution. > * What are the public interfaces of each project (towards the other proje= cts)? This is something that each distinct community can answer once they are boo= tstrapped as TLPs. You can decide what portion of the code is really under charter an= d then work as a community to figure this out. Sorry I can't be more specific than that= . > * How do we do development/releases? In tandem? Separate? In tandem across communities never really works. Releases should occur sepa= rately, per community and TLP, on their own schedule. Code that depends on other projec= ts either has to wait for those communities/TLPs/projects to fix things, or add new f= eatures, or=20 whatever, or insulate, and keep the fixes locally in your project's SVN unt= il those fixes can be pushed upstream, and included in the other communities releases, etc= . Ask yourself this. If you guys have a dependency on e.g., Tomcat, and there= is some critical bug or new feature you want in Tomcat, how would you deal with that? I woul= d posit the same way that you could deal with this situation. Keep the fix to Tomcat locally= in your project;=20 work to get that fix upstream and included in some subsequent Tomcat releas= e, etc. > How this > will work in practice, currently we are constantly tweaking things > inter-projects, sometimes in the same JIRAs, sometimes in follow up > JIRAs. Technically you are doing that that, but community wise, it's not working o= ut, and hasn't really been working for years. I've been around Hadoop since its inception = (I was a Nutch committer before Hadoop existed), and though it's been hugely successful, a= nd really=20 awesome and super great (congrats, everyone, BTW!), the community issues ha= ve always cropped up b/c it's one big huge umbrella project and that doesn't work at = Apache. Cheers, Chris ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ Chris Mattmann, Ph.D. Senior Computer Scientist NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA Office: 171-266B, Mailstop: 171-246 Email: chris.a.mattmann@nasa.gov WWW: http://sunset.usc.edu/~mattmann/ ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ Adjunct Assistant Professor, Computer Science Department University of Southern California, Los Angeles, CA 90089 USA ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++