Return-Path: X-Original-To: apmail-hive-dev-archive@www.apache.org Delivered-To: apmail-hive-dev-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 00484EF56 for ; Mon, 3 Dec 2012 23:22:46 +0000 (UTC) Received: (qmail 46114 invoked by uid 500); 3 Dec 2012 23:22:46 -0000 Delivered-To: apmail-hive-dev-archive@hive.apache.org Received: (qmail 45968 invoked by uid 500); 3 Dec 2012 23:22:46 -0000 Mailing-List: contact dev-help@hive.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@hive.apache.org Delivered-To: mailing list dev@hive.apache.org Received: (qmail 45827 invoked by uid 99); 3 Dec 2012 23:22:46 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 03 Dec 2012 23:22:46 +0000 X-ASF-Spam-Status: No, hits=-0.7 required=5.0 tests=RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of gates@hortonworks.com designates 209.85.220.48 as permitted sender) Received: from [209.85.220.48] (HELO mail-pa0-f48.google.com) (209.85.220.48) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 03 Dec 2012 23:22:39 +0000 Received: by mail-pa0-f48.google.com with SMTP id fa1so2198780pad.35 for ; Mon, 03 Dec 2012 15:22:19 -0800 (PST) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=20120113; h=subject:mime-version:content-type:from:in-reply-to:date:cc :content-transfer-encoding:message-id:references:to:x-mailer :x-gm-message-state; bh=hv2/UxpO46f8ilvDOMhtFPxX7X6OBe7s0Dsv3xF6VBo=; b=NeI2p+YQZJjlX7jmVaB845MjSNVOPVPq2vhUMnyW+HL0dbKuIkCwFB7Cr1im1CgkJV 0jm18VDRGBqCAtUziTcwb8TEvBeGTyc/GqV0mqGVyDQHuwAD6MxWzbmos6jUyxdXARTG GKT3sx/OpWAD89kxGHqAeb66ARvpWVnKR9dxZRuwb0c0TW+LFh7rRD/FRJ2J82dLw1rP CPhwYSgZIXDbEVGUlkKs9MnhNmrZxoqChPv/9sQ78zuvDuXGmluylULBIRdKu0QuaBL6 W3kgRw8PwuGfC3OMeO7o6outg3Xso1k3JOvb0uywtRtitLF8YHMKrn/foYa5t5xt6UOl +4sw== Received: by 10.68.138.198 with SMTP id qs6mr33545826pbb.151.1354576939355; Mon, 03 Dec 2012 15:22:19 -0800 (PST) Received: from [10.10.11.161] (host1.hortonworks.com. [70.35.59.2]) by mx.google.com with ESMTPS id c7sm8782324pay.10.2012.12.03.15.22.17 (version=TLSv1/SSLv3 cipher=OTHER); Mon, 03 Dec 2012 15:22:18 -0800 (PST) Subject: Re: [DISCUSS] HCatalog becoming a subproject of Hive Mime-Version: 1.0 (Apple Message framework v1085) Content-Type: text/plain; charset=iso-8859-1 From: Alan Gates In-Reply-To: <1E5BCEC2F299094383A3CE0A87BF7B66503FEDF2@SC-MBX02-3.TheFacebook.com> Date: Mon, 3 Dec 2012 15:22:16 -0800 Cc: "dev@hive.apache.org" , "hcatalog-dev@incubator.apache.org" Content-Transfer-Encoding: quoted-printable Message-Id: <3F43F45C-78E8-4978-9777-97E8616FBD54@hortonworks.com> References: <1E5BCEC2F299094383A3CE0A87BF7B66503FEDF2@SC-MBX02-3.TheFacebook.com> To: Namit Jain X-Mailer: Apple Mail (2.1085) X-Gm-Message-State: ALoCoQn3d80XgPK/JYdY7faL6Mo7iziqKzFjMuPyfu37LtttBreEkN7Qtgws0chZizY3JIiPVOXl X-Virus-Checked: Checked by ClamAV on apache.org I am not sure where we are on this discussion. So far those who have = chimed in seemed generally positive (Namit, Edward, Clark, Alexander). = Namit and I have different visions for what the committership might look = like, so I'd like to hear from other Hive PMC members what their view is = on this. I have to say from an HCatalog perspective the proposition is = much less attractive without some commit rights. On a related note, people should be aware of these threads in the = Incubator list: = http://mail-archives.apache.org/mod_mbox/incubator-general/201211.mbox/%3C= CAGU5spdWHNtJxgQ8f%3DnPEXx9xNLjyjOYaFfnSw4EyAjgm1c46w%40mail.gmail.com%3E = http://mail-archives.apache.org/mod_mbox/incubator-general/201211.mbox/%3C= CAKQbXgDZj_zMj4qSodXjMHV7xQZxpcY1-35cvq959YKLNd6tJQ%40mail.gmail.com%3E For those not inclined to read all the mails in the threads I will = summarize (though I urge all PMC members of Hive and PPMC members of = HCat to read both mail threads because this is highly relevant to what = we are discussing). There are two salient points in these threads: 1) It is not wise to build a subproject that is distinct from the main = project in the sense that it has separate community members interested = in it. Bertrand, Arun, Chris Mattman, and Greg Stein all spoke against = this, and all are long time Apache contributors with a lot of = experience. They were all of the opinion that it was reasonable for one = project to release separate products. 2) It is not wise to have committers that have access to parts of a = project but not others. Greg and Bertrand argued (and Arun seemed to = imply) that splitting up committer lists by sections of the code did not = work out well. These insights cause me to question what we mean by subproject. I had = originally envisioned something that looked like Pig and Hive did when = they were subprojects of Hadoop. But this violates both 1 and 2 above. = Given this input from many of the "wise old timers" of Apache I think we = should consider what we mean when we say subproject and how tightly we = are willing to integrate these projects. Personally I think it makes = sense to continue to pursue integration, as I think HCat is really a set = of interfaces on top of Hive and it makes sense to coalesce those into = one project. I guess this would mean HCat becomes just another set of = jars that Hive releases when it releases, rather than a stand alone = entity. But I'm curious to hear what others think. =20 Alan. On Nov 14, 2012, at 10:22 PM, Namit Jain wrote: > The same criteria should be applied to all Hive committers. Only a > committer should be able to commit code. > I don=B9t think we should bend this rule. Metastore is not a separate > project, but a integral part of hive. >=20 > -namit >=20 >=20 > On 11/12/12 10:32 PM, "Alan Gates" wrote: >=20 >> I would suggest looking over the patch history of HCat committers. I >> think most of them have already contributed a number of patches to = the >> metastore. All are certainly aware of how to run Hive unit tests and >> have an understanding of how Hive works. So I don't think it's fair = to >> say they would be unsafe with access to the metastore. And the Hive = PMC >> is there to assure this does not happen. If there are issues I am = sure >> they can deal with them. >>=20 >> Alan. >>=20 >>=20 >> On Nov 6, 2012, at 8:06 PM, Namit Jain wrote: >>=20 >>> Alan, that would not be a good idea. Metastore code is part of hive >>> code, >>> and it >>> would be safer if only Hive committers had commit access to that. >>>=20 >>>=20 >>> On 11/6/12 11:25 PM, "Alan Gates" wrote: >>>=20 >>>>=20 >>>> On Nov 4, 2012, at 8:35 PM, Namit Jain wrote: >>>>=20 >>>>> I like the idea of Hcatalog becoming a Hive sub-project. The >>>>> enhancements/bugs in the serde/metastore areas can indirectly >>>>> benefit the hive community, and it will be easier for the fix to = be in >>>>> one >>>>> place. Having said that, I don't see serde/metastore >>>>> moving out of hive into a separate component. Things are tied too >>>>> closely >>>>> together. I am assuming that no new committers would >>>>> be automatically added to Hive as part of this, and both Hive and >>>>> HCatalog >>>>> will continue to have its own committers. >>>>=20 >>>> One thing in this we'd like to discuss is the HCatalog committers >>>> having >>>> commit access to the metastore sections of Hive code. That doesn't >>>> mean >>>> it has to move into HCatalog's code base. But more and more the = fixes >>>> and changes we're doing in HCatalog are really in Hive's metastore. = So >>>> we believe it would make sense to give HCat committers access to = that >>>> component as well as HCat. >>>>=20 >>>> Alan. >>>>=20 >>>>>=20 >>>>> Thanks, >>>>> -namit >>>>>=20 >>>>>=20 >>>>> On 11/3/12 2:22 AM, "Alan Gates" wrote: >>>>>=20 >>>>>> Hello Hive community. It is time for HCatalog to graduate from = the >>>>>> Apache Incubator. Given the heavy dependence of HCatalog on Hive = the >>>>>> HCatalog community agreed it made sense to explore graduating = from >>>>>> the >>>>>> Incubator to become a subproject of Hive (see >>>>>>=20 >>>>>>=20 >>>>>> = http://mail-archives.apache.org/mod_mbox/incubator-hcatalog-user/20120 >>>>>> 9. >>>>>> mb >>>>>> ox/%3C08C40723-8D4D-48EB-942B-8EE4327DD84A%40hortonworks.com%3E = and >>>>>>=20 >>>>>>=20 >>>>>> = http://mail-archives.apache.org/mod_mbox/incubator-hcatalog-user/20121 >>>>>> 0. >>>>>> mb >>>>>>=20 >>>>>>=20 >>>>>> = ox/%3CCABN7xTCRM5wXGgJKEko0PmqDXhuAYpK%2BD-H57T29zcSGhkwGQw%40mail.gma >>>>>> il >>>>>> .c >>>>>> om%3E ). To help both communities understand what HCatalog is = and >>>>>> hopes >>>>>> to become we also developed a roadmap that summarizes HCatalog's >>>>>> current >>>>>> features, planned features, and other possible features under >>>>>> discussion: >>>>>> = https://cwiki.apache.org/confluence/display/HCATALOG/HCatalog+Roadmap >>>>>>=20 >>>>>> So we are now approaching you to see if there is agreement in the >>>>>> Hive >>>>>> community that HCatalog graduating into Hive would make sense. >>>>>>=20 >>>>>> Alan. >>>>>=20 >>>>=20 >>>=20 >>=20 >=20