From: Todd Lipcon
Date: Wed, 9 May 2012 22:00:12 -0700
Subject: Re: [VOTE] Release hadoop-2.0.0-alpha
To: general@hadoop.apache.org

Hi Andrew,

Have you seen the new MiniMRClientCluster class? It's meant to be what
you describe - a minicluster which only exposes "external" APIs --
most importantly a way of getting at a JobClient to submit jobs. We
have it implemented in both 1.x and 2.x at this point, though I don't
recall if it's in the 1.0.x releases or if it's only slated for 1.1+.

-Todd

On Wed, May 9, 2012 at 6:05 PM, Andrew Purtell wrote:
> Hi Suresh,
>
> The unstable designation makes sense. As would one for MiniMRCluster.
>
> I was over the top initially out of surprise. I'm sure the MR minicluster seems a minor detail.
>
> Maybe it's worth thinking about the miniclusters differently? Please pardon if I am rehashing an old discussion.
>
> Things like MRUnit for applications and BigTop for full cluster tests can help, but, as mentioned in the annotation below, Pig, Hive, HBase, and other parts of the stack use miniclusters for local end to end testing in unit tests.
> As the complexity of the stack increases and we consider cross version support, unit tests on miniclusters I think will have no substitute.
>
> As Hadoop 2 has been evolving there has been some difficulty keeping up with minicluster changes. This makes sense. The attention to stability to client APIs and such, and the lack thereof to the minicluster, I think is self evident. But the need to fix up tests unpredictably introduces some friction that perhaps need not be there.
>
> Would a JIRA to discuss defining a subset of the minicluster interfaces as more stable be worthwhile?
>
> Best regards,
>
>    - Andy
>
>
> On May 9, 2012, at 1:45 PM, Suresh Srinivas wrote:
>
>> For this reason, in HDFS, we change MiniDFSCluster to LimitedPrivate and
>> not treat it as such:
>>
>> @InterfaceAudience.LimitedPrivate({"HBase", "HDFS", "Hive", "MapReduce",
>> "Pig"})
>> @InterfaceStability.Unstable
>> public class MiniDFSCluster { ...}
>>
>> On Wed, May 9, 2012 at 11:33 AM, Andrew Purtell wrote:
>>
>>> Sounds good Arun.
>>>
>>> How should we consider the suitability and stability of MiniMRCluster
>>> for downstream projects?
>>>
>>> On Wed, May 9, 2012 at 11:30 AM, Arun C Murthy wrote:
>>>> No worries Andy. I can spin an rc1 once we can pin-point the bug.
>>>>
>>>> thanks,
>>>> Arun
>>>>
>>>> On May 9, 2012, at 10:17 AM, Andrew Purtell wrote:
>>>>
>>>>> -1 (nonbinding), we are currently facing a minicluster semantic change
>>>>> of some kind, or more than one:
>>>>>
>>>>>   https://issues.apache.org/jira/browse/HBASE-5966
>>>>>
>>>>> There are other HBase JIRAs related to 2.0.0-alpha that we are working
>>>>> on, but I'd claim those are all our fault for breaking abstractions to
>>>>> solve issues. In one case there's a new helpful 2.x API
>>>>> (ShutdownHookManager, thank you!) that we can eventually move to.
>>>>>
>>>>> However, the minicluster changes are causing us some repeated
>>>>> discomfort.
>>>>> It will break, we'll get some help fixing up our tests for
>>>>> that, then some time later it will break again, repeat. Perhaps we
>>>>> have no right to complain, the minicluster isn't meant to be used by
>>>>> downstream projects. If so then please disregard the complaint, but
>>>>> your assistance in helping to fix the breakage again would be much
>>>>> appreciated. And, if so, perhaps we can discuss what makes sense in
>>>>> terms of a stable minicluster consumable for downstream projects?
>>>>>
>>>>> Best regards,
>>>>>
>>>>>   - Andy
>>>>>
>>>>> On Wed, May 9, 2012 at 9:58 AM, Arun C Murthy wrote:
>>>>>> I've created a release candidate for hadoop-2.0.0-alpha that I would like to release.
>>>>>>
>>>>>> It is available at: http://people.apache.org/~acmurthy/hadoop-2.0.0-alpha-rc0/
>>>>>>
>>>>>> The maven artifacts are available via repository.apache.org.
>>>>>>
>>>>>> Please try the release and vote; the vote will run for the usual 7 days.
>>>>>>
>>>>>> This is a big milestone for the Apache Hadoop community - congratulations and thanks for all the contributions!
>>>>>>
>>>>>> thanks,
>>>>>> Arun
>>>>>>
>>>>>> --
>>>>>> Arun C. Murthy
>>>>>> Hortonworks Inc.
>>>>>> http://hortonworks.com/
>>>>>
>>>>> --
>>>>> Best regards,
>>>>>
>>>>>   - Andy
>>>>>
>>>>> Problems worthy of attack prove their worth by hitting back. - Piet
>>>>> Hein (via Tom White)

--
Todd Lipcon
Software Engineer, Cloudera