Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id 3B2BA200B9A for ; Fri, 23 Sep 2016 05:16:18 +0200 (CEST) Received: by cust-asf.ponee.io (Postfix) id 39B0C160AE0; Fri, 23 Sep 2016 03:16:18 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id 4BD89160AAD for ; Fri, 23 Sep 2016 05:16:17 +0200 (CEST) Received: (qmail 78118 invoked by uid 500); 23 Sep 2016 03:16:16 -0000 Mailing-List: contact dev-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@hbase.apache.org Delivered-To: mailing list dev@hbase.apache.org Received: (qmail 78106 invoked by uid 99); 23 Sep 2016 03:16:15 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd4-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 23 Sep 2016 03:16:15 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd4-us-west.apache.org (ASF Mail Server at spamd4-us-west.apache.org) with ESMTP id 2DC18C03BC for ; Fri, 23 Sep 2016 03:16:15 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd4-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 1.88 X-Spam-Level: * X-Spam-Status: No, score=1.88 tagged_above=-999 required=6.31 tests=[DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, HTML_MESSAGE=2, MIME_QP_LONG_LINE=0.001, RCVD_IN_DNSWL_NONE=-0.0001, RCVD_IN_MSPIKE_H3=-0.01, RCVD_IN_MSPIKE_WL=-0.01, SPF_PASS=-0.001] autolearn=disabled Authentication-Results: spamd4-us-west.apache.org (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com Received: from mx2-lw-us.apache.org ([10.40.0.8]) by localhost (spamd4-us-west.apache.org [10.40.0.11]) (amavisd-new, port 10024) with ESMTP id niwzYECJ7jlL for ; Fri, 23 Sep 2016 03:16:13 +0000 (UTC) Received: from mail-pa0-f49.google.com (mail-pa0-f49.google.com [209.85.220.49]) by mx2-lw-us.apache.org (ASF Mail Server at mx2-lw-us.apache.org) with ESMTPS id BD8785FBDB for ; Fri, 23 Sep 2016 03:16:12 +0000 (UTC) Received: by mail-pa0-f49.google.com with SMTP id oz2so35418379pac.2 for ; Thu, 22 Sep 2016 20:16:12 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=from:content-transfer-encoding:mime-version:date:subject:message-id :references:in-reply-to:to; bh=bRLB+twxzw49w+0Nx/apYu2lFZCoRRBgnKVCA1nzmCE=; b=p6h6qRyf3bb8Lld2/etRxfe/tNx8ZRU0Gi+wbHifGE+wV2uQvFim9a4MU/zDLNtCAE UGKzydhFVuMwYJw+5MPUyHAOO1AyjLNvUAYLzCMYjFU6r+qyG7IPpArBenOyZJmIDL3x blNSXmTLghFVAlt0D/vTm9v0LWBiwwD+HF8h+MGFiWyUiYe+awL3Q81yp+RRcrpl36LA fcvg6SoauK+HXPGChmcv0qgZ/NnPse8cGe4zn9Oy8TGVrSIkKpv8JRkAh0AAjBdmt0+u 7jVLyFvzWphb7VOJS2IxYB+73S1/bf4oOKewP9KWrdzsnuz5qfV8Q/+Rn4r8V3pvScj5 BUEA== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20130820; h=x-gm-message-state:from:content-transfer-encoding:mime-version:date :subject:message-id:references:in-reply-to:to; bh=bRLB+twxzw49w+0Nx/apYu2lFZCoRRBgnKVCA1nzmCE=; b=FfLEkXuE8R7Sjo5nBMOzBFRKv1Fc2RmNb0mLcUpE98BCSF+OcBTEFjffz8GecArsaV L3eae1cb3jj+13FpaU18xB2sNyfbUj4JeenKWILdT/xtFt0zjku7FmjSPgOF22634PCU laT45aiUuP17TlGHPBUS7fm6C77NaA3zKHRlPpeqHfp1LpJo8znxXamnaPyjoMIbjVQ1 l7kpLtNepKmDigXokragDYEcFU4tnWjHJqCBWUkeCPmJ4rzkihPsyJJIUbLNRil1+G90 9qgseJ9JmgZdsBUUXUvpypzEAK0th/SkHzeySW2JgleHcmLa3Mysn40M6EGz6eieEJwM dUSw== X-Gm-Message-State: AE9vXwNL8XVUmIW9FwvgvDHpmWDZb6RMZ0uw6Hm9VfV1+dbTScojTl2SD7kWwARV8vwp0A== X-Received: by 10.66.89.72 with SMTP id bm8mr8823468pab.1.1474600571353; Thu, 22 Sep 2016 20:16:11 -0700 (PDT) Received: from ?IPv6:2605:e000:23c3:7100:95f0:fe51:9bbc:180f? ([2605:e000:23c3:7100:95f0:fe51:9bbc:180f]) by smtp.gmail.com with ESMTPSA id cu2sm7066165pad.18.2016.09.22.20.16.09 for (version=TLS1 cipher=ECDHE-RSA-AES128-SHA bits=128/128); Thu, 22 Sep 2016 20:16:10 -0700 (PDT) From: Andrew Purtell Content-Type: multipart/alternative; boundary=Apple-Mail-3B869125-C3D8-492B-87EB-69C777B62080 Content-Transfer-Encoding: 7bit Mime-Version: 1.0 (1.0) Date: Thu, 22 Sep 2016 20:16:08 -0700 Subject: Re: [DISCUSSION] MR jobs started by Master or RS Message-Id: References: <0D1ED081-8BBB-4486-87E3-8F4E3BDFC638@gmail.com> <1474580970259.54191@hortonworks.com> <54FE16D7-96FB-4853-8864-1008DD61320D@gmail.com> In-Reply-To: To: dev@hbase.apache.org X-Mailer: iPhone Mail (14A403) archived-at: Fri, 23 Sep 2016 03:16:18 -0000 --Apple-Mail-3B869125-C3D8-492B-87EB-69C777B62080 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable Agreed, this would be interesting to contemplate.=20 On Sep 22, 2016, at 8:03 PM, Vladimir Rodionov wrot= e: >>> No, never. >=20 > No need for M/R here, just a simple compaction-server colocated with RS on= > a same node. > You save a lot on GC in RS. Ideally, it can be IO "nice" in Linux (by > setting IO priority). But offtopic, of course :) >=20 > -Vlad >=20 > On Thu, Sep 22, 2016 at 7:57 PM, Vladimir Rodionov > wrote: >=20 >>>> And if MR not deployed, Backup/Restore feature could not be used, >> right? >>=20 >> Yes. >>=20 >> On Thu, Sep 22, 2016 at 7:53 PM, Heng Chen >> wrote: >>=20 >>> {quote} >>> If MR framework is not deployed in the cluster, hbase still functions >>> normally (post merge). >>> {quote} >>>=20 >>> If MR is not strong dependency for Master/RS, it is OK for me. >>> And if MR not deployed, Backup/Restore feature could not be used, right= ? >>>=20 >>> 2016-09-23 10:49 GMT+08:00 Ted Yu : >>>> If MR framework is not deployed in the cluster, hbase still functions >>>> normally (post merge). >>>>=20 >>>> In terms of build time dependency, we have long been depending on >>>> mapreduce. Take a look at ExportSnapshot. >>>>=20 >>>> Cheers >>>>=20 >>>> On Thu, Sep 22, 2016 at 7:42 PM, Heng Chen >>> wrote: >>>>=20 >>>>> In our production cluster, it is a common case we just have HDFS and >>>>> HBase deployed. >>>>> If our Master/RS depend on MR framework (especially some features we >>>>> have not used at all), it introduced another cost for maintain. I >>>>> don't think it is a good idea. >>>>>=20 >>>>> 2016-09-23 10:28 GMT+08:00 =E5=BC=A0=E9=93=8E := >>>>>> To be specific, for example, our nice Backup/Restore feature, if we >>> think >>>>>> this is not a core feature of HBase, then we could make it depend on >>> MR, >>>>>> and start a standalone BackupManager instance that submits MR jobs >>> to do >>>>>> periodical maintenance job. And if we think this is a core feature >>> that >>>>>> everyone should use it, then we'd better implement it without MR >>>>>> dependency, like DLS. >>>>>>=20 >>>>>> Thanks. >>>>>>=20 >>>>>> 2016-09-23 10:11 GMT+08:00 =E5=BC=A0=E9=93=8E = : >>>>>>=20 >>>>>>> I=E2=80=98m -1 on let master or rs launch MR jobs. It is OK that som= e of our >>>>>>> features depend on MR but I think the bottom line is that we should >>>>> launch >>>>>>> the jobs from outside manually or by other services. >>>>>>>=20 >>>>>>> 2016-09-23 9:47 GMT+08:00 Andrew Purtell >>> : >>>>>>>=20 >>>>>>>> Ok, got it. Well "shelling out" is on the line I think, so a fair >>>>>>>> question. >>>>>>>>=20 >>>>>>>> Can this be driven by a utility derived from Tool like our other MR= >>>>> apps? >>>>>>>> The issue is needing the AccessController to decide if allowed? But= >>>>> nothing >>>>>>>> prevents the user from running the job manually/independently, >>> right? >>>>>>>>=20 >>>>>>>>> On Sep 22, 2016, at 3:44 PM, Matteo Bertozzi < >>>>> theo.bertozzi@gmail.com> >>>>>>>> wrote: >>>>>>>>>=20 >>>>>>>>> just a remark. my query was not about tools using MR (everyone i >>>>> think >>>>>>>> is >>>>>>>>> ok with those). >>>>>>>>> the topic was about: "are we ok with running MR jobs from Master >>> and >>>>> RSs >>>>>>>>> code?" since this will be the first time we do this >>>>>>>>>=20 >>>>>>>>> Matteo >>>>>>>>>=20 >>>>>>>>>=20 >>>>>>>>>> On Thu, Sep 22, 2016 at 2:49 PM, Devaraj Das < >>> ddas@hortonworks.com> >>>>>>>> wrote: >>>>>>>>>>=20 >>>>>>>>>> Very much agree; for tools like ExportSnapshot / Backup / >>> Restore, >>>>> it's >>>>>>>>>> fine to be dependent on MR. MR is the right framework for such. >>> We >>>>>>>> should >>>>>>>>>> also do compactions using MR (just saying :) ) >>>>>>>>>> ________________________________________ >>>>>>>>>> From: Ted Yu >>>>>>>>>> Sent: Thursday, September 22, 2016 2:00 PM >>>>>>>>>> To: dev@hbase.apache.org >>>>>>>>>> Subject: Re: [DISCUSSION] MR jobs started by Master or RS >>>>>>>>>>=20 >>>>>>>>>> I agree - backup / restore is in the same category as import / >>>>> export. >>>>>>>>>>=20 >>>>>>>>>> On Thu, Sep 22, 2016 at 1:58 PM, Andrew Purtell < >>>>>>>> andrew.purtell@gmail.com> >>>>>>>>>> wrote: >>>>>>>>>>=20 >>>>>>>>>>> Backup is extra tooling around core in my opinion. Like import >>> or >>>>>>>> export. >>>>>>>>>>> Or the optional MOB tool. It's fine. >>>>>>>>>>>=20 >>>>>>>>>>>> On Sep 22, 2016, at 1:50 PM, Matteo Bertozzi < >>>>> mbertozzi@apache.org> >>>>>>>>>>> wrote: >>>>>>>>>>>>=20 >>>>>>>>>>>> What's the latest opinion around running MR jobs from hbase >>>>> (Master >>>>>>>> or >>>>>>>>>>> RS)? >>>>>>>>>>>>=20 >>>>>>>>>>>> I remember in the past that there was discussion about not >>> having >>>>> MR >>>>>>>>>> has >>>>>>>>>>>> direct dependency of hbase. >>>>>>>>>>>>=20 >>>>>>>>>>>> I think some of discussion where around MOB that had a MR job >>> to >>>>>>>>>> compact, >>>>>>>>>>>> that later was transformed in a non-MR job to be merged, I >>> think >>>>> we >>>>>>>>>> had a >>>>>>>>>>>> similar discussion for log split/replay. >>>>>>>>>>>>=20 >>>>>>>>>>>> the latest is the new Backup feature (HBASE-7912), that runs >>> a MR >>>>> job >>>>>>>>>>> from >>>>>>>>>>>> the master to copy data or restore data. >>>>>>>>>>>> (backup is also "not really core" as in.. if you don't use >>> backup >>>>>>>>>> you'll >>>>>>>>>>>> not end up running MR jobs, but this was probably true for >>> MOB as >>>>> in >>>>>>>>>> "if >>>>>>>>>>>> you don't enable MOB you don't need MR") >>>>>>>>>>>>=20 >>>>>>>>>>>> any thoughts? do we a rule that says "we don't want to have >>> hbase >>>>> run >>>>>>>>>> MR >>>>>>>>>>>> jobs, only tool started manually by the user can do that". or >>> can >>>>> we >>>>>>>>>>> start >>>>>>>>>>>> adding MR calls around without problems? >>>>>>>>>>>=20 >>>>>>>>>>=20 >>>>>>>>=20 >>>>>>>=20 >>>>>>>=20 >>>>>=20 >>>=20 >>=20 >>=20 --Apple-Mail-3B869125-C3D8-492B-87EB-69C777B62080--