Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id 71EA5200B35 for ; Tue, 21 Jun 2016 04:29:13 +0200 (CEST) Received: by cust-asf.ponee.io (Postfix) id 7073A160A65; Tue, 21 Jun 2016 02:29:13 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id 939D7160A55 for ; Tue, 21 Jun 2016 04:29:12 +0200 (CEST) Received: (qmail 98102 invoked by uid 500); 21 Jun 2016 02:29:11 -0000 Mailing-List: contact yarn-dev-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Delivered-To: mailing list yarn-dev@hadoop.apache.org Received: (qmail 98073 invoked by uid 99); 21 Jun 2016 02:29:11 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd3-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 21 Jun 2016 02:29:11 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd3-us-west.apache.org (ASF Mail Server at spamd3-us-west.apache.org) with ESMTP id E7E001804D0 for ; Tue, 21 Jun 2016 02:29:10 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd3-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 1.979 X-Spam-Level: * X-Spam-Status: No, score=1.979 tagged_above=-999 required=6.31 tests=[DKIM_SIGNED=0.1, DKIM_VALID=-0.1, HTML_MESSAGE=2, RCVD_IN_DNSWL_NONE=-0.0001, RCVD_IN_MSPIKE_H3=-0.01, RCVD_IN_MSPIKE_WL=-0.01, SPF_PASS=-0.001] autolearn=disabled Authentication-Results: spamd3-us-west.apache.org (amavisd-new); dkim=pass (2048-bit key) header.d=cloudera-com.20150623.gappssmtp.com Received: from mx2-lw-us.apache.org ([10.40.0.8]) by localhost (spamd3-us-west.apache.org [10.40.0.10]) (amavisd-new, port 10024) with ESMTP id ph4h9ya-MFlF for ; Tue, 21 Jun 2016 02:29:08 +0000 (UTC) Received: from mail-yw0-f178.google.com (mail-yw0-f178.google.com [209.85.161.178]) by mx2-lw-us.apache.org (ASF Mail Server at mx2-lw-us.apache.org) with ESMTPS id 905615F4EE for ; Tue, 21 Jun 2016 02:29:08 +0000 (UTC) Received: by mail-yw0-f178.google.com with SMTP id i12so2381205ywa.1 for ; Mon, 20 Jun 2016 19:29:08 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=cloudera-com.20150623.gappssmtp.com; s=20150623; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :cc; bh=eWpvdF9CVfQLNZlavwqPlVAQtjbEnr7IbAqgmGITyTo=; b=EY+HwXWIuqJgqK2tQfcghBxrRArHLTFv7qLOtaUomNXFBR6t+KmDl3DlarJf2GPL9f lDH6HB4SSkt6P+lDb4OTbvvb2QYDVrLeMdF71SoCf9eNoCWji5ElQOs5WrNFZ1LYqjIG oo7R08aBPWZRMmYc+1a4PMiDzA6SiPJw0YyR5uWKrcLVLrrfkFR5h58N5MWTNT9xjV11 Wxm3Q7wOexayDYVjZwnVxrFJlvWqGG5nPHJdCbH+LW7SlhLjnWJz/yrn7W9/0uryA0Jo Q8fEjtastD53Y1j9V7xQ8KtVE0EZYdOmFgKNeabLY16v+FLuW5pmJRbA4r0wcoIZUpqC L70Q== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20130820; h=x-gm-message-state:mime-version:in-reply-to:references:date :message-id:subject:from:to:cc; bh=eWpvdF9CVfQLNZlavwqPlVAQtjbEnr7IbAqgmGITyTo=; b=L+m1PohhEVGGWfngmB4zPWJMOJLV6Uk7TEyfO0rwxECdZLriN+HzHQPCAwOCtsHkfx UTsXD+TlNN6nipt53o7RccDEtSmsB1XxoOh5RUzuuDEd4mVfy6vJBmeqYGd+2M34YLgY i2RTsFN6nhdNMh8OgVc2RYaap7U1FmTEuRKJCqMpEf6owLXYN56Ob7ZEncRzxGT4WeHW oLUXn8OKgv/uPYr63JFuxt5LLVCb1tj9BWcylWvHOgVZDxNEocV8KLNRT8rU8jp7HwxO vJM3Gd72RpUT2gW65Db7g8hL0TUU5SFG5/bvOiSTQ0I1VDMXosTuV+y+KQEqBozRnS5a BeFA== X-Gm-Message-State: ALyK8tI2sSk66C+MRZ/GwCxJV8X5aDon3UXYmoaQ+XZKY+q4+PAmkkKFvkT9TMCYzahcKD2ShJ2rJaYhREzRwxUF MIME-Version: 1.0 X-Received: by 10.37.25.70 with SMTP id 67mr7479147ybz.97.1466476147853; Mon, 20 Jun 2016 19:29:07 -0700 (PDT) Received: by 10.13.193.199 with HTTP; Mon, 20 Jun 2016 19:29:07 -0700 (PDT) In-Reply-To: References: Date: Mon, 20 Jun 2016 19:29:07 -0700 Message-ID: Subject: Re: [DISCUSS] merging YARN-2928 (Timeline Service v.2) to trunk From: Karthik Kambatla To: Sangjin Lee Cc: "yarn-dev@hadoop.apache.org" Content-Type: multipart/alternative; boundary=001a113db1dedda0d40535c095f7 archived-at: Tue, 21 Jun 2016 02:29:13 -0000 --001a113db1dedda0d40535c095f7 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable Firstly, thanks Sangjin and others for driving this major feature. Merging to trunk and including in 3.0.0-alpha1 seems reasonable, as it will give early access to downstream users. With regards to merging, it would help to have clear documentation on how to setup and use ATS. Slightly unrelated to the merge, do we plan to support any other simpler backend for users to try out, in addition to HBase? LevelDB? I understand this wouldn't scale, but would it help with initial adoption and feedback from early users? On Mon, Jun 20, 2016 at 10:26 AM, Sangjin Lee wrote: > Hi all, > > I=E2=80=99d like to open a discussion on merging the Timeline Service v.2= feature > to trunk (YARN-2928 and MAPREDUCE-6331) [1][2]. We have been developing t= he > feature in a feature branch (YARN-2928 [3]) for a while, and we are > reasonably confident that the state of the feature meets the criteria to = be > merged onto trunk and we'd love folks to get their hands on it and provid= e > valuable feedback so that we can make it production-ready. > > In a nutshell, Timeline Service v.2 delivers significant scalability and > usability improvements based on a new architecture. You can browse the > requirements/design doc, the storage schema doc, the new entity/data mode= l, > the YARN documentation, and also discussions on subsequent milestones on > YARN-2928 [1]. > > What we would like to merge to trunk is termed "alpha 1" (milestone 1). T= he > feature has a complete end-to-end read/write flow, and you should be able > to start setting it up and testing it. At a high level, the following are > the key features that have been implemented: > > - distributed writers (collectors) as NM aux services > - HBase storage > - new entity model that includes flows > - setting the flow context via YARN app tags > - real time metrics aggregation to the application level and the flow lev= el > - rich REST API that supports filters, complex conditionals, limits, > content selection, etc. > - YARN generic events and system metrics > - integration with Distributed Shell and MapReduce > > There are a total of 139 subtasks that were completed as part of this > effort. > > We paid close attention to ensure that once disabled Timeline Service v.2 > does not impact existing functionality when disabled (by default). > > I'd like to call out a couple of things to discuss in particular. > > *First*, if the merge vote is approved, to which branch should this be > merged and what would be the release version? My preference is that *it > would be merged to branch "trunk" and be part of 3.0.0-alpha1* if approve= d. > Since the 3.0.0-alpha1 is in active progress, I wanted to get your though= ts > on this. > > *Second*, Timeline Service v.2 introduces a dependency on HBase from YARN= . > It is not a cyclical dependency (as HBase does not really depend on YARN)= . > However, the version of Hadoop that HBase currently supports lags behind > the Hadoop version that Timeline Service is based on, so there is a > potential for subtle dependency conflicts. We made some efforts to isolat= e > the issue (see [4] and [5]). The HBase folks have also been responsive in > keeping up with the trunk as much as they can. Nonetheless, this is > something to keep in mind. > > I would love to get your thoughts on these and more before we open a real > voting thread. Thanks! > > Regards, > Sangjin > > [1] YARN-2928: https://issues.apache.org/jira/browse/YARN-2928 > [2] MAPREDUCE-6331: https://issues.apache.org/jira/browse/MAPREDUCE-6331 > [3] YARN-2928 commits: https://github.com/apache/hadoop/commits/YARN-2928 > [4] YARN-5045: https://issues.apache.org/jira/browse/YARN-5045 > [5] YARN-5071: https://issues.apache.org/jira/browse/YARN-5071 > --001a113db1dedda0d40535c095f7--