Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id D8248200CDE for ; Tue, 25 Jul 2017 01:29:06 +0200 (CEST) Received: by cust-asf.ponee.io (Postfix) id D680D165E44; Mon, 24 Jul 2017 23:29:06 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id 285E3165E43 for ; Tue, 25 Jul 2017 01:29:06 +0200 (CEST) Received: (qmail 44606 invoked by uid 500); 24 Jul 2017 23:29:05 -0000 Mailing-List: contact yarn-issues-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Delivered-To: mailing list yarn-issues@hadoop.apache.org Received: (qmail 44594 invoked by uid 99); 24 Jul 2017 23:29:05 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd2-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 24 Jul 2017 23:29:05 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd2-us-west.apache.org (ASF Mail Server at spamd2-us-west.apache.org) with ESMTP id 7A4E71A1CFE for ; Mon, 24 Jul 2017 23:29:04 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd2-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: -99.202 X-Spam-Level: X-Spam-Status: No, score=-99.202 tagged_above=-999 required=6.31 tests=[KAM_ASCII_DIVIDERS=0.8, RP_MATCHES_RCVD=-0.001, SPF_PASS=-0.001, USER_IN_WHITELIST=-100] autolearn=disabled Received: from mx1-lw-eu.apache.org ([10.40.0.8]) by localhost (spamd2-us-west.apache.org [10.40.0.9]) (amavisd-new, port 10024) with ESMTP id ZF6ONnXK7_p9 for ; Mon, 24 Jul 2017 23:29:03 +0000 (UTC) Received: from mailrelay1-us-west.apache.org (mailrelay1-us-west.apache.org [209.188.14.139]) by mx1-lw-eu.apache.org (ASF Mail Server at mx1-lw-eu.apache.org) with ESMTP id 7922C5FC12 for ; Mon, 24 Jul 2017 23:29:02 +0000 (UTC) Received: from jira-lw-us.apache.org (unknown [207.244.88.139]) by mailrelay1-us-west.apache.org (ASF Mail Server at mailrelay1-us-west.apache.org) with ESMTP id CAFEEE0DF5 for ; Mon, 24 Jul 2017 23:29:00 +0000 (UTC) Received: from jira-lw-us.apache.org (localhost [127.0.0.1]) by jira-lw-us.apache.org (ASF Mail Server at jira-lw-us.apache.org) with ESMTP id 2CBA223F1D for ; Mon, 24 Jul 2017 23:29:00 +0000 (UTC) Date: Mon, 24 Jul 2017 23:29:00 +0000 (UTC) From: "Vrushali C (JIRA)" To: yarn-issues@hadoop.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Comment Edited] (YARN-6733) Add table for storing sub-application entities MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 archived-at: Mon, 24 Jul 2017 23:29:07 -0000 [ https://issues.apache.org/jira/browse/YARN-6733?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16099207#comment-16099207 ] Vrushali C edited comment on YARN-6733 at 7/24/17 11:28 PM: ------------------------------------------------------------ So we thought that it will be good to keep the column name so that sub apps can store this information. For regular applications, the flow version can be used to determine whether optimizations are to be done. The flow version indicates if the flow has changed, that is, say if the pig script changes, its flow version will change. So then, for example, reducer estimation calculations can be done differently. This applies to the application entities. We discussed that it will be good to keep the same information for sub-apps in case they want to use this information in a similar fashion. As such, this column currently only exists in code, it's not taking up any disk space/hbase space etc if no one writes to it. But having it gives the framework developers a chance to use it if they want. was (Author: vrushalic): So we thought that it will be good to keep the column name so that sub apps can store this information. For regular applications, the flow version can be used to determine whether optimizations are to be done. The flow version indicates if the flow has changed, that is, say if the pig script changes, it's flow version will change. So then, for example, reducer estimation calculations can be done differently. This applies to the application entities. We discussed that it will be good to keep the same information for sub-apps in case they want to use this information in a similar fashion. As such, this column currently only exists in code, it's not taking up any disk space/hbase space etc if no one writes to it. But having it given the framework developers a chance to use it if they want. > Add table for storing sub-application entities > ---------------------------------------------- > > Key: YARN-6733 > URL: https://issues.apache.org/jira/browse/YARN-6733 > Project: Hadoop YARN > Issue Type: Sub-task > Components: timelineserver > Reporter: Vrushali C > Assignee: Vrushali C > Attachments: IMG_7040.JPG, YARN-6733-YARN-5355.001.patch, YARN-6733-YARN-5355.002.patch, YARN-6733-YARN-5355.003.patch, YARN-6733-YARN-5355.004.patch, YARN-6733-YARN-5355.005.patch, YARN-6733-YARN-5355.006.patch, YARN-6733-YARN-5355.007.patch, YARN-6733-YARN-5355.008.patch > > > After a discussion with Tez folks, we have been thinking over introducing a table to store sub-application information. > For example, if a Tez session runs for a certain period as User X and runs a few AMs. These AMs accept DAGs from other users. Tez will execute these dags with a doAs user. ATSv2 should store this information in a new table perhaps called as "sub_application" table. > This jira tracks the code changes needed for table schema creation. > I will file other jiras for writing to that table, updating the user name fields to include sub-application user etc. -- This message was sent by Atlassian JIRA (v6.4.14#64029) --------------------------------------------------------------------- To unsubscribe, e-mail: yarn-issues-unsubscribe@hadoop.apache.org For additional commands, e-mail: yarn-issues-help@hadoop.apache.org