Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id EEEE8200C23 for ; Wed, 8 Feb 2017 03:40:53 +0100 (CET) Received: by cust-asf.ponee.io (Postfix) id EDC31160B68; Wed, 8 Feb 2017 02:40:53 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id 41E46160B3E for ; Wed, 8 Feb 2017 03:40:53 +0100 (CET) Received: (qmail 79967 invoked by uid 500); 8 Feb 2017 02:40:47 -0000 Mailing-List: contact yarn-issues-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Delivered-To: mailing list yarn-issues@hadoop.apache.org Received: (qmail 79956 invoked by uid 99); 8 Feb 2017 02:40:47 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd1-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 08 Feb 2017 02:40:47 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd1-us-west.apache.org (ASF Mail Server at spamd1-us-west.apache.org) with ESMTP id F130FC0A77 for ; Wed, 8 Feb 2017 02:40:46 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd1-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: -1.199 X-Spam-Level: X-Spam-Status: No, score=-1.199 tagged_above=-999 required=6.31 tests=[KAM_ASCII_DIVIDERS=0.8, KAM_LAZY_DOMAIN_SECURITY=1, RP_MATCHES_RCVD=-2.999] autolearn=disabled Received: from mx1-lw-us.apache.org ([10.40.0.8]) by localhost (spamd1-us-west.apache.org [10.40.0.7]) (amavisd-new, port 10024) with ESMTP id N9LYDoU5tqeB for ; Wed, 8 Feb 2017 02:40:44 +0000 (UTC) Received: from mailrelay1-us-west.apache.org (mailrelay1-us-west.apache.org [209.188.14.139]) by mx1-lw-us.apache.org (ASF Mail Server at mx1-lw-us.apache.org) with ESMTP id 6D8925F283 for ; Wed, 8 Feb 2017 02:40:44 +0000 (UTC) Received: from jira-lw-us.apache.org (unknown [207.244.88.139]) by mailrelay1-us-west.apache.org (ASF Mail Server at mailrelay1-us-west.apache.org) with ESMTP id 88288E059C for ; Wed, 8 Feb 2017 02:40:42 +0000 (UTC) Received: from jira-lw-us.apache.org (localhost [127.0.0.1]) by jira-lw-us.apache.org (ASF Mail Server at jira-lw-us.apache.org) with ESMTP id C267125290 for ; Wed, 8 Feb 2017 02:40:41 +0000 (UTC) Date: Wed, 8 Feb 2017 02:40:41 +0000 (UTC) From: "Rohith Sharma K S (JIRA)" To: yarn-issues@hadoop.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Commented] (YARN-6027) Improve /flows API for more flexible filters fromid, collapse, userid MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 archived-at: Wed, 08 Feb 2017 02:40:54 -0000 [ https://issues.apache.org/jira/browse/YARN-6027?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15857267#comment-15857267 ] Rohith Sharma K S commented on YARN-6027: ----------------------------------------- Thanks [~varun_saxena] for the review.. bq.Do we need cluster ID in fromId because we are ignoring it completely? Yes, it is required even though it is ignored, considering when fromId is being used. Do not want user to parse something and provide it as fromId. User can directly provide flow entity ID as fromId. Lets reader server handles it. Cluster Id check can be done to verify context cluster and from clusterId are equal. Ideally both should match. Otherwise we can throw exception. bq. If there is a / in cluster ID we may have to escape it to avoid parsing errors. If need to parse the errors, then why flow entity id is providing full row key as id? I think need to change flow entity id format itself. bq. If we use collapse, even with fromId, there seems to be a full table scan which will impact Yes, it does table scan. But it is expected to collapse with date range otherwise default behavior of /flows should be changed to give one day flows rather than full table data. It is a engineering issue, and may be can mention like performance will be bit slow. bq. Maybe we can send the last real ID in info field of last flow activity entity if previous query was made with collapse field Initially idea was to send last real id as fromId field info. But flows are stored per day for each user which not useful. Note that when collapse is used, we must scan to get all entities and apply fromId. Scanning can't be done half the way which end up in redundant entries for the user. Given previous comment is satisfied this should not be an issue. bq. you have mentioned that fromId validation is happening in getResult method. Could not find it ahh, I think I have missed it at global level. I have validating in one condition. Will validate at global level. bq. In processResults we first get the result from backend while applying limit and then process result for collapse and fromId filters. If you look at the patch, I have removed PageFilter while scanning which gives all the data. One optimization I can do is PageFilter can be applied in non-collapse mode because in non collapse mode scanning will start from given fromId. But the same logic can not be used for collapse mode. > Improve /flows API for more flexible filters fromid, collapse, userid > --------------------------------------------------------------------- > > Key: YARN-6027 > URL: https://issues.apache.org/jira/browse/YARN-6027 > Project: Hadoop YARN > Issue Type: Sub-task > Components: timelineserver > Reporter: Rohith Sharma K S > Assignee: Rohith Sharma K S > Labels: yarn-5355-merge-blocker > Attachments: YARN-6027-YARN-5355.0001.patch > > > In YARN-5585 , fromId is supported for retrieving entities. We need similar filter for flows/flowRun apps and flow run and flow as well. > Along with supporting fromId, this JIRA should also discuss following points > * Should we throw an exception for entities/entity retrieval if duplicates found? > * TimelieEntity : > ** Should equals method also check for idPrefix? > ** Does idPrefix is part of identifiers? -- This message was sent by Atlassian JIRA (v6.3.15#6346) --------------------------------------------------------------------- To unsubscribe, e-mail: yarn-issues-unsubscribe@hadoop.apache.org For additional commands, e-mail: yarn-issues-help@hadoop.apache.org