hadoop-yarn-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Zhijie Shen (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (YARN-3914) Entity created time should be part of the row key of entity table
Date Sat, 11 Jul 2015 01:27:05 GMT

    [ https://issues.apache.org/jira/browse/YARN-3914?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14623170#comment-14623170
] 

Zhijie Shen commented on YARN-3914:
-----------------------------------

This will not block the implementation of getEntities (YARN-3049), but the performance will
be bad without it, especially when the number of entities per type per app becomes huge, i.e.,
there's a big job.

> Entity created time should be part of the row key of entity table
> -----------------------------------------------------------------
>
>                 Key: YARN-3914
>                 URL: https://issues.apache.org/jira/browse/YARN-3914
>             Project: Hadoop YARN
>          Issue Type: Sub-task
>          Components: timelineserver
>            Reporter: Zhijie Shen
>            Assignee: Zhijie Shen
>
> Entity created time should be part of the row key of entity table, between entity type
and entity Id. The reason to have it is to index the entities. Though we cannot index the
entities for all kinds of information, indexing them according to the created time is very
necessary. Without it, every query for the latest entities that belong to an application and
a type will scan through all the entities that belong to them. For example, if we want to
list the 100 latest started containers in an YARN app.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message