hadoop-yarn-issues mailing list archives

From "Vrushali C (JIRA)" <j...@apache.org>
Subject [jira] [Created] (YARN-4063) Populate the flow activity table
Date Wed, 19 Aug 2015 18:17:45 GMT
Vrushali C created YARN-4063:
--------------------------------

             Summary: Populate the flow activity table
                 Key: YARN-4063
                 URL: https://issues.apache.org/jira/browse/YARN-4063
             Project: Hadoop YARN
          Issue Type: Sub-task
            Reporter: Vrushali C


The flow_activity table needs to be populated.

- Stores per-day flow run pointers and info
- Written to by the RM’s collector for application lifecycle
- Primary key: cluster ! day timestamp ! user ! flow id
- For the day timestamp, we can take the millis since epoch for the end of the day (24:00h)
- Columns include runids, start time, end time, snapshot time
- This table will also be used to efficiently retrieve the flows that had activity on a
  given day. That is needed for daily aggregations, but also for several UIs, including a
  flow-based UI.
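
A minimal sketch of how the proposed primary key could be assembled, for illustration only. The function names, the string encoding of the key, and the use of UTC day boundaries are assumptions made here; the issue specifies only the field order, the "!" separator, and the end-of-day timestamp.

```python
from datetime import datetime, timezone

MILLIS_PER_DAY = 24 * 60 * 60 * 1000
SEPARATOR = "!"  # separator named in the issue description


def end_of_day_millis(ts_millis: int) -> int:
    """Millis since epoch for the end (24:00h) of the day containing ts_millis.

    Assumption: days are cut on UTC boundaries; the issue does not say which
    time zone applies.
    """
    return (ts_millis // MILLIS_PER_DAY + 1) * MILLIS_PER_DAY


def flow_activity_row_key(cluster: str, ts_millis: int, user: str, flow_id: str) -> str:
    """Build cluster ! day timestamp ! user ! flow id as a plain string (hypothetical encoding)."""
    day_ts = end_of_day_millis(ts_millis)
    return SEPARATOR.join([cluster, str(day_ts), user, flow_id])


def day_prefix(cluster: str, ts_millis: int) -> str:
    """Key prefix shared by every flow active on the same day in one cluster."""
    return SEPARATOR.join([cluster, str(end_of_day_millis(ts_millis))]) + SEPARATOR


if __name__ == "__main__":
    # Hypothetical cluster, user, and flow names.
    event = datetime(2015, 8, 19, 18, 17, 45, tzinfo=timezone.utc)
    event_millis = int(event.timestamp() * 1000)
    print(flow_activity_row_key("cluster1", event_millis, "alice", "daily_etl"))
    print(day_prefix("cluster1", event_millis))
```

Because the cluster and the day timestamp lead the key, all flows active on one day share a common prefix, which is what would make the per-day retrieval described above a single prefix scan.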




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
