hadoop-yarn-issues mailing list archives

From "Vrushali C (JIRA)" <j...@apache.org>
Subject [jira] [Created] (YARN-4063) Populate the flow activity table
Date Wed, 19 Aug 2015 18:17:45 GMT
Vrushali C created YARN-4063:

             Summary: Populate the flow activity table
                 Key: YARN-4063
                 URL: https://issues.apache.org/jira/browse/YARN-4063
             Project: Hadoop YARN
          Issue Type: Sub-task
            Reporter: Vrushali C

Need to populate the flow_activity table.

- Stores per-day flow run pointers and info.
- Written to by the RM's collector for the application lifecycle.
- Primary key: cluster ! day timestamp ! user ! flow id
- For the day timestamp we can take the millis since epoch for the end of the day (24:00h).
- Columns include run ids, start time, end time, snapshot time.
- This table will also be used to efficiently retrieve the flows that had activity on a
certain day. That is needed for daily aggregations, but also for several UIs, including a
flow-based UI.
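
A minimal sketch of how the primary key described above could be assembled, for illustration only. The class and method names (FlowActivityRowKey, endOfDayMillis, rowKey, dayPrefix) are hypothetical and not part of any existing YARN API. It assumes day boundaries are taken in UTC, that the key is a plain string joined with "!", and that the components never contain the separator themselves; a real implementation would likely encode the key as bytes and escape the separator.

```java
import java.util.concurrent.TimeUnit;

// Hypothetical helper, not an existing YARN class.
final class FlowActivityRowKey {
    // Separator between key components, as written in the issue description.
    private static final String SEPARATOR = "!";
    private static final long MILLIS_PER_DAY = TimeUnit.DAYS.toMillis(1);

    private FlowActivityRowKey() {}

    // Millis since epoch for the end (24:00h) of the day containing eventMillis.
    // Assumption: days are delimited in UTC.
    static long endOfDayMillis(long eventMillis) {
        return (Math.floorDiv(eventMillis, MILLIS_PER_DAY) + 1) * MILLIS_PER_DAY;
    }

    // Builds: cluster ! day timestamp ! user ! flow id
    static String rowKey(String cluster, long eventMillis, String user, String flowId) {
        return String.join(SEPARATOR,
                cluster, Long.toString(endOfDayMillis(eventMillis)), user, flowId);
    }

    // Prefix shared by every flow that was active in the same cluster on the same day.
    static String dayPrefix(String cluster, long eventMillis) {
        return cluster + SEPARATOR + endOfDayMillis(eventMillis) + SEPARATOR;
    }

    public static void main(String[] args) {
        long eventMillis = 1440008265000L; // 2015-08-19T18:17:45Z
        System.out.println(rowKey("cluster1", eventMillis, "user1", "flow1"));
        System.out.println(dayPrefix("cluster1", eventMillis));
    }
}
```

Because the cluster and day timestamp lead the key, all flows active on a given day share one prefix, so a prefix scan over that day is enough to list them for daily aggregation or a flow-based UI.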

This message was sent by Atlassian JIRA
