hive-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Billie Rinaldi (JIRA)" <>
Subject [jira] [Created] (HIVE-3708) Add mapreduce workflow information to job configuration
Date Tue, 13 Nov 2012 21:26:12 GMT
Billie Rinaldi created HIVE-3708:

             Summary: Add mapreduce workflow information to job configuration
                 Key: HIVE-3708
             Project: Hive
          Issue Type: Improvement
            Reporter: Billie Rinaldi

Adding workflow properties to the job configuration would enable logging and analysis of workflows
in addition to individual MapReduce jobs.  Suggested properties include a workflow ID, workflow
name, adjacency list connecting nodes in the workflow, and the name of the current node in
the workflow. - a unique ID for the workflow, ideally prepended with the application
e.g. hive_<hiveQueryId> - a name for the workflow, to distinguish this workflow from other
workflows and to group different runs of the same workflow
e.g. hive query string

mapreduce.workflow.adjacency - an adjacency list for the workflow graph, encoded as mapreduce.workflow.adjacency.<source
node> = <comma-separated list of target nodes> - the name of the node corresponding to this MapReduce job in
the workflow adjacency list

This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see:

View raw message