chukwa-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Cheng (JIRA)" <j...@apache.org>
Subject [jira] Updated: (CHUKWA-253) Aggregations by User
Date Thu, 28 May 2009 20:45:45 GMT

     [ https://issues.apache.org/jira/browse/CHUKWA-253?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Cheng updated CHUKWA-253:
-------------------------

    Attachment: chukwa-253.patch

Patch submitted.
* the pig script is at chukwa-home/script/pig
* the shell script for cronjob is at chukwa-home/bin. To manually run the shell script, use
command
        /path/to/chukwa-home/bin/UserDailySummary.sh <YYYYMMDD> <CLUSTER>

> Aggregations by User
> --------------------
>
>                 Key: CHUKWA-253
>                 URL: https://issues.apache.org/jira/browse/CHUKWA-253
>             Project: Hadoop Chukwa
>          Issue Type: New Feature
>            Reporter: Cheng
>         Attachments: chukwa-253.patch
>
>
> Typically a user submits jobs with similar characteristics. Aggregating the following
metrics based on users can help
> quickly identify VIP users and how their jobs look like:
> -       slot-hours used for map tasks, for reduce tasks
> -       total jobs, jobs failed
> -       data-local-maps,  rack-local-maps, remote-maps
> -       total map-input-bytes, reduce-output-records
> -       total map tasks, total reduce tasks
> The granularity of the aggregation can be as coarse as daily. Data may be used to report
top-K users in certain
> categories. Data shall be available as chukwa records (namely, one record per day per
user).

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message