hadoop-pig-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Olga Natkovich (JIRA)" <j...@apache.org>
Subject [jira] Updated: (PIG-809) number of input lines it processed, number of output lines it produced for PIG job
Date Sat, 01 May 2010 00:44:54 GMT

     [ https://issues.apache.org/jira/browse/PIG-809?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Olga Natkovich updated PIG-809:
-------------------------------

    Fix Version/s: 0.8.0

> number of input lines it processed, number of output lines it produced for PIG job
> ----------------------------------------------------------------------------------
>
>                 Key: PIG-809
>                 URL: https://issues.apache.org/jira/browse/PIG-809
>             Project: Pig
>          Issue Type: Improvement
>          Components: impl
>         Environment: Linux
>            Reporter: Supreeth
>            Assignee: Richard Ding
>             Fix For: 0.8.0
>
>
> Excerpt from the mail conversation.
> It will be a great addition to Pig. Hadoop currently provides all these
> counters. All Pig has to do is to add them up for all Hadoop jobs in the
> script, and emit them at the end of the script. File a jira ?
> - Milind
> On 5/13/09 8:16 AM, "Supreeth Hosur Nagesh Rao" <supreeth@yahoo-inc.com>
> wrote:
> > > Hi Olga
> > > 
> > > With every PIG job is there any way for us to trap into the operational
> > > stats of that job, like number of input lines it processed, number of
> > > output lines it produced?
> > > 
> > > I dont want to have a separate PIG script to do the same as it may be
> > > additional parsing, so is there such a stat. If not can that be
> > > provided, and exposed as a config parameter?
> > > 
> > > -Supreeth
> This will be a great feature to have for our processing.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message