hadoop-mapreduce-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Bejoy KS (Commented) (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (MAPREDUCE-3678) The Map tasks logs should have the value of input split it processed
Date Tue, 17 Jan 2012 06:34:40 GMT

    [ https://issues.apache.org/jira/browse/MAPREDUCE-3678?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13187487#comment-13187487
] 

Bejoy KS commented on MAPREDUCE-3678:
-------------------------------------

Ya it is available in taskdetails.jsp . But when we have a large number of jobs running on
our cluster in a matter of half an hour the jobs would be in history and in in jobtaskshistory.jsp
there are only the following values
-Task Id	
-Start Time	
-Finish Time
-Error

Can we have one more filed here similar to status in  taskdetails.jsp that would show the
input split it processed as well.

Once the job is in history viewer currently do we have any option to find this information?

                
> The Map tasks logs should have the value of input split it processed
> --------------------------------------------------------------------
>
>                 Key: MAPREDUCE-3678
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3678
>             Project: Hadoop Map/Reduce
>          Issue Type: New Feature
>          Components: nodemanager, tasktracker
>    Affects Versions: 0.20.203.0, 0.20.205.0, 1.0.0
>         Environment: Linux red hat.
>            Reporter: Bejoy KS
>
> It would be easier to debug some corner in tasks if we knew what was the input split
processed by that task. Map reduce task tracker log should accommodate the same. Also in the
jobdetails web UI, the split also should be displayed along with the Split Locations. 
> Sample as
> Input Split
> hdfs://myserver:9000/userdata/sampleapp/inputdir/file1.csv - <split no>/<offset
from beginning of file>
> This would be much beneficial to nail down some data quality issues in large data volume
processing.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Mime
View raw message