hadoop-common-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Runping Qi (JIRA)" <j...@apache.org>
Subject [jira] Commented: (HADOOP-3297) The way in which ReduceTask/TaskTracker gets completion events during shuffle can be improved
Date Mon, 28 Apr 2008 16:37:55 GMT

    [ https://issues.apache.org/jira/browse/HADOOP-3297?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12592842#action_12592842
] 

Runping Qi commented on HADOOP-3297:
------------------------------------


Under what condition fetched map outputs will end up on disk directly?
If a segment is very large, it makes sense to write it out on disk directly.
If it is one or the last few, it makes sense too. Otherwise, a fetched segment 
should get into in-mem file system. If the in-mem file is full, the fetcher should wait.

This is related to hadoop-2095. They should be considered together.



> The way in which ReduceTask/TaskTracker gets completion events during shuffle can be
improved
> ---------------------------------------------------------------------------------------------
>
>                 Key: HADOOP-3297
>                 URL: https://issues.apache.org/jira/browse/HADOOP-3297
>             Project: Hadoop Core
>          Issue Type: Improvement
>          Components: mapred
>            Reporter: Devaraj Das
>            Assignee: Devaraj Das
>             Fix For: 0.18.0
>
>
> Certain things like poll frequency, number of events fetched in one go, etc. can probably
be improved to improve the shuffle performance. This would affect the task-->tasktracker
and the tasktracker-->jobtracker shuffle related RPCs.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message