hadoop-common-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Sreekanth Ramakrishnan (JIRA)" <j...@apache.org>
Subject [jira] Updated: (HADOOP-5869) TestQueueCapacities timeout in trunk.
Date Mon, 08 Jun 2009 03:52:07 GMT

     [ https://issues.apache.org/jira/browse/HADOOP-5869?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Sreekanth Ramakrishnan updated HADOOP-5869:
-------------------------------------------

    Attachment: HADOOP-5869-2.patch

Attaching new patch correcting few issues:

* The tests were failing randomly because of a timing issue with regards to speculative tasks
being launched. The speculative execution is currently disabled in this patch.
* The tests were timing out instead of assertion failing because, in {{MiniMRCluster.shutdown()}}
we do a {{waitTaskTrackers()}}. In case of controlled jobs the trackers never get idle until
we finish tasks, but then assertion has failed and we would have wait for test to time out.

> TestQueueCapacities timeout in trunk.
> -------------------------------------
>
>                 Key: HADOOP-5869
>                 URL: https://issues.apache.org/jira/browse/HADOOP-5869
>             Project: Hadoop Core
>          Issue Type: Bug
>          Components: contrib/capacity-sched
>    Affects Versions: 0.20.1
>            Reporter: Sreekanth Ramakrishnan
>            Assignee: Sreekanth Ramakrishnan
>             Fix For: 0.20.1
>
>         Attachments: HADOOP-5869-1.patch, HADOOP-5869-2.patch, hadoop-5869.patch, thread-dump.txt
>
>
> TestQueueCapacities in trunk currently times out with message failed to fetch map-outputs.
Stack trace is:
> {noformat}
> 2009-05-19 10:54:01,162 WARN org.apache.hadoop.mapred.ReduceTask: \
>   attempt_200905191053_0001_r_000011_0 copy failed: attempt_200905191053_0001_m_000000_0
from localhost
> 2009-05-19 10:54:01,163 WARN org.apache.hadoop.mapred.ReduceTask: java.io.FileNotFoundException:
\
>   http://localhost:54203/mapOutput?job=job_200905191053_0001&map=attempt_200905191053_0001_m_000000_0&reduce=11
>         at sun.net.www.protocol.http.HttpURLConnection.getInputStream(HttpURLConnection.java:1241)
>         at org.apache.hadoop.mapred.ReduceTask$ReduceCopier$MapOutputCopier.getInputStream(ReduceTask.java:1436)
>         at org.apache.hadoop.mapred.ReduceTask$ReduceCopier$MapOutputCopier.getMapOutput(ReduceTask.java:1353)
>         at org.apache.hadoop.mapred.ReduceTask$ReduceCopier$MapOutputCopier.copyOutput(ReduceTask.java:1267)
>         at org.apache.hadoop.mapred.ReduceTask$ReduceCopier$MapOutputCopier.run(ReduceTask.java:1199)
> {noformat}

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message