hadoop-yarn-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Jason Lowe (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (YARN-1575) Public localizer crashes with "Localized unkown resource"
Date Thu, 09 Jan 2014 16:12:50 GMT

     [ https://issues.apache.org/jira/browse/YARN-1575?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Jason Lowe updated YARN-1575:
-----------------------------

    Attachment: YARN-1575.patch
                YARN-1575.branch-0.23.patch

Attaching a blunt way to solve the race condition, which is to synchronize the queueing and
update of {{pending}}.  This basically defeats the point of {{pending}} being a ConcurrentHashMap,
so I updated it to a synchronized map since some unit tests are accessing it asynchronously.

For 0.23 we already are synchronizing {{attempts}}, so I piggy-backed the synchronization
on that variable.

> Public localizer crashes with "Localized unkown resource"
> ---------------------------------------------------------
>
>                 Key: YARN-1575
>                 URL: https://issues.apache.org/jira/browse/YARN-1575
>             Project: Hadoop YARN
>          Issue Type: Sub-task
>          Components: nodemanager
>    Affects Versions: 0.23.10, 2.2.0
>            Reporter: Jason Lowe
>            Priority: Critical
>         Attachments: YARN-1575.branch-0.23.patch, YARN-1575.patch
>
>
> The public localizer can crash with the error:
> {noformat}
> 2014-01-08 14:11:43,212 [Thread-467] ERROR org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService:
Localized unkonwn resource to java.util.concurrent.FutureTask@852e26
> 2014-01-08 14:11:43,212 [Thread-467] INFO org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService:
Public cache exiting
> {noformat}



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

Mime
View raw message