reef-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Julia (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (REEF-1723) Fix TestFailMapperEvaluatorOnWaitingForEvaluatorAndExecution failures in AppVeyor
Date Tue, 08 Aug 2017 02:54:00 GMT

     [ https://issues.apache.org/jira/browse/REEF-1723?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Julia updated REEF-1723:
------------------------
    Attachment: driver.stderr
                driver.stdout

I had a repro in local test and caught the logs. The task was completed pretty late, took
about 2 minutes. 

Aug 07, 2017 7:14:20 PM org.apache.reef.javabridge.generic.JobDriver$RunningTaskHandler onNext
INFO: RunningTask will be handled by CLR handler. Task Id: IMRUMap-RandomInputPartition-3-3
Aug 07, 2017 7:16:22 PM org.apache.reef.javabridge.generic.JobDriver$CompletedTaskHandler
onNext
INFO: Completed task: IMRUMaster-3
Aug 07, 2017 7:16:22 PM org.apache.reef.javabridge.generic.JobDriver$CompletedTaskHandler
onNext
INFO: Return results to the client:
UpdateTaskCompleted


identifier: "BroadcastReduceDriver"
state: DONE

All the events eventually still received by driver and the driver shut down finally.
 
Aug 07, 2017 7:16:29 PM org.apache.reef.runtime.common.driver.DriverRuntimeStopHandler onNext
INFO: Driver shutdown complete
Aug 07, 2017 7:16:29 PM org.apache.reef.runtime.common.REEFLauncher main
INFO: Exiting REEFLauncher.main()

The entire testing have 4 retries. The main delay was because the last successful run took
about a few minutes and test cannot open the log file in specified time frame (2min). 


> Fix TestFailMapperEvaluatorOnWaitingForEvaluatorAndExecution failures in AppVeyor
> ---------------------------------------------------------------------------------
>
>                 Key: REEF-1723
>                 URL: https://issues.apache.org/jira/browse/REEF-1723
>             Project: REEF
>          Issue Type: Sub-task
>          Components: REEF.NET
>            Reporter: Mariia Mykhailova
>            Assignee: Julia
>         Attachments: driver.stderr, driver.stdout
>
>
> Test introduced in REEF-1691.
> https://ci.appveyor.com/project/tcNickolas/reef/build/481-master/job/88eu113ns7no4fto/tests
> {noformat}
> Assert.True() Failure
> Expected: True
> Actual:   False
> at
> Assert.True(NumberOfRetry * numTasks < completedTaskCount + failedEvaluatorCount -
1 + failedTaskCount);
> {noformat}
> https://ci.appveyor.com/project/ApacheSoftwareFoundation/reef/build/1307-master/job/way15tkd4kyywoc1/tests
- normally the test completes in a minute, and here it timed out after 4 minutes, so looks
like driver didn't finish:
> {noformat}
> Cannot read from log file C:\projects\reef\lang\cs\bin\x64\Debug\Org.Apache.REEF.Tests\REEF_LOCAL_RUNTIME1b3b23f8\reef-BroadcastReduceDriver-20170120070049453\driver\driver.stdout
> Expected: True
> Actual:   False
> {noformat}



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Mime
View raw message