singa-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Zhu Lei (JIRA)" <>
Subject [jira] [Commented] (SINGA-399) Rafiki cannot test rebuilt image
Date Sun, 28 Oct 2018 07:17:00 GMT


Zhu Lei commented on SINGA-399:

Hi Ngin Yun Chuan, thanks for your reply,

1). I think I am not facing the first scenario, as the only changes I made to
to check Rafiki can run newly built image is only to let it return \{'prediction': 'test'}
as shown in attached image 'rafiki-5.PNG'.

2). for the second scenario, I rebuilt the new images, where the only change is that the predict()
function in returnsĀ \{'prediction': 'test'}. I follow my previous procedure,
running 'bash ./scripts/' and 'python'. I got the same error as I
mentioned. later, I run the commands you suggest inĀ [].
The predictor server correctly returns the result as shown in image 'rafiki-6.PNG'. However,
the predictor are not using the modified I think it somehow in the middle replace
the newly built predictor image with the docker hub one. !rafiki-5.PNG!!rafiki-6.PNG!

> Rafiki cannot test rebuilt image
> --------------------------------
>                 Key: SINGA-399
>                 URL:
>             Project: Singa
>          Issue Type: Bug
>            Reporter: Zhu Lei
>            Priority: Major
>         Attachments: rafiki-1.PNG, rafiki-2.PNG, rafiki-3.PNG, rafiki-4.PNG
> After downloading the newest rafiki code, at commit 7b3b04e15c62233e515c4d82051cd5dfb799215f,
with comments "Add more error handling to notify user of invalid train job; compact exceptions",
I ran "bash ./scripts/" to build the new admin, advisor, predictor and worker
images. I got the images shown in attached image 'rafiki-1.PNG'. Then I run "bash ./script/"
to build the containers as shown in the attached image 'rafiki-2.PNG'. Finally when I ran
the example. I got the error in attached image 'rafiki-3.PNG'.
> And I find very surprising that the images of admin, advisor, predictor and worker I
built just now, become some images built weeks ago, shown in attached image 'rafiki-4.PNG'.
Could you kindly provide me some explanations on why this happens? I really do not understand
why this happened.
> And finally, when I run "bash ./script/" and leave the swarm and repeat my previous
procedure again, now there is no errors. The only thing difference between the two runs I
think is only the images are different. So the current code of rafiki does not support newly
build images, that is my speculation.

This message was sent by Atlassian JIRA

View raw message