singa-dev mailing list archives

From "Ngin Yun Chuan (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (SINGA-399) Rafiki cannot test rebuilt image
Date Sun, 28 Oct 2018 07:23:00 GMT

    [ https://issues.apache.org/jira/browse/SINGA-399?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16666313#comment-16666313 ]

Ngin Yun Chuan commented on SINGA-399:
--------------------------------------

Hi Zhu Lei,

I suspect it is the issue I described in the comment above (https://issues.apache.org/jira/browse/SINGA-399?focusedCommentId=16666273&page=com.atlassian.jira.plugin.system.issuetabpanels%3Acomment-tabpanel#comment-16666273).

As I wrote in another comment above (https://issues.apache.org/jira/browse/SINGA-399?focusedCommentId=16666311&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-16666311),
do try out the new PR's code and use `RAFIKI_VERSION=0.0.5`, a version that doesn't conflict with
the images on Docker Hub, and you should be able to use locally built images. We will make it
easier to use locally built images in the future.
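
A minimal sketch of that workflow, under stated assumptions: the image name below is hypothetical and only illustrates how the version ends up in the image tag, and it is assumed (not confirmed here) that the build and start scripts pick up `RAFIKI_VERSION` from the environment. The script paths are the ones quoted in the issue description.

```shell
#!/usr/bin/env bash
# Sketch only. Assumptions: the image name is hypothetical, and the build and
# start scripts are assumed to read RAFIKI_VERSION from the environment.
set -eu

# Choose a version tag that does not clash with the images on Docker Hub,
# so the tag can only refer to the locally built images.
export RAFIKI_VERSION=0.0.5

# Hypothetical image name, used only to show how the tag is composed.
IMAGE_NAME="example/rafiki_admin"
IMAGE_TAG="${IMAGE_NAME}:${RAFIKI_VERSION}"
echo "Locally built image would be tagged ${IMAGE_TAG}"

# With the variable still exported, rebuild the images and restart the stack
# (left commented out so the sketch has no side effects):
# bash ./scripts/build_images.sh
# bash ./script/start.sh
```

Afterwards, listing the local images should show whether the images in use carry the `0.0.5` tag and the expected creation time.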

> Rafiki cannot test rebuilt image
> --------------------------------
>
>                 Key: SINGA-399
>                 URL: https://issues.apache.org/jira/browse/SINGA-399
>             Project: Singa
>          Issue Type: Bug
>            Reporter: Zhu Lei
>            Priority: Major
>         Attachments: rafiki-1.PNG, rafiki-2.PNG, rafiki-3.PNG, rafiki-4.PNG, rafiki-5.PNG, rafiki-6.PNG
>
>
> After downloading the newest rafiki code, at commit 7b3b04e15c62233e515c4d82051cd5dfb799215f,
> with the commit message "Add more error handling to notify user of invalid train job; compact
> exceptions", I ran "bash ./scripts/build_images.sh" to build the new admin, advisor, predictor
> and worker images. I got the images shown in the attached image 'rafiki-1.PNG'. Then I ran
> "bash ./script/start.sh" to create the containers, as shown in the attached image
> 'rafiki-2.PNG'. Finally, when I ran the client-usage.py example, I got the error shown in the
> attached image 'rafiki-3.PNG'.
> I was also very surprised to find that the admin, advisor, predictor and worker images I had
> just built had been replaced by images built weeks ago, as shown in the attached image
> 'rafiki-4.PNG'. Could you kindly explain why this happens? I really do not understand it.
> Finally, when I ran "bash ./script/stop.sh", left the swarm and repeated the previous
> procedure, there were no errors. I think the only difference between the two runs is the
> images. So my speculation is that the current rafiki code does not support newly built images.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
