singa-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Ngin Yun Chuan (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (SINGA-399) Rafiki cannot test rebuilt image
Date Tue, 30 Oct 2018 10:05:00 GMT

    [ https://issues.apache.org/jira/browse/SINGA-399?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16668457#comment-16668457
] 

Ngin Yun Chuan commented on SINGA-399:
--------------------------------------

Hi Zhu Lei,

Would like to update you that we have updated the PR at https://github.com/nginyc/rafiki/pull/64
and merged it into `master` as version 0.0.5. Please pull the latest code from `master`, which
contains some documentation improvements (at https://nginyc.github.io/rafiki/docs/) and edits
to the model interface. Since the code at master have been pushed to our Docker Hub with the
tag 0.0.5, you might need to set `RAFIKI_VERSION=0.0.6` to avoid image version conflicts.


> Rafiki cannot test rebuilt image
> --------------------------------
>
>                 Key: SINGA-399
>                 URL: https://issues.apache.org/jira/browse/SINGA-399
>             Project: Singa
>          Issue Type: Bug
>            Reporter: Zhu Lei
>            Priority: Major
>         Attachments: rafiki-1.PNG, rafiki-2.PNG, rafiki-3.PNG, rafiki-4.PNG, rafiki-5.PNG,
rafiki-6.PNG
>
>
> After downloading the newest rafiki code, at commit 7b3b04e15c62233e515c4d82051cd5dfb799215f,
with comments "Add more error handling to notify user of invalid train job; compact exceptions",
I ran "bash ./scripts/build_images.sh" to build the new admin, advisor, predictor and worker
images. I got the images shown in attached image 'rafiki-1.PNG'. Then I run "bash ./script/start.sh"
to build the containers as shown in the attached image 'rafiki-2.PNG'. Finally when I ran
the client-usage.py example. I got the error in attached image 'rafiki-3.PNG'.
> And I find very surprising that the images of admin, advisor, predictor and worker I
built just now, become some images built weeks ago, shown in attached image 'rafiki-4.PNG'.
Could you kindly provide me some explanations on why this happens? I really do not understand
why this happened.
> And finally, when I run "bash ./script/stop.sh" and leave the swarm and repeat my previous
procedure again, now there is no errors. The only thing difference between the two runs I
think is only the images are different. So the current code of rafiki does not support newly
build images, that is my speculation.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Mime
View raw message