systemml-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Matthias Boehm (JIRA)" <>
Subject [jira] [Commented] (SYSTEMML-2359) Extend update per EPOCH
Date Sun, 03 Jun 2018 20:51:00 GMT


Matthias Boehm commented on SYSTEMML-2359:

I'm not sure if I understand your question correctly. The slicing of batches from the local
worker's data partition should stay unchanged. In contrast to updates per batch, we would
update the worker's model only locally (without synchronizing/communicating with the parameter
server). For that it might be good to abstract the aggregation service a bit to make it accessible
from both the workers and param server. Since we don't need to keep the gradients for all
batches, the memory requirements should be the same for per-batch/per-epoch, but per-epoch
requires less synchronization and aggregation overhead which will be important especially
in distributed settings.

> Extend update per EPOCH
> -----------------------
>                 Key: SYSTEMML-2359
>                 URL:
>             Project: SystemML
>          Issue Type: Sub-task
>            Reporter: LI Guobao
>            Assignee: LI Guobao
>            Priority: Major

This message was sent by Atlassian JIRA

View raw message