systemml-issues mailing list archives

From "Matthias Boehm (JIRA)" <>
Subject [jira] [Commented] (SYSTEMML-2197) Multi-threaded broadcast creation
Date Thu, 29 Mar 2018 16:35:00 GMT


Matthias Boehm commented on SYSTEMML-2197:

Our test suite runs hundreds of tests that use this broadcast primitive, but if you want
one particular test, you can use {{FullDistributedMatrixMultiplicationTest}}.

> Multi-threaded broadcast creation
> ---------------------------------
>                 Key: SYSTEMML-2197
>                 URL:
>             Project: SystemML
>          Issue Type: Task
>            Reporter: Matthias Boehm
>            Priority: Major
> All Spark instructions that broadcast one of the input operands rely on a shared primitive
{{sec.getBroadcastForVariable(var)}} for creating partitioned broadcasts, which are wrapper
objects around potentially many broadcast variables to overcome Spark's 2GB limitation for compressed
broadcasts. Each individual broadcast blocks the matrix into squared blocks for direct access
without an unnecessary copy per task. So far, this broadcast creation is single-threaded.
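
To illustrate the partitioning idea described above, here is a minimal sketch of how a blocked matrix might be grouped into several broadcast partitions so that each one stays under a size budget. The class and method names, the fixed per-block byte estimate, and the budget parameter are all assumptions made for illustration; they are not taken from the SystemML code base.

```java
// Hypothetical sketch: not SystemML's actual partitioned broadcast logic.
final class PartitionSketch {
    // Number of blocks along one dimension (ceiling division).
    static int numBlocks(long len, int blen) {
        return (int) ((len + blen - 1) / blen);
    }

    // Number of partitions needed if blocks are grouped so that each
    // partition holds at most maxBytes (at least one block per partition).
    // bytesPerBlock is an assumed uniform size estimate per block.
    static int numPartitions(long rows, long cols, int blen,
                             long bytesPerBlock, long maxBytes) {
        long blocks = (long) numBlocks(rows, blen) * numBlocks(cols, blen);
        long blocksPerPart = Math.max(1, maxBytes / bytesPerBlock);
        return (int) ((blocks + blocksPerPart - 1) / blocksPerPart);
    }
}
```

For example, under these assumptions a 2500 x 2500 matrix with block size 1000 has 3 x 3 = 9 blocks; with 8 MB per block and a 16 MB budget, two blocks fit per partition, giving 5 partitions.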
> This task aims to parallelize the blocking of the given in-memory matrix into squared
blocks (
as well as the subsequent partition creation and actual broadcasting (

> For consistency and in order to avoid excessive over-provisioning, this multi-threading
should use the common internal thread pool or parallel Java streams, which similarly use
the shared {{ForkJoinPool.commonPool}}. An example is the multi-threaded parallelization of
RDDs, which similarly blocks a given matrix into its squared blocks (see
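
The parallel-streams approach mentioned in the description could look roughly like the sketch below. It cuts a dense in-memory matrix into blocks using a parallel `IntStream`, which runs on the shared `ForkJoinPool.commonPool()` by default. The class name, the use of plain `double[][]` arrays, and the method signature are assumptions for illustration; SystemML's own matrix block types and code paths are not shown here.

```java
import java.util.stream.IntStream;

// Hypothetical sketch: plain arrays stand in for SystemML's matrix blocks.
final class ParallelBlockingSketch {
    // Cuts a dense matrix into blen x blen blocks (boundary blocks may be
    // smaller). Blocks are returned in row-major block order.
    static double[][][] toBlocks(double[][] m, int blen) {
        int rows = m.length, cols = m[0].length;
        int nrb = (rows + blen - 1) / blen; // number of row blocks
        int ncb = (cols + blen - 1) / blen; // number of column blocks
        double[][][] out = new double[nrb * ncb][][];
        // Each task writes exactly one distinct slot of 'out', so no
        // synchronization is needed; the parallel stream uses the common pool.
        IntStream.range(0, nrb * ncb).parallel().forEach(ix -> {
            int r0 = (ix / ncb) * blen;
            int c0 = (ix % ncb) * blen;
            int br = Math.min(blen, rows - r0);
            int bc = Math.min(blen, cols - c0);
            double[][] blk = new double[br][bc];
            for (int i = 0; i < br; i++)
                System.arraycopy(m[r0 + i], c0, blk[i], 0, bc);
            out[ix] = blk;
        });
        return out;
    }
}
```

Because the work goes through the common pool rather than a newly created executor, the number of worker threads stays bounded across concurrent callers, which is the over-provisioning concern raised in the description.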

