reef-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Andrew Chung (JIRA)" <>
Subject [jira] [Commented] (REEF-1339) Adding IInputPartition.Cache() for data download and cache
Date Mon, 25 Apr 2016 19:56:13 GMT


Andrew Chung commented on REEF-1339:

[~markus.weimer] [~dkm2110] I've been looking into this, but it is hard to add a generic {{Cache}}
method that supports different levels of caching to the {{IInputPartition}} interface and
{{FileSystemInputPartition}} without breaking backward compatibility. For this item, I will
simply add a {{[Unstable] Cache()}} function to the interface and preserve the old behavior
(e.g. {{FileSystemInputPartition}} will still cache only to local disk). I'll create a separate
JIRA item to support more generic caching with {{CacheLevel}} support.

> Adding IInputPartition.Cache() for data download and cache
> ----------------------------------------------------------
>                 Key: REEF-1339
>                 URL:
>             Project: REEF
>          Issue Type: Task
>            Reporter: Julia
>            Assignee: Andrew Chung
>              Labels: FT
> Currently, in FileSystemInputPartition, data downloading is implemented in Initilaize()
and called from GetPartitionHandle. It doesn't give client a flexibility to decide when to
download data. Besides, if client wants to cache data in advance, they need to call GetPartitionHandle()
and iterate the data. 
> We would like to expose a new API Cache() in IInputPartition which performs data download
to RAM, SSD, HDD, etc based on client's configuration. 
> The method should be called in ContextStartHandler  in IMRU scenarios. 

This message was sent by Atlassian JIRA

View raw message