spark-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Sean Owen (JIRA)" <>
Subject [jira] [Commented] (SPARK-2192) Examples Data Not in Binary Distribution
Date Tue, 25 Nov 2014 11:48:12 GMT


Sean Owen commented on SPARK-2192:

Data files are now consolidated under "data/", and they are not in the binary distribution.
It would be easy to add them, and seems like a reasonable thing to do. However, I'm not clear
all of those data files can be distributed; MovieLens data for example isn't supposed to be
AFAIK. In fact, I'm not clear it should be in the Spark repo even.

Any support for me adding this to the distro, but removing examples based on things like Movielens
that shouldn't be redistributed?

> Examples Data Not in Binary Distribution
> ----------------------------------------
>                 Key: SPARK-2192
>                 URL:
>             Project: Spark
>          Issue Type: Bug
>          Components: Build
>    Affects Versions: 1.0.0
>            Reporter: Pat McDonough
> The data used by examples is not packaged up with the binary distribution. The data subdirectory
of spark should make it's way in to the distribution somewhere so the examples can use it.

This message was sent by Atlassian JIRA

To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message