spark-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Apache Spark (JIRA)" <j...@apache.org>
Subject [jira] [Assigned] (SPARK-14744) Put examples packaging on a diet
Date Wed, 20 Apr 2016 22:59:25 GMT

     [ https://issues.apache.org/jira/browse/SPARK-14744?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Apache Spark reassigned SPARK-14744:
------------------------------------

    Assignee: Apache Spark

> Put examples packaging on a diet
> --------------------------------
>
>                 Key: SPARK-14744
>                 URL: https://issues.apache.org/jira/browse/SPARK-14744
>             Project: Spark
>          Issue Type: Sub-task
>          Components: Examples
>    Affects Versions: 2.0.0
>            Reporter: Marcelo Vanzin
>            Assignee: Apache Spark
>            Priority: Minor
>
> Currently the examples bring in a lot of external dependencies, ballooning the size of
the Spark distribution packages.
> I'd like to propose two things to slim down these dependencies:
> - make all non-Spark, and also Spark Streaming, dependencies "provided". This means,
especially for streaming connectors, that launching examples becomes more like launching real
applications (where you need to figure out how to provide those dependencies, e.g. using {{--packages}}).
> - audit examples and remove those that don't provide a lot of value. For example, HBase
is working on full-featured Spark bindings, based on code that has already been in use for
a while before being merged into HBase. The HBase example in Spark is very bare bones and,
in comparison, not really useful and in fact a little misleading.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org


Mime
View raw message