spark-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Joseph K. Bradley (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (SPARK-7150) Facilitate random column generation for DataFrames
Date Sun, 26 Apr 2015 07:49:38 GMT

    [ https://issues.apache.org/jira/browse/SPARK-7150?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14512941#comment-14512941
] 

Joseph K. Bradley commented on SPARK-7150:
------------------------------------------

Sounds good

> Facilitate random column generation for DataFrames
> --------------------------------------------------
>
>                 Key: SPARK-7150
>                 URL: https://issues.apache.org/jira/browse/SPARK-7150
>             Project: Spark
>          Issue Type: Sub-task
>          Components: ML, SQL
>            Reporter: Joseph K. Bradley
>            Priority: Minor
>
> It would be handy to have easy ways to construct random columns for DataFrames.  Proposed
API:
> {code}
> object RandomRDD {
>   def normalColumn(): Column = ???
> }
> {code}
> Usage:
> {code}
> myDataFrame.withColumn("myRandCol", RandomRDD.normalColumn())
> {code}
> This could be part of spark.ml (in which case it could be in a RandomRDD object resembling
the one in spark.mllib), or it could be in SQL proper.  I'd go for spark.ml, but either is
fine with me.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org


Mime
View raw message