spark-issues mailing list archives

From "Sun Rui (JIRA)" <j...@apache.org>
Subject [jira] [Created] (SPARK-12235) Enhance mutate() to support replace existing columns
Date Wed, 09 Dec 2015 07:00:17 GMT
Sun Rui created SPARK-12235:
-------------------------------

             Summary: Enhance mutate() to support replace existing columns
                 Key: SPARK-12235
                 URL: https://issues.apache.org/jira/browse/SPARK-12235
             Project: Spark
          Issue Type: Improvement
          Components: SparkR
    Affects Versions: 1.5.2
            Reporter: Sun Rui


mutate() in the dplyr package supports both adding new columns and replacing existing columns,
but the current implementation of mutate() in SparkR supports adding new columns only.

Also make the behavior of mutate() more consistent with that in dplyr:
1. Throw an error when the DataFrame being mutated has duplicated column names.
2. When the columns specified by the arguments contain duplicated names, the last column
of the same name takes effect.
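
The two rules above, together with the replace-existing-column behavior, can be sketched as a small name-resolution step. This is only an illustration of the requested semantics, written in Python for brevity, and not SparkR's implementation: the function name `resolve_mutate_columns`, the use of `None` to mark an unchanged column, and the choice to keep a replaced column in its original position are all assumptions made for the example.

```python
from collections import Counter


def resolve_mutate_columns(existing, assignments):
    """Decide which expression ends up behind each output column name.

    existing:    column names already in the DataFrame, in order.
    assignments: (name, expression) pairs passed to mutate, in order.
    Returns a list of (name, expression) pairs; expression is None for
    an existing column that is carried over unchanged (hypothetical marker).
    """
    # Rule 1: refuse to mutate a DataFrame whose own column names are duplicated.
    dupes = [name for name, count in Counter(existing).items() if count > 1]
    if dupes:
        raise ValueError(f"duplicated column names in DataFrame: {dupes}")

    # Rule 2: among the arguments, the last assignment for a name wins.
    # Reassigning a key keeps its first-seen position in the dict.
    chosen = {}
    for name, expr in assignments:
        chosen[name] = expr

    # Existing columns keep their position (an assumption of this sketch);
    # a matching assignment replaces the column instead of adding a new one.
    result = [(name, chosen.get(name)) for name in existing]

    # Names not present in the DataFrame are appended as new columns.
    result += [(name, expr) for name, expr in chosen.items() if name not in existing]
    return result
```

For example, calling it with existing columns `["a", "b"]` and the assignments `b = a + 1`, `c = a * 2`, `b = a + 2` leaves `a` untouched, replaces `b` with the last expression given for it, and appends `c` as a new column.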



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
