spark-reviews mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From GitBox <...@apache.org>
Subject [GitHub] [spark] HyukjinKwon commented on a change in pull request #25939: [SPARK-27463][PYTHON][FOLLOW-UP] Miscellaneous documentation and code cleanup of cogroup pandas UDF
Date Fri, 27 Sep 2019 00:41:11 GMT
HyukjinKwon commented on a change in pull request #25939: [SPARK-27463][PYTHON][FOLLOW-UP]
Miscellaneous documentation and code cleanup of cogroup pandas UDF
URL: https://github.com/apache/spark/pull/25939#discussion_r328877165
 
 

 ##########
 File path: python/pyspark/sql/cogroup.py
 ##########
 @@ -79,6 +81,27 @@ def apply(self, udf):
         |20000102|  2|4.0|  y|
         +--------+---+---+---+
 
+        Alternatively, the user can define a function that takes three arguments.  In this
case,
+        the grouping key(s) will be passed as the first argument and the data will be passed
as the
+        second and third arguments.  The grouping key(s) will be  passed as a tuple of numpy
data
 
 Review comment:
   "be  pa" -> "be pa"

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
users@infra.apache.org


With regards,
Apache Git Services

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


Mime
View raw message