spark-issues mailing list archives

From "Jerry Z (JIRA)" <>
Subject [jira] [Commented] (SPARK-9744) Add RDD method to map with lag and lead
Date Fri, 07 Aug 2015 19:18:47 GMT


Jerry Z commented on SPARK-9744:

Fixed! Sorry, I didn't realize you were referring to the title. I think this would be a
handy feature to have for performance's sake. It would also save me a lot of typing and keep
my code from wrapping around.

On a semi-related note, why does cogroup need an iterator of the class? join() doesn't.

> Add RDD method to map with lag and lead
> ---------------------------------------
>                 Key: SPARK-9744
>                 URL:
>             Project: Spark
>          Issue Type: Wish
>            Reporter: Jerry Z
>            Priority: Minor
> To avoid zipping with index and then doing numerous maps and joins, it would be useful to
> have a single method call, similar to map, that takes two additional parameters: (1) a list
> of offsets (negative for lag, 0 for the current element, positive for lead) and (2) a default
> value. The other difference from map is that the supplied function takes an argument of
> List<T> and not just T.
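
The requested behaviour can be sketched in plain Python, with an ordinary list standing in for an RDD. This is only an illustration of the semantics described above, not Spark code: the name map_with_offsets and its parameter order are hypothetical, and it assumes the default value is what fills any offset that falls before the first or after the last element.

```python
from typing import Callable, List, Sequence, TypeVar

T = TypeVar("T")
U = TypeVar("U")


def map_with_offsets(
    items: Sequence[T],
    offsets: Sequence[int],
    default: T,
    fn: Callable[[List[T]], U],
) -> List[U]:
    """Hypothetical helper: call fn once per element with a window of neighbours.

    For each position i, the window holds items[i + off] for every off in
    offsets, in the order the offsets were given. Assumption: positions
    outside the sequence are replaced by `default`.
    """
    n = len(items)
    results = []
    for i in range(n):
        window = [
            items[i + off] if 0 <= i + off < n else default
            for off in offsets
        ]
        results.append(fn(window))
    return results


# Difference between each value and its predecessor (lag of 1),
# using 0 where there is no predecessor.
values = [10, 13, 19, 20]
deltas = map_with_offsets(values, [-1, 0], 0, lambda w: w[1] - w[0])
print(deltas)  # [10, 3, 6, 1]
```

On a real RDD the same result currently takes a zipWithIndex followed by shifted keys and joins, which is the boilerplate the issue asks to avoid.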

This message was sent by Atlassian JIRA
