crunch-dev mailing list archives

From "Gabriel Reid (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (CRUNCH-376) Add aggregate(...) function to PCollection
Date Tue, 15 Apr 2014 07:43:15 GMT

    [ https://issues.apache.org/jira/browse/CRUNCH-376?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13969308#comment-13969308 ]

Gabriel Reid commented on CRUNCH-376:
-------------------------------------

{quote}one concern: the result of an Aggregator call isn't necessarily a single value{quote}

On this point, how about making the return type of PCollection#aggregate a PCollection, and
then adding a method like getFirst (or something like that) to PCollection? It's slightly
less convenient for the case where aggregate() returns a single value, but that way it will
work for every Aggregator, including ones that produce more than one value.
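
To make the shape of that proposal concrete, here is a rough, self-contained sketch in plain Java. It does not use Crunch at all: SimpleCollection, SimpleAggregator, and AggregateSketch are hypothetical stand-ins invented for illustration, and getFirst is only the placeholder name floated above, so none of this should be read as the actual PCollection or Aggregator API.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;
import java.util.NoSuchElementException;

// Hypothetical stand-in for an aggregator: note that results() may
// yield more than one value, which is the concern quoted above.
interface SimpleAggregator<T> {
    void reset();
    void update(T value);
    Iterable<T> results();
}

// Hypothetical in-memory stand-in for a collection; NOT Crunch's PCollection.
final class SimpleCollection<T> {
    private final List<T> values;

    SimpleCollection(List<T> values) {
        this.values = new ArrayList<>(values);
    }

    // Returns a collection rather than a single value, so aggregators
    // with several results are handled the same way as single-valued ones.
    SimpleCollection<T> aggregate(SimpleAggregator<T> aggregator) {
        aggregator.reset();
        for (T value : values) {
            aggregator.update(value);
        }
        List<T> out = new ArrayList<>();
        for (T result : aggregator.results()) {
            out.add(result);
        }
        return new SimpleCollection<>(out);
    }

    // Convenience accessor for the common single-result case.
    T getFirst() {
        if (values.isEmpty()) {
            throw new NoSuchElementException("collection is empty");
        }
        return values.get(0);
    }

    List<T> asList() {
        return new ArrayList<>(values);
    }
}

final class AggregateSketch {
    // Single-result aggregator: sums all integers.
    static SimpleAggregator<Integer> sum() {
        return new SimpleAggregator<Integer>() {
            private int total;
            public void reset() { total = 0; }
            public void update(Integer value) { total += value; }
            public Iterable<Integer> results() {
                return Collections.singletonList(total);
            }
        };
    }

    // Multi-result aggregator: keeps the two largest integers seen.
    static SimpleAggregator<Integer> topTwo() {
        return new SimpleAggregator<Integer>() {
            private final List<Integer> best = new ArrayList<>();
            public void reset() { best.clear(); }
            public void update(Integer value) {
                best.add(value);
                best.sort(Collections.reverseOrder());
                if (best.size() > 2) {
                    best.remove(best.size() - 1); // drop the smallest
                }
            }
            public Iterable<Integer> results() {
                return new ArrayList<>(best);
            }
        };
    }
}
```

With this shape, summing stays close to a one-liner, e.g. collection.aggregate(AggregateSketch.sum()).getFirst(), while an aggregator such as topTwo() simply hands back a collection with two elements.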

> Add aggregate(...) function to PCollection
> ------------------------------------------
>
>                 Key: CRUNCH-376
>                 URL: https://issues.apache.org/jira/browse/CRUNCH-376
>             Project: Crunch
>          Issue Type: New Feature
>          Components: Core
>            Reporter: Jason Gauci
>            Assignee: Josh Wills
>            Priority: Minor
>         Attachments: CRUNCH-376b.patch
>
>
> This patch adds an aggregate() function to PCollection so you can do things like sum
> a collection of integers in one line.
> https://github.com/MisterTea/crunch/commit/4bd8b7a13b893ddb89c11d657ba5347b92fe5a88



--
This message was sent by Atlassian JIRA
(v6.2#6252)
