incubator-crunch-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Dave Beech <d...@paraliatech.com>
Subject Re: Distinct for PTables
Date Thu, 24 Jan 2013 22:58:58 GMT
Great - thanks Josh


On 24 January 2013 22:56, Josh Wills <jwills@cloudera.com> wrote:

> It's a good idea, for lots of use cases. I created
> https://issues.apache.org/jira/browse/CRUNCH-150 to track it and posted a
> patch.
>
> J
>
>
> On Thu, Jan 24, 2013 at 2:32 PM, Dave Beech <dave@paraliatech.com> wrote:
>
>> Hi all,
>>
>> What's the right way to apply "distinct" to a PTable?
>>
>> Calling Distinct.distinct works, but returns the table back to you as a
>> PCollection<Pair<K,V>> instead. Is it possible to coerce this back to
a
>> PTable type?
>>
>> Thanks,
>> Dave
>>
>
>
>
> --
> Director of Data Science
> Cloudera <http://www.cloudera.com>
> Twitter: @josh_wills <http://twitter.com/josh_wills>
>

Mime
View raw message