hive-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Viral Bajaria <>
Subject Re: collect_set does not remove duplicate
Date Mon, 08 Sep 2014 07:55:54 GMT
It will be helpful if you paste some sample data to repro. I have used
collect_set and it works as documented for me.


On Sun, Sep 7, 2014 at 10:39 AM, Shushant Arora <>

> While group by, if I do collect_set on some other column , documentation
> says it will return Array of that column after removing duplicates, but its
> not doing dedup?Is it  expected?

View raw message