hive-user mailing list archives

From Mafish Liu <maf...@gmail.com>
Subject Re: Why no two aggregations can have different DISTINCT columns ?
Date Thu, 25 Feb 2010 10:11:53 GMT
2010/2/25 Zheng Shao <zshao9@gmail.com>:
> Yes definitely. Do you want to open a JIRA and post a patch?
> Please link the new JIRA to the other 2 JIRAs that were mentioned in the
> same email thread.
I'll open a JIRA.
The patch will be posted once the code and documentation have been put in order.

> Zheng
>
> On Thu, Feb 25, 2010 at 1:16 AM, Mafish Liu <mafish@gmail.com> wrote:
>> Hive does not support multi-distinct in one query.
>>
>> We have implemented multi-distinct based on Hive 0.4.2rc to meet our own needs.
>> We don't know whether the Hive project is interested in this feature.
>>
>> 2010/2/25 Jeff Zhang <zjffdu@gmail.com>:
>>>
>>> Hi all,
>>>
>>> I read the Hive tutorial, and it says that "no two aggregations can have
>>> different DISTINCT columns". Could anyone tell me the reason? Will the
>>> DISTINCT aggregations in the following query be translated into a
>>> map-reduce job, or are they computed locally?
>>>
>>>     INSERT OVERWRITE TABLE pv_gender_agg
>>>     SELECT pv_users.gender, count(DISTINCT pv_users.userid), count(DISTINCT pv_users.ip)
>>>     FROM pv_users
>>>     GROUP BY pv_users.gender;
>>>
>>> --
>>> Best Regards
>>>
>>> Jeff Zhang
>>>
>>
>>
>>
>> --
>> Mafish@gmail.com
>>
>
>
>
> --
> Yours,
> Zheng
>



-- 
Mafish@gmail.com
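
For readers of the archive, here is a minimal sketch of what the query in the thread is asking for, and of one common way to express the same result with only one DISTINCT column per aggregation query. It runs on SQLite through Python's standard sqlite3 module purely to illustrate the semantics; it is not Hive, it is not the patch mentioned above, and whether a given Hive release accepts the rewritten form is an assumption to check against that release. The function name distinct_counts and the sample rows are hypothetical; the table and column names come from the quoted query.

```python
import sqlite3

# Hypothetical sample data: (gender, userid, ip)
SAMPLE_ROWS = [
    ("F", 1, "10.0.0.1"),
    ("F", 1, "10.0.0.2"),
    ("F", 2, "10.0.0.2"),
    ("M", 3, "10.0.0.3"),
    ("M", 3, "10.0.0.3"),
]


def distinct_counts(rows):
    """Return (multi, split): the two-DISTINCT query and its one-DISTINCT rewrite."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE pv_users (gender TEXT, userid INTEGER, ip TEXT)")
    conn.executemany("INSERT INTO pv_users VALUES (?, ?, ?)", rows)

    # The shape of the query from the thread: two aggregations with
    # different DISTINCT columns in a single GROUP BY.
    multi = conn.execute(
        """
        SELECT gender, count(DISTINCT userid), count(DISTINCT ip)
        FROM pv_users
        GROUP BY gender
        ORDER BY gender
        """
    ).fetchall()

    # Rewrite: each subquery uses a single DISTINCT column, and the two
    # results are joined on the grouping key. Rows with a NULL gender
    # would be dropped by this join; the sample data has none.
    split = conn.execute(
        """
        SELECT u.gender, u.n_users, i.n_ips
        FROM (SELECT gender, count(DISTINCT userid) AS n_users
              FROM pv_users GROUP BY gender) u
        JOIN (SELECT gender, count(DISTINCT ip) AS n_ips
              FROM pv_users GROUP BY gender) i
          ON u.gender = i.gender
        ORDER BY u.gender
        """
    ).fetchall()

    conn.close()
    return multi, split


if __name__ == "__main__":
    multi, split = distinct_counts(SAMPLE_ROWS)
    print(multi)
    print(split)
```

The rewrite scans pv_users once per DISTINCT column and adds a join, so it trades extra work for staying within a one-DISTINCT-per-query restriction.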
