kylin-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From yu feng <olaptes...@gmail.com>
Subject Re: query return error result.
Date Mon, 09 Oct 2017 02:37:42 GMT
the cube is using hllc15, we are tracing the code and try to find the
reason.

2017-10-08 14:52 GMT+08:00 Li Yang <liyang@apache.org>:

> Interesting... is it HLL count distinct or bitmap count distinct?
>
> On Wed, Sep 27, 2017 at 11:19 AM, yu feng <olaptestyu@gmail.com> wrote:
>
>> I add some log and find data from hbase is incorrect.
>>
>> 2017-09-27 11:17 GMT+08:00 yu feng <olaptestyu@gmail.com>:
>>
>>> I have a cube like this :
>>> dimensions : source_type, source_id, name, dt
>>> measures:count(distinct uid), count(1) , count(distinct buyer)
>>>
>>> I run the query :
>>>
>>> select source_type, source_id, name,
>>>     count(distinct uid), count(uid) as cnum, count(distinct buyer) as
>>> buyerNum,
>>>     count(buyer) as bnum
>>> from
>>>     table_name
>>> where
>>>     dt between '2017-06-01' and '2017-09-18'
>>>     and source_id is not null
>>>     and source_type is not null
>>> group by
>>>     source_type, source_id, name
>>> order by buyerNum desc limit 1 offset 0
>>>
>>> return :
>>>
>>> mv
>>> 423031
>>> 起点‧终站
>>> 193794
>>> 92
>>> 42043
>>> 92
>>>
>>>
>>>
>>>
>>>
>>> obviously, it is error result, I query the sourceid like this:
>>>
>>> select source_type, source_id, name,
>>>     count(distinct uid), count(uid) as cnum, count(distinct buyer) as
>>> buyerNum,
>>>     count(buyer) as bnum
>>> from
>>>     vip_buying_funnel_cube_view
>>> where
>>>     dt between '2017-06-01' and '2017-09-18'
>>>     and source_id is not null
>>>     and source_type is not null
>>>     and source_id = '423031'
>>> group by
>>>     source_type, source_id, name
>>> order by buyerNum desc limit 1 offset 0
>>>
>>> the result is corrent :
>>>
>>> mv
>>> 423031
>>> 起点‧终站
>>> 77
>>> 92
>>> 11
>>> 92
>>>
>>
>>
>

Mime
View raw message