kylin-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Chao Long (JIRA)" <j...@apache.org>
Subject [jira] [Comment Edited] (KYLIN-3453) Improve cube size estimation for TOPN, COUNT DISTINCT
Date Tue, 24 Jul 2018 10:12:00 GMT

    [ https://issues.apache.org/jira/browse/KYLIN-3453?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16553990#comment-16553990
] 

Chao Long edited comment on KYLIN-3453 at 7/24/18 10:11 AM:
------------------------------------------------------------

I have done some tests on the enhancement, and the results are as belows:

1, test sample cube

     cube real size: 

     !image-2018-07-24-16-30-50-804.png!      

    before enhancement

      !image-2018-07-24-16-33-43-231.png!

    after enhancement

     !image-2018-07-24-16-29-07-359.png!      

 

2,ssb dataSet

   cube real size

     !image-2018-07-24-16-37-09-199.png!

   before enhancement

     !image-2018-07-24-17-11-27-829.png!      

   after enhancement

     !image-2018-07-24-17-12-25-880.png!

And we can find that the estimatation of Cube size is more accurate after enhancement.


was (Author: wayne0101):
After the enhancement, I did some tests.

1, test sample cube

     cube real size: 

    !image-2018-07-24-16-30-50-804.png!      

    before enhancement

     !image-2018-07-24-16-33-43-231.png!

    after enhancement

     !image-2018-07-24-16-29-07-359.png!      

 

2,ssb dataSet

   cube real size

    !image-2018-07-24-16-37-09-199.png!

   before enhancement

    !image-2018-07-24-17-11-27-829.png!      

   after enhancement

    !image-2018-07-24-17-12-25-880.png!

from the test result, we can find that after the enhancement, the estimates of cube size are
more accurate.

> Improve cube size estimation for TOPN, COUNT DISTINCT
> -----------------------------------------------------
>
>                 Key: KYLIN-3453
>                 URL: https://issues.apache.org/jira/browse/KYLIN-3453
>             Project: Kylin
>          Issue Type: Improvement
>            Reporter: Chao Long
>            Assignee: Chao Long
>            Priority: Major
>             Fix For: v2.5.0
>
>         Attachments: image-2018-07-24-16-29-07-359.png, image-2018-07-24-16-30-50-804.png,
image-2018-07-24-16-33-43-231.png, image-2018-07-24-16-37-09-199.png, image-2018-07-24-17-11-26-283.png,
image-2018-07-24-17-11-27-829.png, image-2018-07-24-17-12-25-880.png
>
>
> Currently, Kylin has poor cube size estimation for TOPN, COUNT DISTINCT. We should improve
it, then we can get a reasonable split num when cube building. 



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Mime
View raw message