kylin-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Zhong Yanghong (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (KYLIN-2867) split large fuzzy Key set
Date Mon, 11 Dec 2017 10:22:00 GMT

     [ https://issues.apache.org/jira/browse/KYLIN-2867?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Zhong Yanghong updated KYLIN-2867:
----------------------------------
    Attachment:     (was: APACHE-KYLIN-2867.patch)

> split large fuzzy Key set
> -------------------------
>
>                 Key: KYLIN-2867
>                 URL: https://issues.apache.org/jira/browse/KYLIN-2867
>             Project: Kylin
>          Issue Type: Improvement
>          Components: Query Engine
>            Reporter: Zhong Yanghong
>            Assignee: Zhong Yanghong
>
> Too many fuzzy keys in one scan may downgrade the performance. Currently if the number
of fuzzy keys exceeds a threshold, like 200, the scan will not use fuzzy key. However, this
may also downgrade the performance. 
> There are many cases in eBay that the total number of fuzzy keys is around 1000. To deal
with these cases, it's better to split the large fuzzy key set, like 5*200. Then for each
subset, assign a scan. This strategy will increase the number of scan to reduce the number
of fuzzy keys in each scan to improve the total performance.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Mime
View raw message