flink-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Sihua Zhou (JIRA)" <j...@apache.org>
Subject [jira] [Created] (FLINK-9474) Introduce an approximate version of "count distinct"
Date Wed, 30 May 2018 05:35:00 GMT
Sihua Zhou created FLINK-9474:
---------------------------------

             Summary: Introduce an approximate version of "count distinct"
                 Key: FLINK-9474
                 URL: https://issues.apache.org/jira/browse/FLINK-9474
             Project: Flink
          Issue Type: New Feature
          Components: Table API &amp; SQL
    Affects Versions: 1.5.0
            Reporter: Sihua Zhou
            Assignee: Sihua Zhou


We can implement an approximate version of "count distinct" base on the "Elastic Bloom Filter",
It could be very fast because we don't need to query the state anymore, its accuracy should
could be configurable. e.g 95%, 98%.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Mime
View raw message