hivemall-issues mailing list archives

From "Makoto Yui (JIRA)" <j...@apache.org>
Subject [jira] [Closed] (HIVEMALL-18) Support approx_distinct_count UDAF using HyperLogLog
Date Tue, 21 Nov 2017 12:52:00 GMT

     [ https://issues.apache.org/jira/browse/HIVEMALL-18?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Makoto Yui closed HIVEMALL-18.
------------------------------
    Resolution: Fixed

> Support approx_distinct_count UDAF using HyperLogLog
> ----------------------------------------------------
>
>                 Key: HIVEMALL-18
>                 URL: https://issues.apache.org/jira/browse/HIVEMALL-18
>             Project: Hivemall
>          Issue Type: Sub-task
>            Reporter: Makoto Yui
>            Assignee: Makoto Yui
>            Priority: Minor
>             Fix For: 0.5.0
>
>
> https://github.com/addthis/stream-lib could be used as the underlying library.
> http://www.slideshare.net/bzamecnik/hyperloglog-in-hive-how-to-count-sheep-efficiently
> https://databricks.com/blog/2016/05/19/approximate-algorithms-in-apache-spark-hyperloglog-and-quantiles.html
> Several HLL implementations already exist as Hive UDAFs:
> https://github.com/MLnick/hive-udf/wiki
> https://github.com/klout/brickhouse



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)
