impala-reviews mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Gabor Kaszab (Code Review)" <ger...@cloudera.org>
Subject [Impala-ASF-CR] IMPALA-10019: Implement ds kll pmf() function
Date Thu, 13 Aug 2020 16:34:38 GMT
Gabor Kaszab has uploaded this change for review. ( http://gerrit.cloudera.org:8080/16336


Change subject: IMPALA-10019: Implement ds_kll_pmf() function
......................................................................

IMPALA-10019: Implement ds_kll_pmf() function

This is the support for Probabilistic Mass Function (PMF) from Apache
DataSketches KLL algorithm collection. It receives a serialized KLL
sketch and one or more double values to represent ranges in the
sketched values.
E.g. [1, 5, 10] will mean the following ranges:
(-inf, 1), [1, 5), [5, 10), [10, +inf)
Returns a comma separated string where each value in the string is a
number in the range of [0,1] and shows that what percentage of the
data is in the particular ranges.

Example:
select ds_kll_pmf(ds_kll_sketch(float_col), 2, 4, 10) from alltypes;
+------------------------------------------------+
| ds_kll_pmf(ds_kll_sketch(float_col), 2, 4, 10) |
+------------------------------------------------+
| 0.202192,0.199452,0.598356,0                   |
+------------------------------------------------+

Change-Id: I222402f2dce2f49ab2b3f6e81a709da5539293ba
---
M be/src/exprs/datasketches-common.cc
M be/src/exprs/datasketches-common.h
M be/src/exprs/datasketches-functions-ir.cc
M be/src/exprs/datasketches-functions.h
M common/function-registry/impala_functions.py
M testdata/workloads/functional-query/queries/QueryTest/datasketches-kll.test
6 files changed, 164 insertions(+), 15 deletions(-)



  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/36/16336/1
-- 
To view, visit http://gerrit.cloudera.org:8080/16336
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newchange
Gerrit-Change-Id: I222402f2dce2f49ab2b3f6e81a709da5539293ba
Gerrit-Change-Number: 16336
Gerrit-PatchSet: 1
Gerrit-Owner: Gabor Kaszab <gaborkaszab@cloudera.com>

Mime
  • Unnamed multipart/alternative (inline, 8-Bit, 0 bytes)
View raw message