apex-dev mailing list archives

From "Sandeep Narayanaswami (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (APEXMALHAR-2094) Quantiles sketch operator
Date Mon, 23 May 2016 17:47:12 GMT

    [ https://issues.apache.org/jira/browse/APEXMALHAR-2094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15296714#comment-15296714 ]

Sandeep Narayanaswami commented on APEXMALHAR-2094:
---------------------------------------------------

[~thw] Thomas, I do intend to add more functionality. DataSketches includes algorithms such as
HyperLogLog and heavy hitters (frequent items), as well as a "theta sketch" framework that allows
set operations on multiple streams, and I want to build Apex operators for each of these.

By "module", do you mean a package like {{statistics}} or {{algo}} in the Malhar library?
If so, yes, it might make sense to move this operator from {{statistics}} into a {{sketches}}
package that will encapsulate all stream sketching algorithms. 

> Quantiles sketch operator
> -------------------------
>
>                 Key: APEXMALHAR-2094
>                 URL: https://issues.apache.org/jira/browse/APEXMALHAR-2094
>             Project: Apache Apex Malhar
>          Issue Type: New Feature
>            Reporter: Sandeep Narayanaswami
>            Assignee: Sandeep Narayanaswami
>            Priority: Minor
>
> An operator that "sketches" in an online fashion the probability distribution of an input
> (numeric) data stream, enabling computation of quantiles and cumulative distribution functions.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
