apex-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Sandeep Narayanaswami (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (APEXMALHAR-2094) Quantiles sketch operator
Date Mon, 23 May 2016 17:47:12 GMT

    [ https://issues.apache.org/jira/browse/APEXMALHAR-2094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15296714#comment-15296714

Sandeep Narayanaswami commented on APEXMALHAR-2094:

[~thw] Thomas, I do intend to add more functionality. DataSketches includes algorithms like
hyperloglog, and heavy hitters / frequent items and a "theta sketch" framework that will allow
set operations on multiple streams, and I want to build Apex operators for each of these.

By "module", do you mean a package like {{statistics}} or {{algo}} in the Malhar library?
If so, yes, it might make sense to move this operator from {{statistics}} into a {{sketches}}
package that will encapsulate all stream sketching algorithms. 

> Quantiles sketch operator
> -------------------------
>                 Key: APEXMALHAR-2094
>                 URL: https://issues.apache.org/jira/browse/APEXMALHAR-2094
>             Project: Apache Apex Malhar
>          Issue Type: New Feature
>            Reporter: Sandeep Narayanaswami
>            Assignee: Sandeep Narayanaswami
>            Priority: Minor
> An operator that "sketches" in an online fashion the probability distribution of an input
(numeric) data stream, enabling computation of quantiles and cumulative distribution functions.

This message was sent by Atlassian JIRA

View raw message