mahout-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Hudson (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (MAHOUT-1368) Convert OnlineSummarizer to use the new TDigest
Date Thu, 05 Dec 2013 05:20:36 GMT

    [ https://issues.apache.org/jira/browse/MAHOUT-1368?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13839835#comment-13839835
] 

Hudson commented on MAHOUT-1368:
--------------------------------

SUCCESS: Integrated in Mahout-Quality #2352 (See [https://builds.apache.org/job/Mahout-Quality/2352/])
MAHOUT-1368: Convert OnlineSummarizer to use the new TDigest (smarthi: rev 1547911)
* /mahout/trunk/CHANGELOG
* /mahout/trunk/examples/src/main/java/org/apache/mahout/clustering/streaming/tools/ClusterQualitySummarizer.java
* /mahout/trunk/math/src/main/java/org/apache/mahout/math/stats/OnlineSummarizer.java
* /mahout/trunk/math/src/test/java/org/apache/mahout/math/stats/OnlineSummarizerTest.java


> Convert OnlineSummarizer to use the new TDigest
> -----------------------------------------------
>
>                 Key: MAHOUT-1368
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-1368
>             Project: Mahout
>          Issue Type: Bug
>            Reporter: Ted Dunning
>            Assignee: Suneel Marthi
>             Fix For: 0.9
>
>         Attachments: MAHOUT-1368.patch
>
>
> The new TDigest provides better accuracy for quartile estimation as well as producing
any other quantile you might like.  The current quartile estimation of the OnlineSummarizer
fails for highly skewed distributions and can't really be extended to provide other quantiles.
 The TDigest handles all of this.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

Mime
View raw message