Return-Path: X-Original-To: apmail-commons-issues-archive@minotaur.apache.org Delivered-To: apmail-commons-issues-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id C976017B9A for ; Wed, 22 Oct 2014 19:52:34 +0000 (UTC) Received: (qmail 82745 invoked by uid 500); 22 Oct 2014 19:52:34 -0000 Delivered-To: apmail-commons-issues-archive@commons.apache.org Received: (qmail 82643 invoked by uid 500); 22 Oct 2014 19:52:34 -0000 Mailing-List: contact issues-help@commons.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: issues@commons.apache.org Delivered-To: mailing list issues@commons.apache.org Received: (qmail 82630 invoked by uid 99); 22 Oct 2014 19:52:34 -0000 Received: from arcas.apache.org (HELO arcas.apache.org) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 22 Oct 2014 19:52:34 +0000 Date: Wed, 22 Oct 2014 19:52:34 +0000 (UTC) From: "Otmar Ertl (JIRA)" To: issues@commons.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Commented] (MATH-1159) Add quartiles to SummaryStatistics MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/MATH-1159?page=3Dcom.atlassian.= jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=3D14180= 422#comment-14180422 ]=20 Otmar Ertl commented on MATH-1159: ---------------------------------- If multiple quantile values need to be computed simultaneously (e.g lower q= uartile, median, and upper quartile), it would make sense to use the P=C2= =B2 algorithm for histograms that uses more than 5 marker positions as desc= ribed in the original paper. Depending on the desired quantile values diffe= rent sets of marker positions could be used. Using a PSquarePercentile obje= ct for each of the desired quantiles has some overhead. For example, each o= f them store the minimum and the maximum value. The 2nd and 4th marker posi= tions used for the median correspond to the 3rd marker positions of the low= er and upper quartiles, respectively. In summary, a P=C2=B2 algorithm imple= mentation that calculates all desired quantiles simultaneously would be muc= h more efficient. > Add quartiles to SummaryStatistics > ---------------------------------- > > Key: MATH-1159 > URL: https://issues.apache.org/jira/browse/MATH-1159 > Project: Commons Math > Issue Type: Improvement > Affects Versions: 3.3 > Reporter: Phil Steitz > > Using PSquarePercentile, we can add quartile computation to SummaryStatis= tics. Since maintaining quartiles will add some overhead, implementation s= hould allow the feature to be turned off via some kind of constructor flag.= This does open the can of worms regarding turning on / off other stats, w= hich is probably a good thing to think about as the implementation of this = feature is developed. -- This message was sent by Atlassian JIRA (v6.3.4#6332)