Return-Path: X-Original-To: apmail-commons-dev-archive@www.apache.org Delivered-To: apmail-commons-dev-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 2552F115C2 for ; Wed, 25 Jun 2014 20:10:03 +0000 (UTC) Received: (qmail 71665 invoked by uid 500); 25 Jun 2014 20:10:02 -0000 Delivered-To: apmail-commons-dev-archive@commons.apache.org Received: (qmail 71533 invoked by uid 500); 25 Jun 2014 20:10:02 -0000 Mailing-List: contact dev-help@commons.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: "Commons Developers List" Delivered-To: mailing list dev@commons.apache.org Received: (qmail 71521 invoked by uid 99); 25 Jun 2014 20:10:02 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 25 Jun 2014 20:10:02 +0000 X-ASF-Spam-Status: No, hits=-0.7 required=5.0 tests=RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of phil.steitz@gmail.com designates 209.85.160.50 as permitted sender) Received: from [209.85.160.50] (HELO mail-pb0-f50.google.com) (209.85.160.50) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 25 Jun 2014 20:09:56 +0000 Received: by mail-pb0-f50.google.com with SMTP id rp16so2133729pbb.23 for ; Wed, 25 Jun 2014 13:09:31 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=from:content-type:content-transfer-encoding:mime-version:subject :message-id:date:references:in-reply-to:to; bh=x6WHbj0QZ+Ro5pWivoTdjTq8sJpLjwJ8ZFjTuF6bXMA=; b=uEABz7/4Zx9/cKV/9qclrZS3px93GSv6i79zQ8Kvm4GFaUxyERf0AfTuxw6dyQ4j8r LjOxSN2SWVWOtileZd+yHjLOxuhHGNP1cu23z5kMjzDk0A0+SM+jJMRSyvDS8Xp1TdEc yhKmiH2PAgeEi7K2uNs9clQijyeL5u9+jFCtsOwU11DPvXOonD8MVtmQSyVwH1h2yCN5 2uAlH+BEGrY6PEkRlhSc1mYNEvwXh7AY5T5kC1FyodaFL921FB6dxggl/6cHs0Rnvdtl fj2cCcVPutUqrUhcYARIz9r+TCc2NfsAqphRjMEqWOGKyIAY5l89rMFAyFAmv6gEDHPv QRQg== X-Received: by 10.69.10.164 with SMTP id eb4mr14830543pbd.35.1403726971777; Wed, 25 Jun 2014 13:09:31 -0700 (PDT) Received: from [10.214.233.120] (mobile-166-137-185-253.mycingular.net. [166.137.185.253]) by mx.google.com with ESMTPSA id qf10sm6574213pbc.23.2014.06.25.13.09.29 for (version=TLSv1 cipher=ECDHE-RSA-RC4-SHA bits=128/128); Wed, 25 Jun 2014 13:09:29 -0700 (PDT) From: Phil Steitz Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable Mime-Version: 1.0 (1.0) Subject: Re: [jira] [Commented] (MATH-1131) Kolmogorov-Smirnov Tests takes 'forever' on 10,000 item dataset Message-Id: Date: Wed, 25 Jun 2014 13:09:28 -0700 References: In-Reply-To: To: Commons Developers List X-Mailer: iPhone Mail (11D201) X-Virus-Checked: Checked by ClamAV on apache.org Sorry for responding to the list but I have only mobile atm . IIRC the roun= dedK method should not be creating matrices of BigFractions, but rather usin= g doubles.=20 > On Jun 25, 2014, at 11:16 AM, "Thomas Neidhart (JIRA)" w= rote: >=20 >=20 > [ https://issues.apache.org/jira/browse/MATH-1131?page=3Dcom.atlassian.= jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=3D140438= 57#comment-14043857 ]=20 >=20 > Thomas Neidhart commented on MATH-1131: > --------------------------------------- >=20 > I did briefly debug the example and indeed the calculation hangs when call= ing roundedK, or more precisely in createH. >=20 > There powers of BigFraction objects are created with really big numerators= and denominators. Some of the calculations later on take then forever becau= se of this, e.g. when internally calculating the gcd. >=20 > Looking at the implementation from the referenced paper, there the H value= s are computed with double precision. Was there a specific reason to use Big= Fraction in our implementation? Is there a specific need for that level of a= ccuracy for the Kolmogorov-Smirnov Test? The other inference tests do not se= em to be so stringent. >=20 > It looks like there is no easy way to limit the maxDenominator when callin= g multiply() as it is possible when creating a BigFraction object. >=20 >=20 >> Kolmogorov-Smirnov Tests takes 'forever' on 10,000 item dataset >> --------------------------------------------------------------- >>=20 >> Key: MATH-1131 >> URL: https://issues.apache.org/jira/browse/MATH-1131 >> Project: Commons Math >> Issue Type: Bug >> Affects Versions: 3.3 >> Environment: Java 8 >> Reporter: Schalk W. Cronj=C3=A9 >> Attachments: 1.txt, ReproduceKsIssue.groovy, ReproduceKsIssue.java= >>=20 >>=20 >> I have code simplified to the following: >> KolmogorovSmirnovTest kst =3D new KolmogorovSmirnovTest(); >> NormalDistribution nd =3D new NormalDistribution(mean,stddev); >> kst.kolmogorovSmirnovTest(nd,dataset) >> I find that for my dataset of 10,000 items, the call to kolmogorovSmirnov= Test takes 'forever'. It has not returned after nearly 15minutes and in one m= y my tests has gone over 150MB in memory usage. >=20 >=20 >=20 > -- > This message was sent by Atlassian JIRA > (v6.2#6252) --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscribe@commons.apache.org For additional commands, e-mail: dev-help@commons.apache.org