Return-Path: Delivered-To: apmail-jakarta-commons-dev-archive@www.apache.org Received: (qmail 59127 invoked from network); 24 Aug 2005 14:05:10 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (209.237.227.199) by minotaur.apache.org with SMTP; 24 Aug 2005 14:05:10 -0000 Received: (qmail 48498 invoked by uid 500); 24 Aug 2005 14:05:04 -0000 Delivered-To: apmail-jakarta-commons-dev-archive@jakarta.apache.org Received: (qmail 48445 invoked by uid 500); 24 Aug 2005 14:05:04 -0000 Mailing-List: contact commons-dev-help@jakarta.apache.org; run by ezmlm Precedence: bulk List-Unsubscribe: List-Help: List-Post: List-Id: "Jakarta Commons Developers List" Reply-To: "Jakarta Commons Developers List" Delivered-To: mailing list commons-dev@jakarta.apache.org Received: (qmail 48280 invoked by uid 99); 24 Aug 2005 14:05:04 -0000 Received: from asf.osuosl.org (HELO asf.osuosl.org) (140.211.166.49) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 24 Aug 2005 07:05:03 -0700 X-ASF-Spam-Status: No, hits=0.0 required=10.0 tests=RCVD_BY_IP,SPF_HELO_PASS,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (asf.osuosl.org: domain of john.gant@gmail.com designates 64.233.184.206 as permitted sender) Received: from [64.233.184.206] (HELO wproxy.gmail.com) (64.233.184.206) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 24 Aug 2005 07:05:18 -0700 Received: by wproxy.gmail.com with SMTP id 69so113140wra for ; Wed, 24 Aug 2005 07:04:57 -0700 (PDT) DomainKey-Signature: a=rsa-sha1; q=dns; c=nofws; s=beta; d=gmail.com; h=received:message-id:date:from:to:subject:in-reply-to:mime-version:content-type:content-transfer-encoding:content-disposition:references; b=tcaKqIbJiPhi7XoxT69fR7STldmzY6fXGit1bVH1ahgaP6fCkbrruR2r+l1V1yhFt6bX4B0IkKaz+Th72LVzeLHC94a4lJ2SwAuD7P95rQlqWbuCtuwkVz2pGoxrDWezJasN2dSQZxeY0HxMKF0qjIBeIL51BJLb03tUfxanLe0= Received: by 10.54.50.74 with SMTP id x74mr644626wrx; Wed, 24 Aug 2005 07:04:57 -0700 (PDT) Received: by 10.54.99.7 with HTTP; Wed, 24 Aug 2005 07:04:56 -0700 (PDT) Message-ID: Date: Wed, 24 Aug 2005 10:04:56 -0400 From: John Gant To: Jakarta Commons Developers List Subject: Re: DO NOT REPLY [Bug 36331] - [math] spearman rank cross correlation In-Reply-To: <20050824134330.5522A126@ajax.apache.org> Mime-Version: 1.0 Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable Content-Disposition: inline References: <20050824134330.5522A126@ajax.apache.org> X-Virus-Checked: Checked by ClamAV on apache.org X-Spam-Rating: minotaur.apache.org 1.6.2 0/1000/N Specifically testTwo() in http://issues.apache.org/bugzilla/attachment.cgi?id=3D16172 takes care of data with equal value (ie equal rank), is this the type of situation to which you are referring? Yes I agree, we should implement routines to sort in more diverse ways, but for right now I depend upon Arrays.sort() to perform the sorting. Thanks, John On 8/24/05, bugzilla@apache.org wrote: > DO NOT REPLY TO THIS EMAIL, BUT PLEASE POST YOUR BUG=B7 > RELATED COMMENTS THROUGH THE WEB INTERFACE AVAILABLE AT > . > ANY REPLY MADE TO THIS MESSAGE WILL NOT BE COLLECTED AND=B7 > INSERTED IN THE BUG DATABASE. >=20 > http://issues.apache.org/bugzilla/show_bug.cgi?id=3D36331 >=20 >=20 >=20 >=20 >=20 > ------- Additional Comments From brent@worden.org 2005-08-24 15:43 -----= -- > I noticed you implementation does not take into account ties in the data.= Ties > can effect the correlation coefficeint to a great degree. >=20 > A nice approach to address ties is to rank the data adjusting the ranks o= f tied > elements and then compute the Pearson's correlation coefficient on the ra= nkings. >=20 > See http://web.uccs.edu/lbecker/SPSS/ctabs2.htm#5E.%20Spearman%20Correlat= ion,% > 20rs for a brief explaination. >=20 > If we get into adding more non-parametric routines, I for one would like = to see > some general, ranking utililies such as taking an array of data and retur= ning > the ranking array. The ranking could be driven by a tie ranking policy f= or > dealing with ties in the data. The default policy would be to use the me= an > rank for ties. Other policies could be to omit the data, use the highest= rank > or, use the lowest rank. >=20 >=20 > -- > Configure bugmail: http://issues.apache.org/bugzilla/userprefs.cgi?tab=3D= email > ------- You are receiving this mail because: ------- > You are the assignee for the bug, or are watching the assignee. >=20 > --------------------------------------------------------------------- > To unsubscribe, e-mail: commons-dev-unsubscribe@jakarta.apache.org > For additional commands, e-mail: commons-dev-help@jakarta.apache.org >=20 >=20 --=20 John Gant --------------------------------------------------------------------- To unsubscribe, e-mail: commons-dev-unsubscribe@jakarta.apache.org For additional commands, e-mail: commons-dev-help@jakarta.apache.org