From: Phil Steitz
To: Jakarta Commons Developers List
Subject: Re: [math] Re: commons math
Date: Mon, 15 Aug 2005 18:03:50 -0700
Message-ID: <8a81b4af05081518033d60365d@mail.gmail.com>
References: <8a81b4af05081315277bdb5ac5@mail.gmail.com>

On 8/13/05, John Gant wrote:
> Algorithms:
>
> - Feature reduction
>   a. Basic cross correlation, including both Spearman and Pearson cross
>      correlation algorithms.
>   b. Principal Component Analysis.
>   c. Entropy Based reduction.

All good, IMHO, esp. a. and b.

>
> I currently have a. and b. finished but need to brush up on my JUnit skills :)
>
> - Difference Measures
> I had in mind a difference engine, basically an engine that handles
> all difference operations. This difference engine could, in the
> constructor or using set methods, take an instance of one of the
> following difference methods.
>
>   a. Euclidean distance
>   b. city-block distance

Can you describe a little more exactly what you mean here and how it
would be used?

>
> - Pattern Discovery
>   a. KMotif Discovery Algorithm.

I am not familiar with this algorithm or its uses. Can you provide
some more info and references?

>
> Again I have this algorithm completed, just need to boundary test everything.
>
> - Clustering Algorithms
>   a. K-means Algorithm.

+1

>
> I'd like to discuss the architecture of the k-means, I have a few
> ideas and would like a little feedback. I know this is just a small
> subset of the available algorithms, but this seems to be a good start.

Yes. Ask away...

>
> Thanks,
> John
>
> On 8/13/05, Phil Steitz wrote:
> > John,
> >
> > Sounds great!
> > Extending the stat package to include some data mining
> > capabilities would be a good and useful addition to commons-math,
> > IMHO. To get started, the first thing to do is to read the
> > developer's guide
> > (http://jakarta.apache.org/commons/math/developers.html), which will
> > point you to the general Apache references and go over some IP
> > stuff that we have to worry about in [math].
> >
> > Then either here or on the Wiki (see the guide for a link), post a
> > brief description of the kinds of mining algorithms that you are
> > interested in developing and we can get this going. On this list, pls
> > begin the subject line of all [math] messages with [math].
> >
> > Thanks in advance for your contributions!
> >
> > Phil
> >
> > On 8/13/05, John Gant wrote:
> > > Hello,
> > > I am currently a graduate student in Computer Science and Computer
> > > Engineering at the University of Louisville, Kentucky. First let me
> > > congratulate the group of developers who commit and architect for
> > > Apache Commons. I have used many of the libraries and they are all of
> > > excellent quality (but I guess you already know that :)). I am
> > > interested in contributing to open source software and have interests
> > > that are in the domain of statistics with a focus in data mining.
> > > After writing many algorithms for classes, and asking an Apache
> > > contributor if any of this would be needed elsewhere, he told me to
> > > propose something to the dev list. So here it goes, I would like to
> > > help start a data mining section of commons math and advance the
> > > existing statistical libraries. I plan on developing the algorithms
> > > for personal use anyway, and would like to see some of my work be used
> > > by others. If anyone is interested we can continue this thread and I
> > > will email my code, and propose my new algorithms.
> > >
> > >
> > > John Gant
> > >
> > > ---------------------------------------------------------------------
> > > To unsubscribe, e-mail: commons-dev-unsubscribe@jakarta.apache.org
> > > For additional commands, e-mail: commons-dev-help@jakarta.apache.org
> > >
> >
>
> --
> John Gant
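
For reference, the "difference engine" idea quoted above (an engine that
takes one difference method in its constructor or through a setter) could
be sketched roughly as below. This is only an illustration of the design
under discussion, not code from the thread or from commons-math: every
type and method name here (DistanceMeasure, EuclideanDistance,
CityBlockDistance, DifferenceEngine, difference, nearest) is hypothetical,
and the nearest() helper is an assumption about how such an engine might
be used by something like k-means.

```java
// Hypothetical sketch: none of these types exist in commons-math.

/** A pluggable difference (distance) method between two equal-length vectors. */
interface DistanceMeasure {
    double distance(double[] a, double[] b);
}

/** Euclidean distance: square root of the sum of squared differences. */
final class EuclideanDistance implements DistanceMeasure {
    @Override
    public double distance(double[] a, double[] b) {
        double sum = 0.0;
        for (int i = 0; i < a.length; i++) {
            double d = a[i] - b[i];
            sum += d * d;
        }
        return Math.sqrt(sum);
    }
}

/** City-block (Manhattan) distance: sum of absolute differences. */
final class CityBlockDistance implements DistanceMeasure {
    @Override
    public double distance(double[] a, double[] b) {
        double sum = 0.0;
        for (int i = 0; i < a.length; i++) {
            sum += Math.abs(a[i] - b[i]);
        }
        return sum;
    }
}

/** Engine that delegates every difference operation to the configured measure. */
final class DifferenceEngine {
    private DistanceMeasure measure;

    /** The measure can be supplied in the constructor... */
    DifferenceEngine(DistanceMeasure measure) {
        setMeasure(measure);
    }

    /** ...or swapped later through a set method. */
    void setMeasure(DistanceMeasure measure) {
        if (measure == null) {
            throw new IllegalArgumentException("measure must not be null");
        }
        this.measure = measure;
    }

    /** Difference between two vectors under the current measure. */
    double difference(double[] a, double[] b) {
        if (a.length != b.length) {
            throw new IllegalArgumentException("vectors must have equal length");
        }
        return measure.distance(a, b);
    }

    /** Index of the candidate closest to the target (assumed use: nearest centroid). */
    int nearest(double[] target, double[][] candidates) {
        if (candidates.length == 0) {
            throw new IllegalArgumentException("no candidates given");
        }
        int best = 0;
        double bestDistance = difference(target, candidates[0]);
        for (int i = 1; i < candidates.length; i++) {
            double d = difference(target, candidates[i]);
            if (d < bestDistance) {
                bestDistance = d;
                best = i;
            }
        }
        return best;
    }
}
```

With the vectors {0, 0} and {3, 4}, an engine built with
EuclideanDistance returns 5.0 from difference(), and the same engine
returns 7.0 after setMeasure(new CityBlockDistance()). Keeping the
measure behind an interface would let a clustering routine stay
independent of the metric it uses.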