Date: Thu, 1 Mar 2012 07:51:06 +0100
Subject: Re: psvm
From: Thomas Jungblut
To: sarawgi.aditya@gmail.com, hama-dev@incubator.apache.org

Hi Aditya,

I'm from the Apache Hama team, where we are working on a BSP (Bulk Synchronous Parallel) engine. BSP is quite like MPI, but with just two primitives (barrier sync and message send). I don't know if that is enough for your algorithm, but I would be very interested in implementing it with BSP and Apache Hama.
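
To make the two BSP primitives (message send and barrier sync) a bit more concrete, here is a minimal sketch in plain Java. It is only a toy in-process simulation using threads and is not the Apache Hama API; the names `BspSketch`, `run`, and the per-peer inboxes are hypothetical and chosen purely for illustration. Each peer sends its local value to every peer, waits at the barrier, and then sums what it received.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.concurrent.BrokenBarrierException;
import java.util.concurrent.ConcurrentLinkedQueue;
import java.util.concurrent.CyclicBarrier;

// Toy in-process BSP simulation. This is NOT the Apache Hama API;
// all names here are hypothetical and for illustration only.
class BspSketch {

    // Runs `peers` threads through one superstep and returns each peer's sum.
    static int[] run(int peers) {
        // One inbox per peer; "sending" a message means adding to an inbox.
        List<ConcurrentLinkedQueue<Integer>> inboxes = new ArrayList<>();
        for (int i = 0; i < peers; i++) {
            inboxes.add(new ConcurrentLinkedQueue<>());
        }
        CyclicBarrier barrier = new CyclicBarrier(peers);
        int[] sums = new int[peers];
        Thread[] threads = new Thread[peers];

        for (int p = 0; p < peers; p++) {
            final int id = p;
            threads[p] = new Thread(() -> {
                try {
                    // Primitive 1: message send.
                    // Each peer sends its local value (id + 1) to every peer.
                    for (ConcurrentLinkedQueue<Integer> inbox : inboxes) {
                        inbox.add(id + 1);
                    }
                    // Primitive 2: barrier sync.
                    // No peer continues until all peers have finished sending.
                    barrier.await();
                    // After the barrier, every message of this superstep is visible.
                    int sum = 0;
                    for (int value : inboxes.get(id)) {
                        sum += value;
                    }
                    sums[id] = sum;
                } catch (InterruptedException | BrokenBarrierException e) {
                    throw new IllegalStateException(e);
                }
            });
            threads[p].start();
        }
        for (Thread t : threads) {
            try {
                t.join();
            } catch (InterruptedException e) {
                throw new IllegalStateException(e);
            }
        }
        return sums;
    }

    public static void main(String[] args) {
        // With 4 peers, every peer receives 1 + 2 + 3 + 4.
        System.out.println(Arrays.toString(run(4))); // prints [10, 10, 10, 10]
    }
}
```

An iterative algorithm would repeat this send/sync/compute cycle once per superstep; whether that pattern is enough for psvm is exactly the open question above.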

I have already implemented k-means clustering with BSP [1], which is much faster than the MapReduce implementation [2]. I plan to contribute it to Mahout over the next few months, since I think BSP is a missing part of large-scale machine learning (currently I just see MapReduce implementations everywhere). You would help give Mahout another good example of BSP and machine learning. And it would of course help me convince the Mahout team of the usefulness of Apache Hama ;)

If you are interested, I'd be glad to hear from you.

Best regards,
Thomas

[1] https://github.com/thomasjungblut/thomasjungblut-common/blob/master/src/de/jungblut/clustering/KMeansBSP.java
[2] http://wiki.apache.org/hama/Benchmarks (scroll down a bit)

On 1 March 2012 at 06:31, Aditya Sarawgi wrote:
> Hello,
>
> I am looking to implement psvm for Mahout as a part of my coursework.
> The reference paper is
> http://books.nips.cc/papers/files/nips20/NIPS2007_0435.pdf
> and there is an implementation at http://code.google.com/p/psvm/ which
> uses MPI.
> Any ideas or pointers are much appreciated.
>
> Thanks
> Aditya Sarawgi