mahout-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Ted Dunning <ted.dunn...@gmail.com>
Subject Re: gsoc , EM or SVM?
Date Wed, 01 Apr 2009 20:09:16 GMT
Yifan,

EM is a highly non-specific term and covers a huge range of very different
algorithms.  For example, pLSI, HMM's, and mixture models can all be
estimated using EM.

What exactly did you mean to address with an EM implementation?

On Wed, Apr 1, 2009 at 1:05 PM, Grant Ingersoll <gsingers@apache.org> wrote:

> Hi Yifan,
>
> I think both are good candidates, although AIUI, SVM is a bit harder to
> parallelize, so maybe it would make sense to focus on EM.  Of course, we
> don't have to be distributed, so you could propose a non-distributed SVM
> implementation as a first cut and then work on the distributed part as the
> project develops.
>
> ...
>>
>>
>> For EM, it is a generalization of the k-means algorithm, and we already
>> have
>> k-means in the Mahout library.
>>
>>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message