mahout-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Ted Dunning <ted.dunn...@gmail.com>
Subject Re: Need a little help with using SVD
Date Fri, 18 Mar 2011 16:50:31 GMT
We have the encoders and the resulting vectors should cluster as easily as
anything.

What we don't have is a clean command line integration from text => hashed
vector => clusters

On Fri, Mar 18, 2011 at 9:44 AM, Grant Ingersoll <gsingers@apache.org>wrote:

> > Another option is to use hashed feature vectors.  These will retain
> > essentially all of the data of the larger vectors but will allow your
> > centroids to be more moderate in size.  This also helps in not requiring
> a
> > pass over your data to assign vector locations.
>
> Do we have code for using this with our existing algorithms?

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message