mahout-user mailing list archives

From Jake Mannix <jake.man...@gmail.com>
Subject Re: Determining Document Cluster Probabilities with LDA
Date Tue, 26 Apr 2011 21:32:53 GMT
On Tue, Apr 26, 2011 at 2:08 PM, Ted Dunning <ted.dunning@gmail.com> wrote:
>
> - LDA isn't really clustering.  It is more along the lines of SVD as a
> dimensionality reduction.  It should be possible to display the internals
> to find which terms or documents have the highest components on a single
> topic, but combinations of topics are still interesting in LDA just as
> combinations of coordinates in SVD are interesting.
>

Ted, I think what they are asking for is the output of the gamma matrix
(i.e. the LDA version of the *left* singular vectors, living in
document-by-topic space, not topic-by-word space), which is currently not
produced (not even on trunk, iirc).
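
To make the document-by-topic idea concrete, here is a minimal sketch of how
per-document topic proportions could be read off a gamma matrix if one were
available. It assumes gamma holds the variational Dirichlet parameters, one
row per document and one column per topic, so that normalizing a row gives
the expected topic mixture for that document. The function names and the toy
values are hypothetical and are not part of Mahout.

```python
def topic_proportions(gamma):
    """Normalize each row of a document-by-topic gamma matrix to sum to 1.

    Assumes gamma[d][k] is the variational Dirichlet parameter for
    document d and topic k, so row / sum(row) is the expected mixture.
    """
    proportions = []
    for row in gamma:
        total = sum(row)
        proportions.append([g / total for g in row])
    return proportions


def top_topic(proportions):
    """Return the index of the highest-weight topic for each document."""
    return [max(range(len(row)), key=row.__getitem__) for row in proportions]


# Hypothetical gamma for 2 documents and 3 topics (made-up values).
gamma = [
    [8.0, 1.0, 1.0],
    [0.5, 0.5, 4.0],
]

theta = topic_proportions(gamma)
dominant = top_topic(theta)
```

Each row of theta is a full distribution over topics, so the dominant topic
is only a summary; the combination of weights in a row is what carries the
information, much like the coordinates of a document in SVD space.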

  -jake
