mahout-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Ted Dunning <ted.dunn...@gmail.com>
Subject Re: Extracting the topics of documents (LDA, Mahout 0.7)
Date Fri, 07 Feb 2014 00:31:59 GMT
OK.  Cool.

That probably means that problem is much smaller and more likely to be
logistics.  Your suggestion of an off-by-one issue is quite plausible.


On Thu, Feb 6, 2014 at 4:46 PM, Stamatis Rapanakis
<stamrapanakis@gmail.com>wrote:

> That is correct. My problem is not the categories developed (which are
> meaningful by the way) but the fact that a certain document is not assigned
> to the proper (LDA generated) category. The document to topics assignment
> is really bad...
>
>
> On Thu, Feb 6, 2014 at 5:08 PM, Ted Dunning <ted.dunning@gmail.com> wrote:
>
> > I can't comment on the specific question that you ask, but it should not
> > necessarily be expected that LDA will reconstruct the categories that you
> > have in mind.  It will develop categories that explain the data as well
> as
> > it can, but that won't necessarily match the categories you intend.
> >
> > It is likely, however, that the topics that LDA derives would make a good
> > set of features for a classifier.
> >
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message