mahout-user mailing list archives

From: "S.Sudarshan" <sudarsha...@gmail.com>
Subject: LDA only gives one topic
Date: Fri, 29 Jun 2012 14:57:29 GMT

Hello,

I have been following the Mahout in Action book to learn Mahout. It's a
great book. I am at the section where I run the LDA algorithm on the
Reuters data. However, no matter how many times I run it, I only get one
topic (Topic 0) when I run LDAPrintTopics on state-20. I ran the command
as indicated:

mahout lda -i reuters-vectors/tf-vectors -o reuters-lda-sparse -k 10 -v 34262 -x 20 -ow

Topic 0
===========
billion [p(billion|topic_0) = 0.04580929884162013
pct [p(pct|topic_0) = 0.043323700764985575
dlrs [p(dlrs|topic_0) = 0.031395871939373196
3 [p(3|topic_0) = 0.027311386657272094
1987 [p(1987|topic_0) = 0.025690077982656934
1 [p(1|topic_0) = 0.022727304049111215
reuter [p(reuter|topic_0) = 0.019572283708227903
mln [p(mln|topic_0) = 0.014569551610736616
april [p(april|topic_0) = 0.014453636611524965
march [p(march|topic_0) = 0.014359948846622552

Could someone help me with this?


Thanks.
