mahout-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Charly Lizarralde <charly.lizarra...@gmail.com>
Subject Re: lda + vector dump
Date Fri, 23 Aug 2013 15:36:21 GMT
Thanks! Docs are in spanish, so, maybe I should provide the spanish list
then...

The vector dump comand is: mahout vectordump --seqFile
tn/topics/part-m-00000 --dictionary tn/vectors/dictionary.file-0
--dictionaryType sequencefile --output topicdump.txt -sort --vectorSize 10

And the output ( topicdump.txt)  is:

{yo:0.05347025391826375,pero:0.01621850129739,hay:0.01256010577070346,como:0.015645488997146385,apellido:0.0138762425391
95612,quiero:0.02141736852909945,mis:0.011207260571060144,mi:0.10256988732241464,me:0.13765803016629644,zi:0.03454083289
2116805}

As you can see, the topicterms are not sorted.

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message