mahout-dev mailing list archives

From "Angel Martinez Gonzalez (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (MAHOUT-1047) CVB hangs after completion
Date Wed, 24 Apr 2013 17:57:16 GMT

     [ https://issues.apache.org/jira/browse/MAHOUT-1047?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Angel Martinez Gonzalez updated MAHOUT-1047:
--------------------------------------------

    Attachment: MAHOUT-1047.patch

Hi,
I was trying cvb with the Reuters collection in local mode and it was hanging every time. I
looked into the problem for a while and then found this bug report.
The problem was with TopicModel's pool, which was never shut down. This patch passes the tests,
and Reuters clustering no longer hangs on my machine. Take it with some caution though,
because I do not have a complete understanding of the cvb implementation yet.
Also, I did not mean to change the issue status; I was just trying to upload the patch.
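
For readers unfamiliar with this failure mode, here is a minimal sketch of why a pool that is never shut down can keep a JVM alive after the work has finished. It assumes the pool is a standard java.util.concurrent executor with non-daemon worker threads. PooledModel, submitUpdate, stop and trainAndStop are hypothetical names for illustration only; this is not the code in the attached patch or in Mahout's TopicModel.

```java
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.atomic.AtomicInteger;

public class PoolShutdownSketch {

  // Hypothetical stand-in for a model that owns a worker pool.
  static final class PooledModel {
    // Worker threads of a fixed pool are non-daemon by default and never time out.
    private final ExecutorService pool = Executors.newFixedThreadPool(2);
    private final AtomicInteger updates = new AtomicInteger();

    void submitUpdate() {
      pool.execute(updates::incrementAndGet);
    }

    // If this is never called, the idle workers stay alive and the JVM
    // does not exit when main() returns, which looks like a hang.
    boolean stop() {
      pool.shutdown(); // already-submitted tasks still run; new ones are rejected
      try {
        return pool.awaitTermination(10, TimeUnit.SECONDS);
      } catch (InterruptedException e) {
        Thread.currentThread().interrupt();
        return false;
      }
    }

    int completedUpdates() {
      return updates.get();
    }
  }

  // Submits numUpdates tasks, then always shuts the pool down.
  static int trainAndStop(int numUpdates) {
    PooledModel model = new PooledModel();
    try {
      for (int i = 0; i < numUpdates; i++) {
        model.submitUpdate();
      }
    } finally {
      if (!model.stop()) {
        throw new IllegalStateException("pool did not terminate in time");
      }
    }
    return model.completedUpdates();
  }

  public static void main(String[] args) {
    System.out.println("completed updates: " + trainAndStop(5));
  }
}
```

Removing the stop() call from trainAndStop should reproduce the symptom in this sketch: the result is printed, but the process does not return to the shell. Putting the shutdown in a finally block is one way to make sure the pool is released even when a task submission fails.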
                
> CVB hangs after completion
> --------------------------
>
>                 Key: MAHOUT-1047
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-1047
>             Project: Mahout
>          Issue Type: Bug
>          Components: Clustering
>    Affects Versions: 0.7
>         Environment: Ubuntu
>            Reporter: seth boyles
>            Priority: Minor
>              Labels: cvb, lda
>             Fix For: 0.7, 0.8
>
>         Attachments: MAHOUT-1047.patch, MAHOUT-1047-Show-Leak.patch
>
>
> After running the new LDA CVB implementation, it hangs and does not terminate the process like every other time I run Mahout.
> Terminal output:
> 12/07/19 11:38:49 INFO mapred.LocalJobRunner: 
> 12/07/19 11:38:49 INFO mapred.Task: Task 'attempt_local_0022_m_000000_0' done.
> 12/07/19 11:38:49 INFO mapred.JobClient:  map 100% reduce 0%
> 12/07/19 11:38:49 INFO mapred.JobClient: Job complete: job_local_0022
> 12/07/19 11:38:49 INFO mapred.JobClient: Counters: 8
> 12/07/19 11:38:49 INFO mapred.JobClient:   File Output Format Counters 
> 12/07/19 11:38:49 INFO mapred.JobClient:     Bytes Written=2247793
> 12/07/19 11:38:49 INFO mapred.JobClient:   File Input Format Counters 
> 12/07/19 11:38:49 INFO mapred.JobClient:     Bytes Read=1920337
> 12/07/19 11:38:49 INFO mapred.JobClient:   FileSystemCounters
> 12/07/19 11:38:49 INFO mapred.JobClient:     FILE_BYTES_READ=1342812616
> 12/07/19 11:38:49 INFO mapred.JobClient:     FILE_BYTES_WRITTEN=1326092302
> 12/07/19 11:38:49 INFO mapred.JobClient:   Map-Reduce Framework
> 12/07/19 11:38:49 INFO mapred.JobClient:     Map input records=2772
> 12/07/19 11:38:49 INFO mapred.JobClient:     Spilled Records=0
> 12/07/19 11:38:49 INFO mapred.JobClient:     SPLIT_RAW_BYTES=140
> 12/07/19 11:38:49 INFO mapred.JobClient:     Map output records=2772
> 12/07/19 11:38:49 INFO driver.MahoutDriver: Program took 4089950 ms (Minutes: 68.16583333333334)
> $MAHOUT_HOME/mahout cvb -i /home/seth/Scripted/mahout_data/vectors/vectors/vectors-for-cvb/ -o /home/seth/Scripted/mahout_data/clusters/ -ow -k 90 -dt /home/seth/Scripted/mahout_data/distributions -dict /home/seth/Scripted/mahout_data/vectors/vectors/dictionary.file-0 -mt /home/seth/Scripted/mahout_data/temp/ -x 20 -cd 0.05

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
