mahout-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Dawid Weiss (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (MAHOUT-1225) OpenIntHashSet hangs on forever on the fourth add.
Date Tue, 21 May 2013 21:08:20 GMT

    [ https://issues.apache.org/jira/browse/MAHOUT-1225?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13663401#comment-13663401
] 

Dawid Weiss commented on MAHOUT-1225:
-------------------------------------

I wouldn't be impolite to people who are trying to help you, Sophie. Software bugs are omnipresent
and are not necessarily a result of one's negligence -- very often it's a process of multiple
rewrites by many people. In this particular case collections have been ported over from Colt
and multiple changes layered.

Judging from the test code you submitted on this issue I am pretty confident that the choice
of collections library is secondary -- your problem is in algorithmic solutions to whatever
it is you're trying to achieve.
                
> OpenIntHashSet hangs on forever on the fourth add.
> --------------------------------------------------
>
>                 Key: MAHOUT-1225
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-1225
>             Project: Mahout
>          Issue Type: Bug
>          Components: Math
>    Affects Versions: 0.7
>         Environment: Eclipse, linux Fedora 17, Java 1.7, Mahout Maths collections (Set)
0.7, hppc 0.4.3
>            Reporter: Sophie Sperner
>              Labels: hashset, java, mahout, test
>             Fix For: 0.7
>
>         Attachments: hppc-0.4.3.jar, mushroom.dat, test.java
>
>   Original Estimate: 48h
>  Remaining Estimate: 48h
>
> The code I attached hangs on forever, Eclipse does not print me its stack trace because
it does not terminate the program. So I decided to make a small test.java file that you can
easily run.
> This code has the main function that simply runs getItemList() method which successfully
executes getDataset() method (here please download mushroom.dat dataset and set the full path
into filePath string variable) and the hangs on (the problem happens on a fourth columnValues.add()
call). After the dataset was taken into X array, the code simply goes through X column by
column and searches for different items in it.
> If you uncomment IntSet columnValues = new IntOpenHashSet(); and corresponding import
headers then everything will work just fine (you will also need to include hppc jar file found
here http://labs.carrotsearch.com/hppc.html or below in the attachment).

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Mime
View raw message