mahout-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Dawid Weiss (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (MAHOUT-1225) Sets and maps incorrectly clear() their state arrays (potential endless loops)
Date Thu, 23 May 2013 08:10:20 GMT

     [ https://issues.apache.org/jira/browse/MAHOUT-1225?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Dawid Weiss updated MAHOUT-1225:
--------------------------------

    Attachment: MAHOUT-1225.patch

Initial patch that shows how much needs to be done in terms of fixing stuff :)

I fixed the problem with clear() on the template for OpenIntHashSet but there are several
other classes that show non-uniform behavior against Java Util Collections (and they're very
likely bugs).

I'll try to fix these during the day today, time permitting. Stay tuned.
                
> Sets and maps incorrectly clear() their state arrays (potential endless loops)
> ------------------------------------------------------------------------------
>
>                 Key: MAHOUT-1225
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-1225
>             Project: Mahout
>          Issue Type: Bug
>          Components: Math
>    Affects Versions: 0.7
>         Environment: Eclipse, linux Fedora 17, Java 1.7, Mahout Maths collections (Set)
0.7, hppc 0.4.3
>            Reporter: Sophie Sperner
>            Assignee: Dawid Weiss
>              Labels: hashset, java, mahout, test
>             Fix For: 0.7
>
>         Attachments: hppc-0.4.3.jar, MAHOUT-1225.patch, mushroom.dat
>
>   Original Estimate: 48h
>  Remaining Estimate: 48h
>
> The code I attached hangs on forever, Eclipse does not print me its stack trace because
it does not terminate the program. So I decided to make a small test.java file that you can
easily run.
> This code has the main function that simply runs getItemList() method which successfully
executes getDataset() method (here please download mushroom.dat dataset and set the full path
into filePath string variable) and the hangs on (the problem happens on a fourth columnValues.add()
call). After the dataset was taken into X array, the code simply goes through X column by
column and searches for different items in it.
> If you uncomment IntSet columnValues = new IntOpenHashSet(); and corresponding import
headers then everything will work just fine (you will also need to include hppc jar file found
here http://labs.carrotsearch.com/hppc.html or below in the attachment).

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Mime
View raw message