mahout-dev mailing list archives

From "Bhaskar Devireddy (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (MAHOUT-1001) Performance improvement in recommenditembased
Date Wed, 25 Apr 2012 15:36:16 GMT

     [ https://issues.apache.org/jira/browse/MAHOUT-1001?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Bhaskar Devireddy updated MAHOUT-1001:
--------------------------------------

    Attachment: RowSimilarityJob.patch
    
> Performance improvement in recommenditembased
> ---------------------------------------------
>
>                 Key: MAHOUT-1001
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-1001
>             Project: Mahout
>          Issue Type: Improvement
>          Components: Collaborative Filtering
>    Affects Versions: 0.6
>            Reporter: Bhaskar Devireddy
>            Assignee: Sean Owen
>             Fix For: 0.7
>
>         Attachments: RowSimilarityJob.patch
>
>
> While running recommendations on the ASFEMail dataset using the example script provided
> with Mahout, we noticed that the execution time of the unsymmetrify mapper is very long.
> While profiling the task, we found a hotspot consuming a large number of CPU cycles. The
> attached patch addresses this issue by optimizing the unsymmetrify mapper class. The patch
> retains functionality (we verified the output with and without the patch) and speeds up
> the unsymmetrify mapper by more than 5X on x86 architectures.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira
