Return-Path: X-Original-To: apmail-mahout-dev-archive@www.apache.org Delivered-To: apmail-mahout-dev-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id D2ED59B87 for ; Wed, 25 Apr 2012 15:36:40 +0000 (UTC) Received: (qmail 31696 invoked by uid 500); 25 Apr 2012 15:36:40 -0000 Delivered-To: apmail-mahout-dev-archive@mahout.apache.org Received: (qmail 31628 invoked by uid 500); 25 Apr 2012 15:36:40 -0000 Mailing-List: contact dev-help@mahout.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@mahout.apache.org Delivered-To: mailing list dev@mahout.apache.org Received: (qmail 31616 invoked by uid 99); 25 Apr 2012 15:36:40 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 25 Apr 2012 15:36:40 +0000 X-ASF-Spam-Status: No, hits=-2000.0 required=5.0 tests=ALL_TRUSTED,T_RP_MATCHES_RCVD X-Spam-Check-By: apache.org Received: from [140.211.11.116] (HELO hel.zones.apache.org) (140.211.11.116) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 25 Apr 2012 15:36:37 +0000 Received: from hel.zones.apache.org (hel.zones.apache.org [140.211.11.116]) by hel.zones.apache.org (Postfix) with ESMTP id EB43B35F390 for ; Wed, 25 Apr 2012 15:36:16 +0000 (UTC) Date: Wed, 25 Apr 2012 15:36:16 +0000 (UTC) From: "Bhaskar Devireddy (JIRA)" To: dev@mahout.apache.org Message-ID: <972633373.672.1335368176965.JavaMail.tomcat@hel.zones.apache.org> In-Reply-To: <753482179.661.1335367945050.JavaMail.tomcat@hel.zones.apache.org> Subject: [jira] [Updated] (MAHOUT-1001) Performance improvement in recommenditembased MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/MAHOUT-1001?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bhaskar Devireddy updated MAHOUT-1001: -------------------------------------- Attachment: RowSimilarityJob.patch > Performance improvement in recommenditembased > --------------------------------------------- > > Key: MAHOUT-1001 > URL: https://issues.apache.org/jira/browse/MAHOUT-1001 > Project: Mahout > Issue Type: Improvement > Components: Collaborative Filtering > Affects Versions: 0.6 > Reporter: Bhaskar Devireddy > Assignee: Sean Owen > Fix For: 0.7 > > Attachments: RowSimilarityJob.patch > > > While running the recommendations with ASFEMail dataset using the example script provided with mahout, we noticed that execution time for unsymmetrify mapper is very long. While profiling the task we noticed a hotspot consuming high CPU cycle. Please find the attached patch addressing issue and optimizes the unsymmetrify mapper class. This patch while retaining functionality(verified the output with and without patch) speeds up the unsymmetrify mapper by more then 5X on x86 architectures. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira