Date: Fri, 19 Nov 2010 22:04:01 +0000
Subject: Re: Need for a distributed SVDRecommender
From: Sean Owen
To: user@mahout.apache.org

That result sounds confusing. It should take about the same number of
wall-clock hours either way. I don't see why it would take 14 hours -- that
sounds wrong. If anything it should take 38 / N minutes where N is the
number of recommenders you ran.

SVDRecommender is not distributed at all, no.

On Fri, Nov 19, 2010 at 9:34 PM, Sanjib Kumar Das wrote:

> Hi All,
>
> I wanted to run a distributed RecommenderJob with the SVDRecommender
> implementation.
> So I ran the pseudo.RecommenderJob with an
> SVDRecommender(numFeatures=30, trainingSteps=50) on the MovieLens 1M
> data (6040 users). This generated 10 recommendations for each of the 6040
> users but took 14 hours to do so! My Hadoop cluster had 12 machines. So I
> guess it just ran multiple instances of the non-distributed SVD
> implementation, and each of these instances did the same thing again and
> again. So unless the implementation of the recommender is distributed, we
> don't get any special benefit from the pseudo.RecommenderJob.
>
> But the item.RecommenderJob produces the same 10 recommendations for each
> of the 6040 users in 38 minutes. This is because it has an underlying
> distributed implementation.
>
> So my question is: do we have a distributed SVDRecommender implementation?
> If not, how should I go about writing one? Can I use the new LanczosSolver
> to achieve this?
>
> Thanks,
> Sanjib
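
The point about pseudo.RecommenderJob can be made concrete with a back-of-the-envelope model. This is only a sketch under the assumption Sanjib describes in the quoted message: each worker trains its own full copy of the non-distributed model and then serves only its share of the users. The function name and the timing constants are hypothetical, picked to show the shape of the result, not measured from Mahout.

```python
def pseudo_distributed_wall_time(train_minutes, per_user_minutes, num_users, num_workers):
    """Wall-clock minutes when every worker trains a full model copy,
    then computes recommendations only for its own slice of users."""
    users_per_worker = -(-num_users // num_workers)  # ceiling division
    return train_minutes + per_user_minutes * users_per_worker


# Hypothetical timings, for illustration only (not measured).
TRAIN_MINUTES = 60.0      # assumed cost of training one model copy
PER_USER_MINUTES = 0.01   # assumed cost of recommending for one user
NUM_USERS = 6040          # user count mentioned in the thread

for workers in (1, 4, 12):
    minutes = pseudo_distributed_wall_time(
        TRAIN_MINUTES, PER_USER_MINUTES, NUM_USERS, workers
    )
    print(f"{workers:2d} workers: {minutes:.1f} minutes")
```

Under this assumption the training term does not shrink as workers are added, so the wall-clock time can never drop below the time needed to train one model. Only the per-user recommendation term is divided across machines.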