Message-ID: <4D9EE053.3060203@apache.org>
Date: Fri, 08 Apr 2011 12:15:47 +0200
From: Sebastian Schelter
Reply-To: ssc@apache.org
To: user@mahout.apache.org
Subject: Re: ItemSimilarityJob as UserSimilarityJob
References: <4D9EDDB4.8090206@mufin.com>

Please don't use the similarity jobs from Mahout 0.4, as they have a serious bug. Use the trunk instead.

--sebastian

On 08.04.2011 12:12, Sean Owen wrote:
> Yep you can definitely do that, no problem.
>
> On Fri, Apr 8, 2011 at 11:04 AM, Thomas Rewig wrote:
>
>> Hello,
>>
>> I am currently experimenting with the Mahout jobs on Hadoop.
>>
>> As I understand it:
>> * the RecommenderJob computes item recommendations for all users
>> * and the ItemSimilarityJob computes all item-item similarities
>>
>> I wonder if there is a job for calculating user-user similarities,
>> a UserSimilarityJob (or do I have to write my own RecommenderJob?). In my
>> opinion the RowSimilarityJob is not meant for this purpose, is it?
>>
>> So if I have this data:
>>
>> userID1,itemID1,preferencevalue
>> userID1,itemID2,preferencevalue
>> userID2,itemID1,preferencevalue
>> userID2,itemID3,preferencevalue
>> ...
>>
>> and transform it to
>>
>> itemID1,userID1,preferencevalue
>> itemID1,userID2,preferencevalue
>> itemID2,userID1,preferencevalue
>> itemID3,userID2,preferencevalue
>> ...
>>
>> I can use the ItemSimilarityJob to get all user-user similarities, and
>> this should be the same result I would expect from a UserSimilarityJob. But
>> is there an easier way, without transforming the data, to get all
>> user-user similarities?
>>
>> Or have I missed something?
>>
>> I use Mahout 0.4.
>>
>> Thanks in advance!
>> Thomas
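
For reference, the column swap Thomas describes (userID,itemID,preferencevalue to itemID,userID,preferencevalue) could be done with a few lines of Python before handing the file to the ItemSimilarityJob. This is only a rough sketch: the function names swap_user_item and transpose_preferences are made up for illustration and are not part of Mahout, and it assumes a plain comma-separated file small enough to process in memory.

```python
import csv
import io


def swap_user_item(lines):
    """Yield CSV rows with the first two columns (userID, itemID) swapped."""
    for row in csv.reader(lines):
        if len(row) < 2:
            continue  # skip blank or malformed lines
        user_id, item_id, *rest = row
        # Any remaining columns (e.g. the preference value) are kept as-is.
        yield [item_id, user_id, *rest]


def transpose_preferences(text):
    """Return the preference data as text with user and item columns swapped."""
    out = io.StringIO()
    writer = csv.writer(out, lineterminator="\n")
    writer.writerows(swap_user_item(text.splitlines()))
    return out.getvalue()


if __name__ == "__main__":
    sample = "1,10,3.0\n1,20,4.5\n2,10,1.0\n2,30,2.0\n"
    print(transpose_preferences(sample), end="")
```

With the columns swapped, the "items" the job compares are really the users, which is why the output can be read as user-user similarities. For data too large for one machine, the same swap would presumably be done as a map-only Hadoop step instead.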