Return-Path: X-Original-To: apmail-mahout-user-archive@www.apache.org Delivered-To: apmail-mahout-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 5511217F11 for ; Fri, 3 Apr 2015 17:28:37 +0000 (UTC) Received: (qmail 9874 invoked by uid 500); 3 Apr 2015 17:28:35 -0000 Delivered-To: apmail-mahout-user-archive@mahout.apache.org Received: (qmail 9812 invoked by uid 500); 3 Apr 2015 17:28:35 -0000 Mailing-List: contact user-help@mahout.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@mahout.apache.org Delivered-To: mailing list user@mahout.apache.org Received: (qmail 9796 invoked by uid 99); 3 Apr 2015 17:28:35 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 03 Apr 2015 17:28:35 +0000 X-ASF-Spam-Status: No, hits=0.2 required=5.0 tests=FREEMAIL_ENVFROM_END_DIGIT,RCVD_IN_DNSWL_NONE,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of pielle87@yahoo.com designates 98.138.120.51 as permitted sender) Received: from [98.138.120.51] (HELO nm44.bullet.mail.ne1.yahoo.com) (98.138.120.51) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 03 Apr 2015 17:28:30 +0000 Received: from [127.0.0.1] by nm44.bullet.mail.ne1.yahoo.com with NNFMP; 03 Apr 2015 17:28:09 -0000 Received: from [98.138.100.113] by nm44.bullet.mail.ne1.yahoo.com with NNFMP; 03 Apr 2015 17:25:19 -0000 Received: from [98.139.170.181] by tm104.bullet.mail.ne1.yahoo.com with NNFMP; 03 Apr 2015 17:25:19 -0000 Received: from [98.139.212.214] by tm24.bullet.mail.bf1.yahoo.com with NNFMP; 03 Apr 2015 17:25:19 -0000 Received: from [127.0.0.1] by omp1023.mail.bf1.yahoo.com with NNFMP; 03 Apr 2015 17:25:19 -0000 X-Yahoo-Newman-Property: ymail-4 X-Yahoo-Newman-Id: 296728.27612.bm@omp1023.mail.bf1.yahoo.com Received: (qmail 528 invoked by uid 60001); 3 Apr 2015 17:25:19 -0000 X-YMail-OSG: MhLElM8VM1mjhAWQSR1xoNfMM6Og86eGQz.qr8X53gv0L2n 5yQzQ5MuXknOoH.NPsE4gS2moPzti9ShF_0pY429uwZeIUUW24L4bS6XtxVj SiinHaDMNWez.WPQEuFsbevfr11AT0052OWUwNfEzv4bAka_E6Nj07szZPIg nY8zJSEdpgDKLB1OBhZ18pGik6dkmFtcVlpTQm7N3qHDlfgqIjJeRRFdkiFh Iy3H83siRp8niaP9LV4PVFGegFOshsKFWVZ6QrppJI5SgM2e1G7wqlO32Hu1 9nGlO4CDqC7FFDfXS0vdfw6DLCtJl.6oH3sUcfhf5FB1fIKRClLMg10tBJfb 16ugJwXAFrJ7.kSEWb4sHIFParXAVezcJoRa9v5Qh1yLbOW9QYOAasFI7c7p N5AfBoZnWapsOtDRoMHsceTLzX8VxEgoN9jKWjw1VREhNmoqWnztM5IIbtgq UCh4uLRiF4U_l7E.gm6JgycvgY_j8VFhpUHAUF5XA9vXGxw2qAnr1uXuxTzb 7KlOVPf4MFnoU6Cf0DWg6lyrowGlngRfSKoSFqbK82iSKckLmA7u4XjpGtTi Uyns8bM9Sug60QGHsxMl_HcPGnokzTlBqNDvsNrY2WxVskOBq76yBWhsy78m r_1zEITmLMX6CiJNoSVYFYdcZKaBzROevawt1MmZAuuY8HxnWCsCwryWlkeT 8o38szmMbcSB0NDRHHzJCNIWviE.pm7BzDqKvWOBD_q3dKeXnNfs47P4tiSp rFSg2T3s.1tGJ15dE0_oM0bmdrcF1_Et2k5hIyZqy6yzsJBXZlKzD1PAc4U6 1sgMdHLIZ8wWIZba12XB2ZGRy7wiem5AqIIlnheOipHgXWKhE5UmNh2eCA9M OTz1uWZ9jp9ziT00v.7YeAoaZzbenM.bEtdZ0vMyNBuPTzrkdqmtLuUrts3N vikH8PKksDVXKNIHMkLnCOZeBOktoaxxfNJEN_xzV7Rm_iyKVCcC3e6fwAtB Hhv4q1QOdPGsSm3W7kEYE08LgL_SKJsyXJUpP6d9c8Jw3hQ-- Received: from [91.119.52.40] by web163001.mail.bf1.yahoo.com via HTTP; Fri, 03 Apr 2015 10:25:19 PDT X-Rocket-MIMEInfo: 002.001,SGksDQp0aGFuayB5b3UgZm9yIHlvdXIgYW5zd2Vycy4gSSB1bmRlcnN0YW5kIG5vdyB3aHkgdGhlcmUgaXMgbm8gYnVpbHQtaW4gImFkanVzdGVkIGNvc2luZSBzaW1pbGFyaXR5IiA6KQ0KVW5mb3J0dW5hdGVseSB0aG91Z2gsIG15IHF1ZXN0aW9uIHN0aWxsIHN0YW5kcyAtIGlmIEkgaGFkIHRvIGltcGxlbWVudCBhbiAiYWRqdXN0ZWQgY29zaW5lIHNpbWlsYXJpdHkiLCBob3cgd291bGQgSSBkbyBpdD8NCkZvciBzYWtlIG9mIHNpbXBsaWNpdHkgSSdsbCBtb3ZlIHRoZSBkaXNjdXNzaW9uIHRvIG15IG90aGUBMAEBAQE- X-Mailer: YahooMailClassic/452 YahooMailWebService/0.8.203.740 Message-ID: <1428081919.61136.YahooMailBasic@web163001.mail.bf1.yahoo.com> Date: Fri, 3 Apr 2015 10:25:19 -0700 From: PierLorenzo Bianchini Subject: Re: adjusted cosine similarity for item-based recommender? To: user@mahout.apache.org In-Reply-To: <94B0BCD7-570F-430A-B8E5-7B4510CFE8BA@occamsmachete.com> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable X-Virus-Checked: Checked by ClamAV on apache.org Hi, thank you for your answers. I understand now why there is no built-in "adju= sted cosine similarity" :) Unfortunately though, my question still stands - if I had to implement an "= adjusted cosine similarity", how would I do it? For sake of simplicity I'll move the discussion to my other question "fast = performance way of writing preferences to file?" and I'll explain there why= I have to do it (or at least I *think* I have to do it). Looking forward to hearing from you in my other discussion, and thank you a= gain (I'll reply on the other thread in a few minutes). Pier Lorenzo -------------------------------------------- On Fri, 4/3/15, Pat Ferrel wrote: Subject: Re: adjusted cosine similarity for item-based recommender? To: "user@mahout.apache.org" Date: Friday, April 3, 2015, 6:05 PM =20 I=E2=80=99d add that overview and references are here: http://mahout.apache.org/users/algorithms/recomme= nder-overview.html =20 There are many benefits to this architecture including being able to make recs to anonymous users (as long as you have a little history for them) no need to retrain the recommender. The server is a fast scalable search engine and so the multimodal part is accessed by changing the query, which also means you can use context in realtime. Like what category you want recs to favor or be filtered by.=20 =20 In a cooccurrence recommender ratings are ignored. You want to gather history of some user action that is as close to the intended action you want to recommend. For ecom that=E2=80=99s probably purchase, for movies it might be a full movie view. The quality of this action is important. =20 =20 For an example movie recommender check the demo app, which uses most of the techniques mentioned in the references; https://guide.finderbots.com=20 =20 =20 On Apr 3, 2015, at 8:48 AM, Ted Dunning wrote: =20 For practical recommendation systems, ratings are almost irrelevant. Ratings were prominent in the original academic work on recommendations largely because with the early research systems, users had no recordable interactions with content other than ratings.=C2=A0 The Taste component of Mahout was written largely by following that literature. =20 In fact, in a real world system, this turns out to be very wrong.=C2=A0 Ratings are a very poor source of recommendation information in most real applications for two reasons: =20 1) interpretation is hard (your issues with bias are just the beginning) =20 2) volume is very low because most (often >>99%) users don't rate =20 If you are building a production recommender you should be looking at indicator-based techniques and moving to multi-modal recommendations. =20 The code in question was deprecated precisely because it has little practical impact. =20 =20 =20 On Fri, Apr 3, 2015 at 3:36 AM, PierLorenzo Bianchini < pielle87@yahoo.com.invalid> wrote: =20 > hello everyone, > I'm trying to build item-based recommender that would take the users' > rating-bias into account. According to my class' slides, that should > involve "adjusted cosine similarity". I couldn't find such implementation > in Mahout... did I search it in the wrong place or should I implement it? > If so, how? Any tips would be welcome since I'm new to Mahout... >=20 > I found out that in mahout-0.8 there was a (deprecated) class that seemed > to do what I'm looking for ( > http://archive.cloudera.com/cdh5/cdh/5/mahout-0.8-cdh5.0.0/mahout-core/o= rg/apache/mahout/cf/taste/impl/recommender/BiasedItemBasedRecommender.html)= . > Does anybody know why this was removed? >=20 > Note: I also posted the question on stackoferlow ( > http://stackoverflow.com/questions/29419222/mahout-adjusted-cosine-simil= arity-for-item-based-recommender > ) > Thank you in advance! Regards, >=20 > Pier Lorenzo >=20