Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id DA2B4200D3D for ; Mon, 13 Nov 2017 18:53:27 +0100 (CET) Received: by cust-asf.ponee.io (Postfix) id D8A01160BF3; Mon, 13 Nov 2017 17:53:27 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id AAF63160BF0 for ; Mon, 13 Nov 2017 18:53:26 +0100 (CET) Received: (qmail 92637 invoked by uid 500); 13 Nov 2017 17:53:25 -0000 Mailing-List: contact user-help@predictionio.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@predictionio.apache.org Delivered-To: mailing list user@predictionio.apache.org Received: (qmail 92626 invoked by uid 99); 13 Nov 2017 17:53:25 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd3-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 13 Nov 2017 17:53:25 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd3-us-west.apache.org (ASF Mail Server at spamd3-us-west.apache.org) with ESMTP id 3B5AF1807FA for ; Mon, 13 Nov 2017 17:53:24 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd3-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: -0.789 X-Spam-Level: X-Spam-Status: No, score=-0.789 tagged_above=-999 required=6.31 tests=[DKIM_SIGNED=0.1, DKIM_VALID=-0.1, HTML_MESSAGE=2, KAM_SHORT=0.001, RCVD_IN_DNSWL_NONE=-0.0001, RCVD_IN_MSPIKE_H2=-2.8, T_REMOTE_IMAGE=0.01] autolearn=disabled Authentication-Results: spamd3-us-west.apache.org (amavisd-new); dkim=pass (2048-bit key) header.d=occamsmachete-com.20150623.gappssmtp.com Received: from mx1-lw-us.apache.org ([10.40.0.8]) by localhost (spamd3-us-west.apache.org [10.40.0.10]) (amavisd-new, port 10024) with ESMTP id SHDTE0wq-JiP for ; Mon, 13 Nov 2017 17:53:22 +0000 (UTC) Received: from mail-pf0-f195.google.com (mail-pf0-f195.google.com [209.85.192.195]) by mx1-lw-us.apache.org (ASF Mail Server at mx1-lw-us.apache.org) with ESMTPS id C03F65F2C3 for ; Mon, 13 Nov 2017 17:53:21 +0000 (UTC) Received: by mail-pf0-f195.google.com with SMTP id d28so12346277pfe.2 for ; Mon, 13 Nov 2017 09:53:21 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=occamsmachete-com.20150623.gappssmtp.com; s=20150623; h=from:message-id:mime-version:subject:date:in-reply-to:cc:to :references; bh=oe2X/OrlhMmxggTlSanNG2w+xZ794pIynGxMmu2+oq4=; b=Ux2vkDWksdXaILgc22FH//kl3dFrmfLYlLQq+a+dd7/7ZhGU+k5rsyY9Xy+pUdtPOU czgF2VPBOaHMjDBWrqZkalljTYNumRQCqKAJmGqzlbpW8lniFX15dXtJieguianAxNRX 0KJnmPmoHa74OtifK18ulDyc28HXauFkhbD0BJ+AyHnN3XjLyMWzBO/tqcOHmJiTzVGQ 8/cxCQDNfwMH3qgRdF893Lv8qFy7yI+UrD7blCUfqqXGc9URgD6J2z96OX4Q2cKrUFxM kea1YG9HQfiiWUAWggmsnXNzkcPvz8RgJAVvcDEB+7SZS+IAXVOo7lqCi1dtfIECRkCF GCpQ== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:from:message-id:mime-version:subject:date :in-reply-to:cc:to:references; bh=oe2X/OrlhMmxggTlSanNG2w+xZ794pIynGxMmu2+oq4=; b=r7RATkl3gOQ7tgakwIhVPnw9+FNzpemzfHwcVS2F6eCGLPwh4w16ZK5Ewe73UUDTiH Ij1Dtubqz8zmIzUz8LIj+vImppf6Z6+hpzIj4ChBtEHQm6CirgPIX4XAC5SHZwxwdg6D C73/V9ihlyW0H3T80DSGfrSIz3m/88kTrZwNLDg7dQrT9O+pTCvLnFfSvjsWDcV0r083 dEwSCNgb2SBGJt6ItfgirmxZxHNYnzHsfCYDQczPkVcj2a35IvEgoBcwZ75LfuUVmPow 1lhziyC9j+5pBJhcyYBBouyHPaUNf92GIuseQkio34aLMFcWoFHVR/GDzgyXaS47Hp5b NupA== X-Gm-Message-State: AJaThX7I3qz1fWBbrS8b5Njfh9IxxCu3Cs/eTScmflapo6I5pz9o4NJx GDC688RN+iajcVa3jetIpmUpD4txbTY= X-Google-Smtp-Source: AGs4zMYIB0xRnBq6HP3g57a8GmKyN5KfET3OwMf0bGRldMSLxG0kP0GMxqR5nHpazU3sopLI/kb0wA== X-Received: by 10.98.209.8 with SMTP id z8mr10619010pfg.184.1510595594863; Mon, 13 Nov 2017 09:53:14 -0800 (PST) Received: from [192.168.0.4] (c-24-18-213-211.hsd1.wa.comcast.net. [24.18.213.211]) by smtp.gmail.com with ESMTPSA id w20sm35050826pfi.89.2017.11.13.09.53.13 (version=TLS1_2 cipher=ECDHE-RSA-AES128-GCM-SHA256 bits=128/128); Mon, 13 Nov 2017 09:53:14 -0800 (PST) From: Pat Ferrel Message-Id: <37C43283-3B9C-4EDC-9E26-09ED84B1A91F@occamsmachete.com> Content-Type: multipart/alternative; boundary="Apple-Mail=_140BFA8E-5D85-4932-AB86-B9196B32F5DF" Mime-Version: 1.0 (Mac OS X Mail 10.3 \(3273\)) Subject: Re: Which template for predicting ratings? Date: Mon, 13 Nov 2017 09:53:13 -0800 In-Reply-To: <1AEA4152-EA26-4A3F-BEFE-4A8046A35212@occamsmachete.com> Cc: user@predictionio.incubator.apache.org To: user@predictionio.apache.org References: <1AEA4152-EA26-4A3F-BEFE-4A8046A35212@occamsmachete.com> X-Mailer: Apple Mail (2.3273) archived-at: Mon, 13 Nov 2017 17:53:28 -0000 --Apple-Mail=_140BFA8E-5D85-4932-AB86-B9196B32F5DF Content-Transfer-Encoding: quoted-printable Content-Type: text/plain; charset=utf-8 What I was saying is the UR can use ratings, but not predict them. Use = MLlib ALS recommenders if you want to predict them for all items. On Nov 13, 2017, at 9:32 AM, Pat Ferrel wrote: What we did in the article I attached is assume 1-2 is dislike, and 4-5 = is like. These are treated as indicators and will produce a score from the = recommender but these do not relate to 1-5 scores. If you need to predict what the user would score an item MLlib ALS = templates will do it. On Nov 13, 2017, at 2:42 AM, Noelia Os=C3=A9s Fern=C3=A1ndez = > wrote: Hi Pat, I truly appreciate your advice. However, what to do with a client that is adamant that they want to = display the predicted ratings in the form of 1 to 5-stars? That's my = case right now.=20 I will pose a more concrete question. Is there any template for which = the scores predicted by the algorithm are in the same range as the = ratings in the training set? Thank you very much for your help! Noelia On 10 November 2017 at 17:57, Pat Ferrel > wrote: Any of the Spark MLlib ALS recommenders in the PIO template gallery = support ratings. However I must warn that ratings are not very good for recommendations = and none of the big players use ratings anymore, Netflix doesn=E2=80=99t = even display them. The reason is that your 2 may be my 3 or 4 and that = people rate different categories differently. For instance Netflix found = Comedies were rated lower than Independent films. There have been many = solutions proposed and tried but none have proven very helpful. There is another more fundamental problem, why would you want to = recommend the highest rated item? What do you buy on Amazon or watch on = Netflix? Are they only your highest rated items. Research has shown that = they are not. There was a whole misguided movement around ratings that = affected academic papers and cross-validation metrics that has fairly = well been discredited. It all came from the Netflix prize that used = both. Netflix has since led the way in dropping ratings as they saw the = things I have mentioned. What do you do? Categorical indicators work best (like, dislike)or = implicit indicators (buy) that are unambiguous. If a person buys = something, they like it, if the rate it 3 do they like it? I buy many 3 = rated items on Amazon if I need them.=20 My advice is drop ratings and use thumbs up or down. These are = unambiguous and the thumbs down can be used in some cases to predict = thumbs up: = https://developer.ibm.com/dwblog/2017/mahout-spark-correlated-cross-occure= nces/ = This uses data from a public web site to show significant lift = by using =E2=80=9Clike=E2=80=9D and =E2=80=9Cdislike=E2=80=9D in = recommendations. This used the Universal Recommender. On Nov 10, 2017, at 5:02 AM, Noelia Os=C3=A9s Fern=C3=A1ndez = > wrote: Hi all, I'm new to PredictionIO so I apologise if this question is silly. I have an application in which users are rating different items in a = scale of 1 to 5 stars. I want to recommend items to a new user and give = her the predicted rating in number of stars. Which template should I use = to do this? Note that I need the predicted rating to be in the same = range of 1 to 5 stars. Is it possible to do this with the ecommerce recommendation engine? Thank you very much for your help! Noelia --=20 Noelia Os=C3=A9s Fern=C3=A1ndez, PhD Senior Researcher | Investigadora Senior noses@vicomtech.org +[34] 943 30 92 30 Data Intelligence for Energy and Industrial Processes | Inteligencia de Datos para Energ=C3=ADa y Procesos Industriales = = member of: Legal Notice - Privacy policy = --Apple-Mail=_140BFA8E-5D85-4932-AB86-B9196B32F5DF Content-Transfer-Encoding: quoted-printable Content-Type: text/html; charset=utf-8 What I was saying is the UR can use ratings, but not predict = them. Use MLlib ALS recommenders if you want to predict them for all = items.


On Nov 13, 2017, at 9:32 AM, Pat Ferrel = <pat@occamsmachete.com> wrote:

What we did in = the article I attached is assume 1-2 is dislike, and 4-5 is like.

These are treated as = indicators and will produce a score from the recommender but these do = not relate to 1-5 scores.

If you need to predict what the user would score an item = MLlib ALS templates will do it.



On Nov 13, 2017, at 2:42 AM, = Noelia Os=C3=A9s Fern=C3=A1ndez <noses@vicomtech.org> wrote:

Hi Pat,

I = truly appreciate your advice.

However, = what to do with a client that is adamant that they want to display the = predicted ratings in the form of 1 to 5-stars? That's my case right now. =

I will pose a more concrete = question. Is there any template for which the scores = predicted by the algorithm are in the same range as the ratings in the = training set?

Thank you = very much for your help!
Noelia

On 10 November 2017 at 17:57, Pat Ferrel <pat@occamsmachete.com> = wrote:
Any of the Spark MLlib ALS = recommenders in the PIO template gallery support ratings.

However I must warn that = ratings are not very good for recommendations and none of the big = players use ratings anymore, Netflix doesn=E2=80=99t even display them. = The reason is that your 2 may be my 3 or 4 and that people rate = different categories differently. For instance Netflix found Comedies = were rated lower than Independent films. There have been many solutions = proposed and tried but none have proven very helpful.

There is another more = fundamental problem, why would you want to recommend the highest rated = item? What do you buy on Amazon or watch on Netflix? Are they only your = highest rated items. Research has shown that they are not. There was a = whole misguided movement around ratings that affected academic papers = and cross-validation metrics that has fairly well been discredited. It = all came from the Netflix prize that used both. Netflix has since led = the way in dropping ratings as they saw the things I have = mentioned.

What = do you do? Categorical indicators work best (like, dislike)or implicit = indicators (buy) that are unambiguous. If a person buys something, they = like it, if the rate it 3 do they like it? I buy many 3 rated items on = Amazon if I need them. 

My advice is drop ratings and use = thumbs up or down. These are unambiguous and the thumbs down can be used = in some cases to predict thumbs up: https://developer.ibm.com/dwblog/2017/mahout-spark-correlated-cross-occurences/ This = uses data from a public web site to show significant lift by using = =E2=80=9Clike=E2=80=9D and =E2=80=9Cdislike=E2=80=9D in recommendations. = This used the Universal Recommender.


On Nov 10, 2017, at 5:02 AM, = Noelia Os=C3=A9s Fern=C3=A1ndez <noses@vicomtech.org> = wrote:


Hi = all,

I'm new to PredictionIO so I = apologise if this question is silly.

I = have an application in which users are rating different items in a scale = of 1 to 5 stars. I want to recommend items to a new user and give her = the predicted rating in number of stars. Which template should I use to = do this? Note that I need the predicted rating to be in the same range = of 1 to 5 stars.

Is it possible to do this with the ecommerce recommendation = engine?

Thank = you very much for your help!
Noelia









--

Noelia = Os=C3=A9s Fern=C3=A1ndez, PhD
Senior Researcher |
Investigadora = Senior


noses@vicomtech.org
+[34] 943 30 92 30
Data Intelligence for Energy and
Industrial Processes | Inteligencia
de Datos = para Energ=C3=ADa y Procesos
Industriales


  

member of:     

Legal Notice - Privacy = policy


= --Apple-Mail=_140BFA8E-5D85-4932-AB86-B9196B32F5DF--