Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id F1ADC200D3D for ; Mon, 13 Nov 2017 18:32:37 +0100 (CET) Received: by cust-asf.ponee.io (Postfix) id EFE40160BF0; Mon, 13 Nov 2017 17:32:37 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id E7784160BF3 for ; Mon, 13 Nov 2017 18:32:36 +0100 (CET) Received: (qmail 43699 invoked by uid 500); 13 Nov 2017 17:32:36 -0000 Mailing-List: contact user-help@predictionio.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@predictionio.apache.org Delivered-To: mailing list user@predictionio.apache.org Received: (qmail 43689 invoked by uid 99); 13 Nov 2017 17:32:36 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd2-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 13 Nov 2017 17:32:36 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd2-us-west.apache.org (ASF Mail Server at spamd2-us-west.apache.org) with ESMTP id 485E81A03A3 for ; Mon, 13 Nov 2017 17:32:35 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd2-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: -0.789 X-Spam-Level: X-Spam-Status: No, score=-0.789 tagged_above=-999 required=6.31 tests=[DKIM_SIGNED=0.1, DKIM_VALID=-0.1, HTML_MESSAGE=2, KAM_SHORT=0.001, RCVD_IN_DNSWL_NONE=-0.0001, RCVD_IN_MSPIKE_H2=-2.8, T_REMOTE_IMAGE=0.01] autolearn=disabled Authentication-Results: spamd2-us-west.apache.org (amavisd-new); dkim=pass (2048-bit key) header.d=occamsmachete-com.20150623.gappssmtp.com Received: from mx1-lw-eu.apache.org ([10.40.0.8]) by localhost (spamd2-us-west.apache.org [10.40.0.9]) (amavisd-new, port 10024) with ESMTP id KCOAeKijZ-0e for ; Mon, 13 Nov 2017 17:32:33 +0000 (UTC) Received: from mail-pf0-f194.google.com (mail-pf0-f194.google.com [209.85.192.194]) by mx1-lw-eu.apache.org (ASF Mail Server at mx1-lw-eu.apache.org) with ESMTPS id 2404C60EF1 for ; Mon, 13 Nov 2017 17:32:32 +0000 (UTC) Received: by mail-pf0-f194.google.com with SMTP id m88so5042711pfi.9 for ; Mon, 13 Nov 2017 09:32:32 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=occamsmachete-com.20150623.gappssmtp.com; s=20150623; h=from:message-id:mime-version:subject:date:in-reply-to:cc:to :references; bh=Q/o1PzukQv16RDsRXe3s9xFPK1trRxK7W0SuR/uSN7g=; b=gjRp/xWcHD7yjkyIt7vUlVkkfqzprP3cUJm6fgHxCDcPTh/C5BxUODNLcUX6Us0s0C HmFRbphPVZ+pyu0mlh0TBDJ4qnkGEEB/tAsCKZu9nxTTyFIEvspgYBRNYSqzCArtl+Ed Ml9cPOP7Kmj47hidyT+svM05vXrshWNlqIiv9YlzCqLCgzo0FXy7HLSvn9+2liaYE8Jf RPogqNgLklckBeoRvxb6NEUZbNF6roCwKgD8kA7n5lcUk0ykk8XL0LqMToWSD6A50Kbl vtM5MqyA/fBlUcEJLpO8L5t+oEnjSLrvWnMCemx1jG+4YJFYbWD2DkQcVTwIjEzzqD8T l80A== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:from:message-id:mime-version:subject:date :in-reply-to:cc:to:references; bh=Q/o1PzukQv16RDsRXe3s9xFPK1trRxK7W0SuR/uSN7g=; b=WkXzYCkhJXag7gpITMiRdlpCxLIYP10HFoqzHPO4Q87mclDwmgcGF5pmB5uDbYLt7x vupBkjtPtF2cyhoiNzAZT/Xl2HkIcjdPGqzeI2a7Fnh6diDACWacij/wa9VvrYVSIx/F rnEx057g1RV49Kt9GoNCAFq4pa161Qh3/EKMNBMTSNCm4AG3icGa7II4eNT35PjMuyh/ Rbw1J5rkCmjgSQc9EgM+pm5VW6hswqWp8uF8KVVVVYMxhD9Wwk8Q1j0h8QJR1NOGtIoE nqzFDkXefaM7Bpy660Z4JB6/KKENqyFVSDVbjz+Kgylx3aQ9U8Ta/7YhC1aFOpy4UzJV oVSQ== X-Gm-Message-State: AJaThX4xAnUqoK02VNf8gHSv2IkB+9kAJly2BKWIjke04Clpkxf86i6j nKB0k2DSqnYd88MO0j9DrFTJoA== X-Google-Smtp-Source: AGs4zMac8qIFrvzZ8dLkCm8p474W+M7PHTqGkckz5lQgbmVVzY83yVSuh1W9xVAkXXMu7bxcTM86rw== X-Received: by 10.101.77.140 with SMTP id p12mr9412947pgq.145.1510594350502; Mon, 13 Nov 2017 09:32:30 -0800 (PST) Received: from [192.168.0.4] (c-24-18-213-211.hsd1.wa.comcast.net. [24.18.213.211]) by smtp.gmail.com with ESMTPSA id z22sm1939846pfl.135.2017.11.13.09.32.29 (version=TLS1_2 cipher=ECDHE-RSA-AES128-GCM-SHA256 bits=128/128); Mon, 13 Nov 2017 09:32:29 -0800 (PST) From: Pat Ferrel Message-Id: <1AEA4152-EA26-4A3F-BEFE-4A8046A35212@occamsmachete.com> Content-Type: multipart/alternative; boundary="Apple-Mail=_41569018-F77C-4BAD-BE6D-52293C186D85" Mime-Version: 1.0 (Mac OS X Mail 10.3 \(3273\)) Subject: Re: Which template for predicting ratings? Date: Mon, 13 Nov 2017 09:32:29 -0800 In-Reply-To: Cc: user@predictionio.incubator.apache.org To: user@predictionio.apache.org References: X-Mailer: Apple Mail (2.3273) archived-at: Mon, 13 Nov 2017 17:32:38 -0000 --Apple-Mail=_41569018-F77C-4BAD-BE6D-52293C186D85 Content-Transfer-Encoding: quoted-printable Content-Type: text/plain; charset=utf-8 What we did in the article I attached is assume 1-2 is dislike, and 4-5 = is like. These are treated as indicators and will produce a score from the = recommender but these do not relate to 1-5 scores. If you need to predict what the user would score an item MLlib ALS = templates will do it. On Nov 13, 2017, at 2:42 AM, Noelia Os=C3=A9s Fern=C3=A1ndez = wrote: Hi Pat, I truly appreciate your advice. However, what to do with a client that is adamant that they want to = display the predicted ratings in the form of 1 to 5-stars? That's my = case right now.=20 I will pose a more concrete question. Is there any template for which = the scores predicted by the algorithm are in the same range as the = ratings in the training set? Thank you very much for your help! Noelia On 10 November 2017 at 17:57, Pat Ferrel > wrote: Any of the Spark MLlib ALS recommenders in the PIO template gallery = support ratings. However I must warn that ratings are not very good for recommendations = and none of the big players use ratings anymore, Netflix doesn=E2=80=99t = even display them. The reason is that your 2 may be my 3 or 4 and that = people rate different categories differently. For instance Netflix found = Comedies were rated lower than Independent films. There have been many = solutions proposed and tried but none have proven very helpful. There is another more fundamental problem, why would you want to = recommend the highest rated item? What do you buy on Amazon or watch on = Netflix? Are they only your highest rated items. Research has shown that = they are not. There was a whole misguided movement around ratings that = affected academic papers and cross-validation metrics that has fairly = well been discredited. It all came from the Netflix prize that used = both. Netflix has since led the way in dropping ratings as they saw the = things I have mentioned. What do you do? Categorical indicators work best (like, dislike)or = implicit indicators (buy) that are unambiguous. If a person buys = something, they like it, if the rate it 3 do they like it? I buy many 3 = rated items on Amazon if I need them.=20 My advice is drop ratings and use thumbs up or down. These are = unambiguous and the thumbs down can be used in some cases to predict = thumbs up: = https://developer.ibm.com/dwblog/2017/mahout-spark-correlated-cross-occure= nces/ = This uses data from a public web site to show significant lift = by using =E2=80=9Clike=E2=80=9D and =E2=80=9Cdislike=E2=80=9D in = recommendations. This used the Universal Recommender. On Nov 10, 2017, at 5:02 AM, Noelia Os=C3=A9s Fern=C3=A1ndez = > wrote: Hi all, I'm new to PredictionIO so I apologise if this question is silly. I have an application in which users are rating different items in a = scale of 1 to 5 stars. I want to recommend items to a new user and give = her the predicted rating in number of stars. Which template should I use = to do this? Note that I need the predicted rating to be in the same = range of 1 to 5 stars. Is it possible to do this with the ecommerce recommendation engine? Thank you very much for your help! Noelia --=20 Noelia Os=C3=A9s Fern=C3=A1ndez, PhD Senior Researcher | Investigadora Senior noses@vicomtech.org +[34] 943 30 92 30 Data Intelligence for Energy and Industrial Processes | Inteligencia de Datos para Energ=C3=ADa y Procesos Industriales = = member of: Legal Notice - Privacy policy = --Apple-Mail=_41569018-F77C-4BAD-BE6D-52293C186D85 Content-Transfer-Encoding: quoted-printable Content-Type: text/html; charset=utf-8 What we did in the article I attached is assume 1-2 is = dislike, and 4-5 is like.

These are treated as indicators and will produce a score from = the recommender but these do not relate to 1-5 scores.

If you need to predict = what the user would score an item MLlib ALS templates will do = it.



On = Nov 13, 2017, at 2:42 AM, Noelia Os=C3=A9s Fern=C3=A1ndez <noses@vicomtech.org>= wrote:

Hi Pat,

I truly appreciate your advice.

However, what to do with a client that is adamant that = they want to display the predicted ratings in the form of 1 to 5-stars? = That's my case right now.

I will = pose a more concrete question. Is there any template for = which the scores predicted by the algorithm are in the same range as the = ratings in the training set?

Thank you very much for your help!
Noelia
On 10 November 2017 at 17:57, Pat = Ferrel <pat@occamsmachete.com> wrote:
Any of the Spark MLlib ALS = recommenders in the PIO template gallery support ratings.

However I must warn that = ratings are not very good for recommendations and none of the big = players use ratings anymore, Netflix doesn=E2=80=99t even display them. = The reason is that your 2 may be my 3 or 4 and that people rate = different categories differently. For instance Netflix found Comedies = were rated lower than Independent films. There have been many solutions = proposed and tried but none have proven very helpful.

There is another more = fundamental problem, why would you want to recommend the highest rated = item? What do you buy on Amazon or watch on Netflix? Are they only your = highest rated items. Research has shown that they are not. There was a = whole misguided movement around ratings that affected academic papers = and cross-validation metrics that has fairly well been discredited. It = all came from the Netflix prize that used both. Netflix has since led = the way in dropping ratings as they saw the things I have = mentioned.

What = do you do? Categorical indicators work best (like, dislike)or implicit = indicators (buy) that are unambiguous. If a person buys something, they = like it, if the rate it 3 do they like it? I buy many 3 rated items on = Amazon if I need them. 

My advice is drop ratings and use = thumbs up or down. These are unambiguous and the thumbs down can be used = in some cases to predict thumbs up: https://developer.ibm.com/dwblog/2017/mahout-spark-correlated-cross-occurences/ This = uses data from a public web site to show significant lift by using = =E2=80=9Clike=E2=80=9D and =E2=80=9Cdislike=E2=80=9D in recommendations. = This used the Universal Recommender.


On Nov 10, 2017, at 5:02 AM, = Noelia Os=C3=A9s Fern=C3=A1ndez <noses@vicomtech.org> = wrote:


Hi = all,

I'm new to PredictionIO so I = apologise if this question is silly.

I = have an application in which users are rating different items in a scale = of 1 to 5 stars. I want to recommend items to a new user and give her = the predicted rating in number of stars. Which template should I use to = do this? Note that I need the predicted rating to be in the same range = of 1 to 5 stars.

Is it possible to do this with the ecommerce recommendation = engine?

Thank = you very much for your help!
Noelia









--

Noelia = Os=C3=A9s Fern=C3=A1ndez, PhD
Senior Researcher |
Investigadora = Senior


noses@vicomtech.org
+[34] 943 30 92 30
Data Intelligence for Energy and
Industrial Processes | Inteligencia
de Datos = para Energ=C3=ADa y Procesos
Industriales


  

member of:     

Legal Notice - Privacy = policy

= --Apple-Mail=_41569018-F77C-4BAD-BE6D-52293C186D85--