Mailing-List: contact user-help@predictionio.apache.org; run by ezmlm
Precedence: bulk
Reply-To: user@predictionio.apache.org
From: Pat Ferrel <pat@occamsmachete.com>
Message-Id: <1AEA4152-EA26-4A3F-BEFE-4A8046A35212@occamsmachete.com>
Content-Type: multipart/alternative;
 boundary="Apple-Mail=_41569018-F77C-4BAD-BE6D-52293C186D85"
Mime-Version: 1.0 (Mac OS X Mail 10.3 \(3273\))
Subject: Re: Which template for predicting ratings?
Date: Mon, 13 Nov 2017 09:32:29 -0800
In-Reply-To: <CAMysefuiJ=gk6hdhMy2OSNOtEqMn6hY+xe0OmoM2euamatyiLw@mail.gmail.com>
Cc: user@predictionio.incubator.apache.org
To: user@predictionio.apache.org
References: <CAMysefszT-5T_+1D8qyf16-089KuzLS1cCJRYnZNxz3RmeXwJQ@mail.gmail.com>
 <E3ABF7AF-C44D-4EC7-B8D4-974E3050FD7B@occamsmachete.com>
 <CAMysefuiJ=gk6hdhMy2OSNOtEqMn6hY+xe0OmoM2euamatyiLw@mail.gmail.com>
archived-at: Mon, 13 Nov 2017 17:32:38 -0000


--Apple-Mail=_41569018-F77C-4BAD-BE6D-52293C186D85
Content-Transfer-Encoding: quoted-printable
Content-Type: text/plain;
	charset=utf-8

What we did in the article I attached is assume 1-2 is dislike, and 4-5 =
is like.

These are treated as indicators and will produce a score from the =
recommender but these do not relate to 1-5 scores.

If you need to predict what the user would score an item MLlib ALS =
templates will do it.


On Nov 13, 2017, at 2:42 AM, Noelia Os=C3=A9s Fern=C3=A1ndez =
<noses@vicomtech.org> wrote:

Hi Pat,

I truly appreciate your advice.

However, what to do with a client that is adamant that they want to =
display the predicted ratings in the form of 1 to 5-stars? That's my =
case right now.=20

I will pose a more concrete question. Is there any template for which =
the scores predicted by the algorithm are in the same range as the =
ratings in the training set?

Thank you very much for your help!
Noelia

On 10 November 2017 at 17:57, Pat Ferrel <pat@occamsmachete.com =
<mailto:pat@occamsmachete.com>> wrote:
Any of the Spark MLlib ALS recommenders in the PIO template gallery =
support ratings.

However I must warn that ratings are not very good for recommendations =
and none of the big players use ratings anymore, Netflix doesn=E2=80=99t =
even display them. The reason is that your 2 may be my 3 or 4 and that =
people rate different categories differently. For instance Netflix found =
Comedies were rated lower than Independent films. There have been many =
solutions proposed and tried but none have proven very helpful.

There is another more fundamental problem, why would you want to =
recommend the highest rated item? What do you buy on Amazon or watch on =
Netflix? Are they only your highest rated items. Research has shown that =
they are not. There was a whole misguided movement around ratings that =
affected academic papers and cross-validation metrics that has fairly =
well been discredited. It all came from the Netflix prize that used =
both. Netflix has since led the way in dropping ratings as they saw the =
things I have mentioned.

What do you do? Categorical indicators work best (like, dislike)or =
implicit indicators (buy) that are unambiguous. If a person buys =
something, they like it, if the rate it 3 do they like it? I buy many 3 =
rated items on Amazon if I need them.=20

My advice is drop ratings and use thumbs up or down. These are =
unambiguous and the thumbs down can be used in some cases to predict =
thumbs up: =
https://developer.ibm.com/dwblog/2017/mahout-spark-correlated-cross-occure=
nces/ =
<https://developer.ibm.com/dwblog/2017/mahout-spark-correlated-cross-occur=
ences/> This uses data from a public web site to show significant lift =
by using =E2=80=9Clike=E2=80=9D and =E2=80=9Cdislike=E2=80=9D in =
recommendations. This used the Universal Recommender.


On Nov 10, 2017, at 5:02 AM, Noelia Os=C3=A9s Fern=C3=A1ndez =
<noses@vicomtech.org <mailto:noses@vicomtech.org>> wrote:


Hi all,

I'm new to PredictionIO so I apologise if this question is silly.

I have an application in which users are rating different items in a =
scale of 1 to 5 stars. I want to recommend items to a new user and give =
her the predicted rating in number of stars. Which template should I use =
to do this? Note that I need the predicted rating to be in the same =
range of 1 to 5 stars.

Is it possible to do this with the ecommerce recommendation engine?

Thank you very much for your help!
Noelia


--=20
 <http://www.vicomtech.org/>

Noelia Os=C3=A9s Fern=C3=A1ndez, PhD
Senior Researcher |
Investigadora Senior

noses@vicomtech.org <mailto:noses@vicomtech.org>
+[34] 943 30 92 30
Data Intelligence for Energy and
Industrial Processes | Inteligencia
de Datos para Energ=C3=ADa y Procesos
Industriales

 <https://www.linkedin.com/company/vicomtech>  =
<https://www.youtube.com/user/VICOMTech>  =
<https://twitter.com/@Vicomtech_IK4>

member of:  <http://www.graphicsmedia.net/>     <http://www.ik4.es/>

Legal Notice - Privacy policy =
<http://www.vicomtech.org/en/proteccion-datos>

--Apple-Mail=_41569018-F77C-4BAD-BE6D-52293C186D85
Content-Transfer-Encoding: quoted-printable
Content-Type: text/html;
	charset=utf-8

<html><head><meta http-equiv=3D"Content-Type" content=3D"text/html =
charset=3Dutf-8"></head><body style=3D"word-wrap: break-word; =
-webkit-nbsp-mode: space; -webkit-line-break: after-white-space;" =
class=3D"">What we did in the article I attached is assume 1-2 is =
dislike, and 4-5 is like.<div class=3D""><br class=3D""></div><div =
class=3D"">These are treated as indicators and will produce a score from =
the recommender but these do not relate to 1-5 scores.</div><div =
class=3D""><br class=3D""></div><div class=3D"">If you need to predict =
what the user would score an item MLlib ALS templates will do =
it.</div><div class=3D""><br class=3D""></div><div class=3D""><br =
class=3D""></div><div class=3D""><br class=3D""><div><div class=3D"">On =
Nov 13, 2017, at 2:42 AM, Noelia Os=C3=A9s Fern=C3=A1ndez &lt;<a =
href=3D"mailto:noses@vicomtech.org" class=3D"">noses@vicomtech.org</a>&gt;=
 wrote:</div><br class=3D"Apple-interchange-newline"><div class=3D""><div =
dir=3D"ltr" class=3D""><div class=3D""><div class=3D""><div =
class=3D""><div class=3D""><div class=3D"">Hi Pat,<br class=3D""><br =
class=3D""></div>I truly appreciate your advice.<br class=3D""><br =
class=3D""></div>However, what to do with a client that is adamant that =
they want to display the predicted ratings in the form of 1 to 5-stars? =
That's my case right now. <br class=3D""><br class=3D""></div>I will =
pose a more concrete question. <b class=3D"">Is there any template for =
which the scores predicted by the algorithm are in the same range as the =
ratings in the training set?</b></div><div class=3D""><br =
class=3D""></div>Thank you very much for your help!<br =
class=3D""></div>Noelia<br class=3D""></div><div class=3D"gmail_extra"><br=
 class=3D""><div class=3D"gmail_quote">On 10 November 2017 at 17:57, Pat =
Ferrel <span dir=3D"ltr" class=3D"">&lt;<a =
href=3D"mailto:pat@occamsmachete.com" target=3D"_blank" =
class=3D"">pat@occamsmachete.com</a>&gt;</span> wrote:<br =
class=3D""><blockquote class=3D"gmail_quote" style=3D"margin:0 0 0 =
.8ex;border-left:1px #ccc solid;padding-left:1ex"><div =
style=3D"word-wrap:break-word" class=3D"">Any of the Spark MLlib ALS =
recommenders in the PIO template gallery support ratings.<div =
class=3D""><br class=3D""></div><div class=3D"">However I must warn that =
ratings are not very good for recommendations and none of the big =
players use ratings anymore, Netflix doesn=E2=80=99t even display them. =
The reason is that your 2 may be my 3 or 4 and that people rate =
different categories differently. For instance Netflix found Comedies =
were rated lower than Independent films. There have been many solutions =
proposed and tried but none have proven very helpful.</div><div =
class=3D""><br class=3D""></div><div class=3D"">There is another more =
fundamental problem, why would you want to recommend the highest rated =
item? What do you buy on Amazon or watch on Netflix? Are they only your =
highest rated items. Research has shown that they are not. There was a =
whole misguided movement around ratings that affected academic papers =
and cross-validation metrics that has fairly well been discredited. It =
all came from the Netflix prize that used both. Netflix has since led =
the way in dropping ratings as they saw the things I have =
mentioned.</div><div class=3D""><br class=3D""></div><div class=3D"">What =
do you do? Categorical indicators work best (like, dislike)or implicit =
indicators (buy) that are unambiguous. If a person buys something, they =
like it, if the rate it 3 do they like it? I buy many 3 rated items on =
Amazon if I need them.&nbsp;</div><div class=3D""><br =
class=3D""></div><div class=3D"">My advice is drop ratings and use =
thumbs up or down. These are unambiguous and the thumbs down can be used =
in some cases to predict thumbs up:&nbsp;<a =
href=3D"https://developer.ibm.com/dwblog/2017/mahout-spark-correlated-cros=
s-occurences/" target=3D"_blank" class=3D"">https://developer.ibm.com/<wbr=
 class=3D"">dwblog/2017/mahout-spark-<wbr =
class=3D"">correlated-cross-occurences/</a>&nbsp;<wbr class=3D"">This =
uses data from a public web site to show significant lift by using =
=E2=80=9Clike=E2=80=9D and =E2=80=9Cdislike=E2=80=9D in recommendations. =
This used the Universal Recommender.</div><div class=3D""><div =
class=3D"h5"><div class=3D""><br class=3D""></div><div class=3D""><br =
class=3D""><div class=3D""><div class=3D"">On Nov 10, 2017, at 5:02 AM, =
Noelia Os=C3=A9s Fern=C3=A1ndez &lt;<a href=3D"mailto:noses@vicomtech.org"=
 target=3D"_blank" class=3D"">noses@vicomtech.org</a>&gt; =
wrote:</div><br =
class=3D"m_-3732178135098576068Apple-interchange-newline"><div =
class=3D""><div dir=3D"ltr" class=3D""><div class=3D""><div =
class=3D""><div class=3D""><div class=3D""><br class=3D""></div>Hi =
all,<br class=3D""><br class=3D""></div>I'm new to PredictionIO so I =
apologise if this question is silly.<br class=3D""><br class=3D""></div>I =
have an application in which users are rating different items in a scale =
of 1 to 5 stars. I want to recommend items to a new user and give her =
the predicted rating in number of stars. Which template should I use to =
do this? Note that I need the predicted rating to be in the same range =
of 1 to 5 stars.</div><div class=3D""><br class=3D""></div><div =
class=3D"">Is it possible to do this with the ecommerce recommendation =
engine?</div><div class=3D""><br class=3D""></div><div class=3D"">Thank =
you very much for your help!</div><div class=3D"">Noelia<br =
class=3D""></div><div class=3D""><br class=3D""></div><div class=3D""><div=
 class=3D""><div class=3D""><div class=3D""><div =
class=3D"m_-3732178135098576068gmail_signature" =
data-smartmail=3D"gmail_signature"><div dir=3D"ltr" class=3D""><table =
cellspacing=3D"0" cellpadding=3D"2" border=3D"0" class=3D""><tbody =
class=3D""><tr class=3D""><td class=3D""></td></tr><tr class=3D""><td =
class=3D""></td></tr><tr class=3D""><td =
style=3D"border-width:2px;border-color:#00abc9;border-bottom-style:solid" =
class=3D""></td></tr><tr class=3D""><td class=3D""><br =
class=3D""></td></tr><tr class=3D""><td class=3D""><br =
class=3D""></td></tr><tr class=3D""><td class=3D""><br =
class=3D""></td></tr><tr class=3D""><td class=3D""><span =
style=3D"font-size:10px;font-family:'CENTURY =
GOTHIC';font-weight:normal;font-style:italic" class=3D""></span><br =
class=3D""></td></tr></tbody></table></div></div>
</div></div></div></div></div>
</div></div><br class=3D""></div></div></div></div></blockquote></div><br =
class=3D""><br clear=3D"all" class=3D""><br class=3D"">-- <br =
class=3D""><div class=3D"gmail_signature" =
data-smartmail=3D"gmail_signature"><div dir=3D"ltr" class=3D""><table =
cellspacing=3D"0" cellpadding=3D"2" border=3D"0" class=3D""><tbody =
class=3D""><tr class=3D""><td class=3D""><a =
href=3D"http://www.vicomtech.org/" target=3D"_blank" class=3D""><img =
src=3D"http://www.vicomtech.org/firmas/html/Vicomtech209.png" =
width=3D"209px" height=3D"50px" border=3D"0" class=3D""></a></td></tr><tr =
class=3D""><td class=3D""><br class=3D""><span style=3D"font-size: 12px; =
font-family: 'CENTURY GOTHIC'; font-weight: bold;" class=3D"">Noelia =
Os=C3=A9s Fern=C3=A1ndez, PhD</span><br class=3D""><span =
style=3D"font-size: 12px; font-family: 'CENTURY GOTHIC';" =
class=3D"">Senior Researcher |<br class=3D"">Investigadora =
Senior</span><br class=3D""></td></tr><tr class=3D""><td =
style=3D"border-width:2px;border-color:#00abc9;border-bottom-style:solid" =
class=3D""><br class=3D""><span style=3D"font-size: 12px; font-family: =
'CENTURY GOTHIC';" class=3D""><a href=3D"mailto:noses@vicomtech.org" =
style=3D"" target=3D"_blank" class=3D"">noses@vicomtech.org</a></span><br =
class=3D""><span style=3D"font-size: 12px; font-family: 'CENTURY =
GOTHIC';" =
class=3D"">+[34]&nbsp;943&nbsp;30&nbsp;92&nbsp;30</span></td></tr><tr =
class=3D""><td class=3D""><span style=3D"font-size: 11px; font-family: =
'CENTURY GOTHIC';" class=3D"">Data Intelligence for Energy and<br =
class=3D"">Industrial Processes | Inteligencia<br class=3D"">de Datos =
para Energ=C3=ADa y Procesos<br class=3D"">Industriales</span><br =
class=3D""></td></tr><tr class=3D""><td class=3D""><br class=3D""><a =
href=3D"https://www.linkedin.com/company/vicomtech" target=3D"_blank" =
class=3D""><img =
src=3D"http://www.vicomtech.org/firmas/html/linkedinCuadrado.png" =
longdesc=3D"https://ci3.googleusercontent.com/proxy/hW852P1NQyBr95ExDzqjjx=
hidZSIWKCCUdU1VT29kxBMDqN19A=3Ds0-d-e1-ft#http://Linkedin" border=3D"0" =
class=3D""></a>&nbsp;<a href=3D"https://www.youtube.com/user/VICOMTech" =
target=3D"_blank" class=3D""><img =
src=3D"http://www.vicomtech.org/firmas/html/youtubeCuadrado.png" =
longdesc=3D"https://ci4.googleusercontent.com/proxy/AnwyIZ_mq9hO7MAdBT799p=
rJM8zvoMuTVX3TlSdGwW8lEzoH=3Ds0-d-e1-ft#http://YouTube" border=3D"0" =
class=3D""></a>&nbsp;<a href=3D"https://twitter.com/@Vicomtech_IK4" =
target=3D"_blank" class=3D""><img =
src=3D"http://www.vicomtech.org/firmas/html/twitterCuadrado.png" =
longdesc=3D"https://ci4.googleusercontent.com/proxy/yobGDeBa5vD8JNPMjMOSuu=
qnm76ITf8qsSr_hssY10Xy7jZb=3Ds0-d-e1-ft#http://Twitter" border=3D"0" =
class=3D""></a></td></tr><tr class=3D""><td class=3D""><br =
class=3D""><span style=3D"font-size: 12px; font-family: 'CENTURY =
GOTHIC';" class=3D"">member of:&nbsp;<a =
href=3D"http://www.graphicsmedia.net/" target=3D"_blank" class=3D""><img =
src=3D"http://www.vicomtech.org/firmas/html/gmn68.png" =
longdesc=3D"https://ci6.googleusercontent.com/proxy/56MP72EbETJKMabagk3wan=
WxpF4rXW-fgRKFsT0ioMI-W_jElHl4xInIBxwz=3Ds0-d-e1-ft#http://GraphicsMediaNe=
t" style=3D"vertical-align:middle" width=3D"68px" height=3D"25px" =
border=3D"0" class=3D""></a>&nbsp;&nbsp;&nbsp;&nbsp;<a =
href=3D"http://www.ik4.es/" target=3D"_blank" class=3D""><img =
src=3D"http://www.vicomtech.org/firmas/html/IK4_43.png" =
longdesc=3D"https://ci5.googleusercontent.com/proxy/q30xZwwA01h1TWHBqQ87Oz=
jUODQQFFxVPgcf7kwux00=3Ds0-d-e1-ft#http://IK4" =
style=3D"vertical-align:middle" width=3D"43px" height=3D"24px" =
border=3D"0" class=3D""></a></span></td></tr><tr class=3D""><td =
class=3D""><br class=3D""><span style=3D"font-size: 10px; font-family: =
'CENTURY GOTHIC'; font-weight: normal; font-style: italic;" class=3D""><a =
href=3D"http://www.vicomtech.org/en/proteccion-datos" style=3D"" =
target=3D"_blank" class=3D"">Legal Notice - Privacy =
policy</a></span></td></tr></tbody></table></div></div>
</div>
</div></div><br class=3D""></div></body></html>=

--Apple-Mail=_41569018-F77C-4BAD-BE6D-52293C186D85--