From: Marius Rabenarivo
Date: Sun, 4 Jun 2017 08:14:41 +0400
Subject: Re: Use of latent informations associated to items with Mahout's SimilarityAnalysis.cooccurrences
To: Pat Ferrel
Cc: actionml-user, user@predictionio.incubator.apache.org

And what is the T in the slides for?

How can we implement it if it is not implemented yet?

2017-06-04 8:11 GMT+04:00 Pat Ferrel <pat@occamsmachete.com>:
By purchasing an item with a tag that you have given it, they are displaying a preference for that tag.


On Jun 3, 2017, at 12:36 PM, Marius Rabenarivo <mariusrabenarivo@gmail.com> wrote:
So the tag here is assumed to be a tag given by the user to an item?

I was thinking that it was some kind of tag we give to the item by some means (classification, LDA, etc.).

2017-06-03 21:14 GMT+04:00 Pat Ferrel <pat@occamsmachete.com>:
A = history of all purchases (in the e-com case)
B = history of all tag preferences

r = [A'A]h_a + [A'B]h_b

The part in the slides about content-based recs is not needed here because you have captured them as user preferences.
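
As a rough illustration of the formula above, the following sketch computes r = [A'A]h_a + [A'B]h_b on a tiny hand-made data set in plain Python. The matrices, the history vectors, and the helper names are all made up for illustration. Raw cooccurrence counts stand in for the LLR-filtered indicators that CCO uses, so this shows only the shapes and the arithmetic, not Mahout's implementation.

```python
# Toy data, assumed for illustration: 3 users, 3 items, 2 tags.
# A is |U| x |I| (purchases), B is |U| x |T| (tag preferences).
A = [
    [1, 1, 0],
    [0, 1, 1],
    [1, 0, 1],
]
B = [
    [1, 0],
    [0, 1],
    [1, 1],
]


def transpose(m):
    """Return the transpose of a matrix stored as a list of rows."""
    return [list(col) for col in zip(*m)]


def matmul(x, y):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*y)] for row in x]


def matvec(m, v):
    """Multiply a matrix by a vector."""
    return [sum(a * b for a, b in zip(row, v)) for row in m]


At = transpose(A)
AtA = matmul(At, A)  # |I| x |I|: item-item cooccurrence
AtB = matmul(At, B)  # |I| x |T|: item-tag cross-occurrence

h_a = [1, 0, 0]  # the user's purchase history, length |I|
h_b = [0, 1]     # the user's tag-preference history, length |T|

# Both terms have length |I|, so they can be added item by item.
r = [x + y for x, y in zip(matvec(AtA, h_a), matvec(AtB, h_b))]
print(r)
```

Because the cross term is A'B rather than B'B, it maps a tag history of length |T| onto items, which is why both terms of r are item scores of length |I|.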


On Jun 2, 2017, at 7:22 PM, Marius Rabenarivo <mariusrabenarivo@gmail.com> wrote:
Please correct side to size in my previous e-mail

2017-06-03 6:14 GMT+04:00 Marius Rabenarivo <mariusrabenarivo@gmail.com>:
What will be the size of the matrix if we send an event like tag-pref?
We will get a |U| x |T| matrix, I think (where T is the set of all tags).

So [AtA] will be a |T| x |T| matrix and we will do a dot product with the user history hT to get recommendations, right?

I was assuming that A should be of side |U| x |I|, where I is the set of all items, as it should be added to the other terms of the whole enchilada formula afterwards.

Thank you for your guidance, Pat.

2017-06-02 21:35 GMT+04:00 Pat Ferrel <pat@occamsmachete.com>:
Please refer to the documents. The “event” is the name of the type of event or indicator of preference; it implies the type of the targetEntityId. So a “tag-pref” event would be accompanied by a targetEntityId = tag-id. This is separate from attaching “tag” properties to items with the $set event for use with filter and boost rules. One looks at the data as a possible preference indicator and the other is used to restrict results. This is why we usually name events so they sound like a user preference of some type, whereas item property values are simply item attributes, intrinsic to the items and independent of an individual user.

The event can have any name that makes sense to you.
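
To make the distinction concrete, here is a minimal sketch of the two kinds of events as JSON payloads built in Python. The field names follow the PredictionIO event format as assumed here, and the ids, the "tag" target type, and the "tags" property name are hypothetical, so check them against the UR docs before relying on them.

```python
import json

# A preference indicator: a user showed a preference for a tag.
# "tag-pref" is the event name used in this thread; the ids are hypothetical.
tag_pref_event = {
    "event": "tag-pref",
    "entityType": "user",
    "entityId": "user-1",
    "targetEntityType": "tag",  # assumed type name for the target
    "targetEntityId": "tag-id",
}

# An item attribute: tags attached to the item itself with $set.
# It has no user and no target, and is meant for filter and boost rules.
item_set_event = {
    "event": "$set",
    "entityType": "item",
    "entityId": "item-1",
    "properties": {"tags": ["tag-id", "other-tag"]},  # hypothetical property name
}

for event in (tag_pref_event, item_set_event):
    print(json.dumps(event, sort_keys=True))
```

The first payload ties a user to a tag and so can act as an indicator; the second describes the item alone and says nothing about any individual user.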


On Jun 2, 2017, at 9:19 AM, Marius Rabenarivo <mariusrabenarivo@gmail.com> wrote:
So, the event field should be the token and targetEntityId the item ID, right?

2017-06-02 20:07 GMT+04:00 Pat Ferrel <pat@occamsmachete.com>:
Yes, each is analyzed separately as a separate event. If you are using REST you can send up to 50 events in a single array. Some SDKs may support this too.
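
One way to respect the 50-event limit mentioned above is to split a long list of events into chunks before serializing each chunk as one JSON array. This is only a sketch: the helper name, the sample events, and the constant are made up for illustration, and the code stops short of sending anything over HTTP.

```python
import json

MAX_BATCH_SIZE = 50  # the per-array limit mentioned in the thread


def chunk_events(events, size=MAX_BATCH_SIZE):
    """Split a list of events into consecutive lists of at most `size` items."""
    if size < 1:
        raise ValueError("size must be at least 1")
    return [events[i:i + size] for i in range(0, len(events), size)]


# Hypothetical events: one user expressing 120 tag preferences.
events = [
    {
        "event": "tag-pref",
        "entityType": "user",
        "entityId": "user-1",
        "targetEntityType": "tag",
        "targetEntityId": "tag-%d" % n,
    }
    for n in range(120)
]

batches = chunk_events(events)
payloads = [json.dumps(batch) for batch in batches]  # one JSON array per request
print([len(batch) for batch in batches])
```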


On Jun 2, 2017, at 8:56 AM, Marius Rabenarivo <mariusrabenarivo@gmail.com> wrote:

So I have to send an event like category-preference for each tag associated with an item, right?

entityId: user-id
event: category-preference
targetEntityId: tag/token
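
The question above can be sketched as a small helper that turns the tags of one item into one event per tag. The function name, the entity type values, and the sample ids are hypothetical; only the three fields listed in the message come from the thread.

```python
def tag_preference_events(user_id, tags, event_name="category-preference"):
    """Build one preference event per distinct tag, keeping first-seen order."""
    seen = set()
    events = []
    for tag in tags:
        if tag in seen:
            continue  # send each tag only once per user action
        seen.add(tag)
        events.append(
            {
                "event": event_name,
                "entityType": "user",       # assumed value
                "entityId": user_id,
                "targetEntityType": "tag",  # assumed value
                "targetEntityId": tag,
            }
        )
    return events


# A user acts on an item tagged "scifi" and "space" (with one repeat).
events = tag_preference_events("user-1", ["scifi", "space", "scifi"])
for event in events:
    print(event["event"], event["entityId"], event["targetEntityId"])
```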

2017-06-02 19:47 GMT+04:00 Pat Ferrel <pat@occamsmachete.com>:
When a user expresses a preference for a tag, word, or term, as in search or even in content like descriptions, these can be considered secondary events. The most useful are tags and search terms in our experience. Content can be used, but each term/token needs to be sent as a separate preference; search phrases can be used too, though again turning them into tokens may be better.

Please look through the docs here: http://actionml.com/docs/ur or the slide deck here: https://www.slideshare.net/pferrel/unified-recommender-39986309

The major innovation of CCO, the algorithm behind the UR, is the use of these cross-domain indicators. They are not guaranteed to predict conversions, but the CCO algo tests them and weights them low if they do not. So we tend to test the strength of prediction of the entire category of indicator and drop it if weak, or set a minLLR threshold and filter weak individual indicators out.
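
For readers unfamiliar with the LLR test, the sketch below computes Dunning's log-likelihood ratio for a 2x2 contingency table in its entropy form and then drops indicators that fall under a threshold. The counts, the indicator names, and the `min_llr` parameter are made up for illustration; this is not the UR's or Mahout's code, only the same kind of test on toy numbers.

```python
import math


def x_log_x(x):
    """Return x * ln(x), with the usual convention that 0 * ln(0) = 0."""
    return 0.0 if x == 0 else x * math.log(x)


def entropy(*counts):
    """Unnormalized Shannon entropy of a set of counts."""
    return x_log_x(sum(counts)) - sum(x_log_x(c) for c in counts)


def llr(k11, k12, k21, k22):
    """Log-likelihood ratio for a 2x2 table of user counts.

    k11: indicator and conversion together, k12: indicator only,
    k21: conversion only, k22: neither.
    """
    row = entropy(k11 + k12, k21 + k22)
    col = entropy(k11 + k21, k12 + k22)
    mat = entropy(k11, k12, k21, k22)
    return max(0.0, 2.0 * (row + col - mat))  # clamp tiny negative rounding errors


def filter_indicators(scores, min_llr):
    """Keep only the indicators whose score reaches the threshold."""
    return {name: s for name, s in scores.items() if s >= min_llr}


# Hypothetical counts: one tag that tracks conversions, one that does not.
counts = {
    "tag:scifi": (10, 0, 0, 10),
    "tag:misc": (10, 10, 10, 10),
}
scores = {name: llr(*k) for name, k in counts.items()}
strong = filter_indicators(scores, min_llr=5.0)
print(scores, sorted(strong))
```

An indicator that occurs independently of the conversion scores near zero and is filtered out, while one that always co-occurs with it scores high and is kept.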

Technically these are not called latent; that has another meaning in Machine Learning having to do with Latent Factor Analysis.


On Jun 1, 2017, at 11:26 PM, Marius Rabenarivo <mariusrabenarivo@gmail.com> wrote:

Hello everyone!

Do you have an idea on how to use latent information associated with items, like tags or word vector embeddings, in Mahout's SimilarityAnalysis.cooccurrences?
Regards,

Marius

--
You received this message because you are subscribed to the Google Groups "actionml-user" group.
To unsubscribe from this group and stop receiving emails from it, send an email to actionml-user+unsubscribe@googlegroups.com.
To post to this group, send email to actionml-user@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/actionml-user/CAC-ATVEO_YON-5E95iPJjBR-FUgEv8TQsOA0rtD-xg0u-tNA_g%40mail.gmail.com.
For more options, visit https://groups.google.com/d/optout.