Mailing-List: contact user-help@predictionio.apache.org; run by ezmlm
Precedence: bulk
Reply-To: user@predictionio.apache.org
MIME-Version: 1.0
In-Reply-To: <CAEWeDuzki5TJD=FHsvqMEL7iXeVrq=Va96dySqADDpKu1MuvKQ@mail.gmail.com>
References: <CAMysefvrPhjcVdjDLBhCvBiYX-eAxSLMz-o6XYOGRrk1PGT72w@mail.gmail.com>
 <CAEWeDuzki5TJD=FHsvqMEL7iXeVrq=Va96dySqADDpKu1MuvKQ@mail.gmail.com>
From: Suneel Marthi <smarthi@apache.org>
Date: Thu, 16 Nov 2017 15:59:53 +0000
Message-ID: <CAOtpBjiixy7OiXSBT7EnXstv6Esr9U3yYp4fPOrMcZKtSCjw5A@mail.gmail.com>
Subject: Re: Log-likelihood based correlation test?
To: user@predictionio.apache.org
Cc: actionml-user <actionml-user@googlegroups.com>
Content-Type: multipart/alternative; boundary="001a11488ef2f599cc055e1bb58b"
archived-at: Thu, 16 Nov 2017 15:59:57 -0000

--001a11488ef2f599cc055e1bb58b
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable

Indeed so. Ted Dunning is an Apache Mahout PMC and committer and the whole
idea of Search-based Recommenders stems from his work and insights.  If u
didn't know, the PIO UR uses Apache Mahout under the hood and hence u see
the LLR.

On Thu, Nov 16, 2017 at 3:49 PM, Daniel Gabrieli <dgabrieli@salesforce.com>
wrote:

> I am pretty sure the LLR stuff in UR is based off of this blog post and
> associated paper:
>
> http://tdunning.blogspot.com/2008/03/surprise-and-coincidence.html
>
> Accurate Methods for the Statistics of Surprise and Coincidence
> by Ted Dunning
>
> http://citeseerx.ist.psu.edu/viewdoc/summary?doi=3D10.1.1.14.5962
>
>
> On Thu, Nov 16, 2017 at 10:26 AM Noelia Os=C3=A9s Fern=C3=A1ndez <
> noses@vicomtech.org> wrote:
>
>> Hi,
>>
>> I've been trying to understand how the UR algorithm works and I think I
>> have a general idea. But I would like to have a *mathematical
>> description* of the step in which the LLR comes into play. In the CCO
>> presentations I have found it says:
>>
>> (PtP) compares column to column using
>> *log-likelihood based correlation test*
>>
>> However, I have searched for "log-likelihood based correlation test" in
>> google but no joy. All I get are explanations of the likelihood-ratio te=
st
>> to compare two models.
>>
>> I would very much appreciate a math explanation of log-likelihood based
>> correlation test. Any pointers to papers or any other literature that
>> explains this specifically are much appreciated.
>>
>> Best regards,
>> Noelia
>>
>

--001a11488ef2f599cc055e1bb58b
Content-Type: text/html; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable

<div dir=3D"ltr">Indeed so. Ted Dunning is an Apache Mahout PMC and committ=
er and the whole idea of Search-based Recommenders stems from his work and =
insights.=C2=A0 If u didn&#39;t know, the PIO UR uses Apache Mahout under t=
he hood and hence u see the LLR.</div><div class=3D"gmail_extra"><br><div c=
lass=3D"gmail_quote">On Thu, Nov 16, 2017 at 3:49 PM, Daniel Gabrieli <span=
 dir=3D"ltr">&lt;<a href=3D"mailto:dgabrieli@salesforce.com" target=3D"_bla=
nk">dgabrieli@salesforce.com</a>&gt;</span> wrote:<br><blockquote class=3D"=
gmail_quote" style=3D"margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-=
left:1ex"><div dir=3D"ltr"><div><div>I am pretty sure the LLR stuff in UR i=
s based off of this blog post and associated paper:</div><div><br></div><di=
v><a href=3D"http://tdunning.blogspot.com/2008/03/surprise-and-coincidence.=
html" target=3D"_blank">http://tdunning.blogspot.com/<wbr>2008/03/surprise-=
and-<wbr>coincidence.html</a></div><div><br></div><div>Accurate Methods for=
 the Statistics of Surprise and Coincidence</div><div>by Ted Dunning</div><=
div><br></div><div><a href=3D"http://citeseerx.ist.psu.edu/viewdoc/summary?=
doi=3D10.1.1.14.5962" target=3D"_blank">http://citeseerx.ist.psu.edu/<wbr>v=
iewdoc/summary?doi=3D10.1.1.14.<wbr>5962</a></div></div><div><br></div></di=
v><br><div class=3D"gmail_quote"><div dir=3D"ltr">On Thu, Nov 16, 2017 at 1=
0:26 AM Noelia Os=C3=A9s Fern=C3=A1ndez &lt;<a href=3D"mailto:noses@vicomte=
ch.org" target=3D"_blank">noses@vicomtech.org</a>&gt; wrote:<br></div><bloc=
kquote class=3D"gmail_quote" style=3D"margin:0 0 0 .8ex;border-left:1px #cc=
c solid;padding-left:1ex"><div dir=3D"ltr"><div><div><div><div><div><div>Hi=
,<br><br></div>I&#39;ve been trying to understand how the UR algorithm work=
s and I think I have a general idea. But I would like to have a <u><b>mathe=
matical description</b></u> of the step in which the LLR comes into play. I=
n the CCO presentations I have found it says:<br><br></div>(PtP) compares c=
olumn to column using <b>log-likelihood based correlation test<br></b><br><=
br></div>However, I have searched for &quot;log-likelihood based correlatio=
n test&quot; in google but no joy. All I get are explanations of the likeli=
hood-ratio test to compare two models. <br><br></div>I would very much appr=
eciate a math explanation of log-likelihood based correlation test. Any poi=
nters to papers or any other literature that explains this specifically are=
 much appreciated.<br><br></div>Best regards,<br></div>Noelia<br></div>
</blockquote></div>
</blockquote></div><br></div>

--001a11488ef2f599cc055e1bb58b--