From: Mark Bennett
Date: Thu, 11 Feb 2010 13:31:20 -0800
Subject: Re: Comments on ORP Wiki Additions ?
To: openrelevance-user@lucene.apache.org

Hi Robert,

By "pooling", do you mean they combine different sets of source docs and question sets, in kind of a patchwork? If that's what you mean, do you know how that process was generally done? How close to "perfection", i.e. total coverage by humans, do you think they got?

If that's not what you meant by "pooling", then I'm a bit confused...
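To double-check my own understanding, here's a rough sketch of what I'm picturing. This is purely my assumption of the usual TREC-style pool (and the function name is just made up for illustration): instead of judging the full M x N grid of every topic against every document, you merge each system's top-k results per topic, and only that union goes to the human assessors.

    # Hypothetical sketch of pooling as I understand it -- merge each
    # system's top-k ranked docs for one topic; only the pool gets judged.
    def build_pool(runs, k=100):
        """runs: dict of system name -> ranked list of doc ids for one topic."""
        pool = set()
        for ranked_docs in runs.values():
            pool.update(ranked_docs[:k])   # take the top k from each run
        return pool                        # docs outside the pool are never judged

    # Tiny made-up example with three systems and k=2:
    runs = {
        "system_a": ["d1", "d2", "d3", "d4"],
        "system_b": ["d2", "d5", "d1", "d6"],
        "system_c": ["d7", "d2", "d8", "d9"],
    }
    print(sorted(build_pool(runs, k=2)))   # ['d1', 'd2', 'd5', 'd7']

If that's roughly right, it would explain how TREC dodges the full grid (say 50 topics against a few hundred thousand docs would be tens of millions of judgments), but it also means unjudged documents get treated as non-relevant, which is really what my "how close to perfection" question is getting at.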
Thanks,
Mark

--
Mark Bennett / New Idea Engineering, Inc. / mbennett@ideaeng.com
Direct: 408-733-0387 / Main: 866-IDEA-ENG / Cell: 408-829-6513


On Thu, Feb 11, 2010 at 1:02 PM, Robert Muir wrote:

> In this case, pooling is what is typically used.
>
>
> On Thu, Feb 11, 2010 at 3:49 PM, Mark Bennett wrote:
>
>> Thanks Robert,
>>
>> Excellent comments, I'll try to add something to the outline. Either a
>> higher-level top section, or some intro text.
>>
>> Robert, in particular, I wonder if you could look at:
>>
>> http://cwiki.apache.org/confluence/display/ORP/Relevancy+Assertion+Testing
>>
>> In the section on "Full-Grid Assertions (TREC-Style!)", it talks about
>> the "M x N" problem of creating relevancy judgment data. It also explores
>> some of the shortcuts that could be used.
>>
>> We're actually working through these problems with a couple of clients.
>> On the one hand they want "perfect" measurements, but on the other hand
>> nobody wants to fund the work to create completely curated test sets.
>> This is the classic "good vs. cheap" argument, and I DO think there are
>> reasonable compromises to be had.
>>
>> TREC has evolved over the years, and I wonder how they've addressed these
>> issues. Did they take any shortcuts? Or did they get enough manpower to
>> really curate every single document and relevancy judgment?
>>
>> I'll be adding more about some of the compromises we've considered and
>> worked on, but it'd be great to get other experts to chime in. Either
>> y'all will come back with other ideas we didn't think of, or we get to
>> say "we told you so" - I'm happy either way.
>>
>> And what I love about the ORP process is that all of this is captured
>> and vetted in an accessible public forum. TREC was also peer reviewed,
>> so this continues that tradition in the newer medium. And I'll work on
>> an even clearer outline.
>>
>>
>> Mark
>>
>> --
>> Mark Bennett / New Idea Engineering, Inc. / mbennett@ideaeng.com
>> Direct: 408-733-0387 / Main: 866-IDEA-ENG / Cell: 408-829-6513
>>
>>
>> On Thu, Feb 11, 2010 at 11:49 AM, Robert Muir wrote:
>>
>>> First of all, thanks for adding this content!
>>>
>>> In my opinion, one thing that might be helpful would be an
>>> 'introduction' section that is VERY high-level. I don't want to sound
>>> negative, but your 'high level outline' is actually quite technical :)
>>>
>>> It might be a good thing for this project if we had some content
>>> somewhere that explained at a very, very high level what this whole
>>> relevance testing thing is all about...
>>>
>>>
>>> On Thu, Feb 11, 2010 at 12:58 PM, Mark Bennett wrote:
>>>
>>>> Good morning, Relevancy comrades,
>>>>
>>>> I've tried to take a stab at outlining this rather complex subject
>>>> in the wiki. Of course it's a work in progress.
>>>>
>>>> I've done a high-level outline here:
>>>> http://cwiki.apache.org/confluence/display/ORP/Relevancy+Testing+Outline
>>>>
>>>> And an expansion of the first section of the outline here:
>>>> http://cwiki.apache.org/confluence/display/ORP/Relevancy+Assertion+Testing
>>>>
>>>> I actually could use some feedback. I promise you this is not vanity;
>>>> there are actually some very pragmatic motives for my postings.
>>>>
>>>> I guess some specific questions:
>>>> * I'm trying to create a bit of a "crash course" in Relevancy Testing;
>>>> are there major areas I've overlooked?
>>>> * I've outlined two broad categories of testing; do you agree?
>>>> * I've tried to explore some of the high-level strengths and drawbacks
>>>> of certain methodologies.
>>>> * Is the "tone" reasonably neutral? What I mean is that some folks may
>>>> be attached to certain methods; I don't want to seem like I'm "trashing"
>>>> anything, just trying to point out the strengths and weaknesses in a
>>>> fair way.
>>>>
>>>> I look forward to any comments.
>>>>
>>>> Mark
>>>>
>>>> --
>>>> Mark Bennett / New Idea Engineering, Inc. / mbennett@ideaeng.com
>>>> Direct: 408-733-0387 / Main: 866-IDEA-ENG / Cell: 408-829-6513
>>>
>>>
>>> --
>>> Robert Muir
>>> rcmuir@gmail.com
>>
>
> --
> Robert Muir
> rcmuir@gmail.com