Return-Path: Delivered-To: apmail-lucene-java-user-archive@www.apache.org Received: (qmail 56119 invoked from network); 4 May 2010 00:42:26 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 4 May 2010 00:42:26 -0000 Received: (qmail 34054 invoked by uid 500); 4 May 2010 00:42:23 -0000 Delivered-To: apmail-lucene-java-user-archive@lucene.apache.org Received: (qmail 33993 invoked by uid 500); 4 May 2010 00:42:23 -0000 Mailing-List: contact java-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-user@lucene.apache.org Delivered-To: mailing list java-user@lucene.apache.org Received: (qmail 33950 invoked by uid 99); 4 May 2010 00:42:23 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 04 May 2010 00:42:23 +0000 X-ASF-Spam-Status: No, hits=0.0 required=10.0 tests=FREEMAIL_FROM,RCVD_IN_DNSWL_NONE,SPF_PASS,T_TO_NO_BRKTS_FREEMAIL X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: local policy) Received: from [98.136.167.18] (HELO web113318.mail.gq1.yahoo.com) (98.136.167.18) by apache.org (qpsmtpd/0.29) with SMTP; Tue, 04 May 2010 00:42:15 +0000 Received: (qmail 44733 invoked by uid 60001); 4 May 2010 00:41:53 -0000 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=yahoo.com; s=s1024; t=1272933713; bh=1leZgRhzPirtnxJ6U0K7+gJVF9Fyp7OzXe9V2iYQ0wE=; h=Message-ID:X-YMail-OSG:Received:X-Mailer:Date:From:Subject:To:In-Reply-To:MIME-Version:Content-Type:Content-Transfer-Encoding; b=WsihbG+FHV2tBcdwFhNbGs+a3Xz7KGEBhz4T8I3Ykv6t1o/U2kG8Iqb5ChSQsliS+NGNVU0wSfJC4SrRkzQ20jrlghDbeFrXlFlO1iM/TSX6skeEuTK+shUj9ahLuhRJTaWqn/Y4OP8bXOH0hJpInys3DZ+JVG6oQCzen2hXbW0= DomainKey-Signature: a=rsa-sha1; q=dns; c=nofws; s=s1024; d=yahoo.com; h=Message-ID:X-YMail-OSG:Received:X-Mailer:Date:From:Subject:To:In-Reply-To:MIME-Version:Content-Type:Content-Transfer-Encoding; b=2VjaV5Jvi+s4R5BZx03B6E+0zL75/E7KS3l8EPKK4TLeVzyOhTne3jb9rF4LBijXyEfaYf0v3tmsza2GhXiSrxGCZLJEyzOUlzUhWvEx3SCOp0sZPlG1UY1RRMkP03c2w34KIKU5PV+38Rv9+Lvl3GVZ6Ptk7oUTS4pwTXvUeMQ=; Message-ID: <713587.43972.qm@web113318.mail.gq1.yahoo.com> X-YMail-OSG: ByVo5eUVM1khUdDhwdY68NTh6c.mJUr8JvX7ln45rV6gYfz boQTxYPjB2FIeRjQ6y6pMInYdMRNkgqg46zv6RMd1JrekOY6WDGZ73F9AS7n KppdH0JGWtYGS_wwZdYCafr6uB1XWUkZiL1ktXd1bljV1HDFtybyy4WC841k Oh0awW9_llm1mTRNuspu96NmBMrAWgZCoZi9Kjti1AntpCRf8OCp2EzWOYPD A2R5vQenO1KIZ2N70zKqhXxI7qNN84xL11lNklJPJbHAIa5jMJyKrX7SrIBu RrXp.fhVoTZcZcjrIcpuYDd3MtEdeNKviARfOR4lraaF2CpzZX.S4UA-- Received: from [99.150.141.64] by web113318.mail.gq1.yahoo.com via HTTP; Mon, 03 May 2010 17:41:53 PDT X-Mailer: YahooMailClassic/10.1.11 YahooMailWebService/0.8.103.269680 Date: Mon, 3 May 2010 17:41:53 -0700 (PDT) From: Ivan Provalov Subject: Re: Relevancy Practices To: java-user@lucene.apache.org In-Reply-To: MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable X-Virus-Checked: Checked by ClamAV on apache.org Grant,=0A=0AWe are currently working on a relevancy improvement project. W= e took the IBM's paper from 2007 TREC and followed the approaches they desc= ribed to improve Lucene's relevance. It also gave us some idea of Lucene= =E2=80=99s out-of-the-box precision performance (MAP). In addition to it w= e used some of the best practices described in TREC's book (Voorhees 2005, = MIT). We also looked into the probability scoring model (BM25). =0A=0AWe s= tarted by comparing =E2=80=9Cvanilla=E2=80=9D Lucene to our Lucene-based pr= oduct=E2=80=99s performance. We obtained the collections and the judgments= from the past TREC which were close to the genre of the content we store. = We then proceeded to study how different tunings affected the scores. We = used Lucene's benchmarking module to run against the TREC data. Even thoug= h there were a few old TREC document/topic format related issues along the = way, this benchmarking tool was all together great in helping find the MAP = and measure where we were at. =0A=0AThen we applied the Sweet Spot similar= ity, Pivot Point document length normalization (Lnb/Ltc), and BM25 scoring = algorithms. After applying these different scoring mechanism changes and o= ther techniques (different stemmers, query expansion), we saw some improvem= ents. We then compared this to our current production system and started t= uning it as well. =0A=0AOur second goal here was to include the relevance = measurement into the continuous integration tests running nightly. The tho= ught here is that if one of the system=E2=80=99s changes inadvertently affe= cted the scoring, we would find out right away. This second phase also hel= ped us discover hidden bugs in our production system. =0A=0AIn addition to = the English-based analyzers, we also studied Chinese analyzers and compared= the results with the English collection runs. We used TREC data for that.= =0A=0ASome observations:=0A1.=09Even though the Vector Space model with Boo= lean query (OR) gives good MAP scores, in some products the large number of= returned results makes the product less usable. So, defaulting to AND ope= rator may be a better option as was mentioned in this user group post earli= er.=0A2.=09This TREC-based evaluation is just of many tools to use. For ex= ample, user feed-back is still the most important evaluation one can do.=0A= 3.=09We will continue studying how different scoring mechanisms affect rele= vance quality before making a decision whether to switch from the default V= SM. Some of our concerns are over-tuning and performance testing.=0A4.=09L= ucene user community has been very helpful. Robert Muir, Joaquin Iglesias,= and others helped with applying the scoring algorithms and providing great= suggestions. =0A5.=09Some of the tools we use constantly - Lucene=E2=80=99= s query Explanation and Luke.=0A=0AThanks,=0A=0AIvan Provalov=0A=0A=0A=0A= =0A--- On Thu, 4/29/10, Grant Ingersoll wrote:=0A=0A>= From: Grant Ingersoll =0A> Subject: Relevancy Practic= es=0A> To: java-user@lucene.apache.org=0A> Date: Thursday, April 29, 2010, = 10:14 AM=0A> I'm putting on a talk at Lucene=0A> Eurocon (http://lucene-eur= ocon.org/sessions-track1-day2.html#1)=0A> on "Practical Relevance" and I'm = curious as to what people=0A> put in practice for testing and improving rel= evance.=C2=A0 I=0A> have my own inclinations, but I don't want to muddy the= =0A> water just yet.=C2=A0 So, if you have a few moments, I'd=0A> love to h= ear responses to the following questions.=0A> =0A> What worked?=C2=A0 =0A> = What didn't work?=C2=A0 =0A> What didn't you understand about it?=C2=A0 =0A= > What tools did you use?=C2=A0 =0A> What tools did you wish you had either= for debugging=0A> relevance or "fixing" it?=0A> How much time did you spen= d on it?=0A> How did you avoid over/under tuning?=0A> What stage of develop= ment/testing/production did you decide=0A> to do relevance tuning?=C2=A0 Wa= s that timing planned or=0A> not?=0A> =0A> =0A> Thanks,=0A> Grant=0A> =0A= =0A=0A --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org For additional commands, e-mail: java-user-help@lucene.apache.org