Return-Path: X-Original-To: apmail-lucene-solr-user-archive@minotaur.apache.org Delivered-To: apmail-lucene-solr-user-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 570839A37 for ; Fri, 13 Apr 2012 09:05:23 +0000 (UTC) Received: (qmail 23258 invoked by uid 500); 13 Apr 2012 09:05:20 -0000 Delivered-To: apmail-lucene-solr-user-archive@lucene.apache.org Received: (qmail 23106 invoked by uid 500); 13 Apr 2012 09:05:20 -0000 Mailing-List: contact solr-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: solr-user@lucene.apache.org Delivered-To: mailing list solr-user@lucene.apache.org Received: (qmail 23084 invoked by uid 99); 13 Apr 2012 09:05:19 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 13 Apr 2012 09:05:19 +0000 X-ASF-Spam-Status: No, hits=1.5 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of kissuenow@gmail.com designates 209.85.217.176 as permitted sender) Received: from [209.85.217.176] (HELO mail-lb0-f176.google.com) (209.85.217.176) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 13 Apr 2012 09:05:14 +0000 Received: by lbbgm13 with SMTP id gm13so2332030lbb.35 for ; Fri, 13 Apr 2012 02:04:52 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; bh=h2b03pFcjbYMnPPUOB8ac41GJDZu81YW8rTYNaFVXus=; b=w9YL/xtcOHiNWniqbnYEDPQRqOeRWkZ0HvYBLwZnVwAqJhPlW9Qk5AAS2tJywL+Jpa WRhIdRIaujoVXkxSqEPs+aNdd8jziEN3qNdDg6lOBzGZbHu/sxyOs7wew2DOiZBrwU52 Y4sd1c9js2HVpwM1YWW0vZTTG8y0I5SXmj1akHXs22gIVHhb5s5KlRvDuv5VQGzL/MCC 5nGvt30fmbk7sg5nukbrAuFHpM1YANEHFfvut8dRFzkzyKKfH9GlAcOgiv3QFGuARwPf P0HGEj08DBuKjF2gZskZwvuG0U4dfWIZgKylTIwvWetTNGjl1q9BegwCMNUy3C+WJUY4 3tOQ== MIME-Version: 1.0 Received: by 10.152.102.228 with SMTP id fr4mr803756lab.23.1334307892554; Fri, 13 Apr 2012 02:04:52 -0700 (PDT) Received: by 10.152.6.136 with HTTP; Fri, 13 Apr 2012 02:04:52 -0700 (PDT) In-Reply-To: References: <46314A2B-3504-4022-9D07-62A0AC400E55@wunderwood.org> Date: Fri, 13 Apr 2012 10:04:52 +0100 Message-ID: Subject: Re: Solr Scoring From: Kissue Kissue To: solr-user@lucene.apache.org Content-Type: multipart/alternative; boundary=f46d040710abf4ea2504bd8bc3b2 X-Virus-Checked: Checked by ClamAV on apache.org --f46d040710abf4ea2504bd8bc3b2 Content-Type: text/plain; charset=ISO-8859-1 Thanks a lot. I had already implemented Walter's solution and was wondering if this was the right way to deal with it. This has now given me the confidence to go with the solution. Many thanks. On Fri, Apr 13, 2012 at 1:04 AM, Erick Erickson wrote: > GAH! I had my head in "make this happen in one field" when I wrote my > response, without being explicit. Of course Walter's solution is pretty > much the standard way to deal with this. > > Best > Erick > > On Thu, Apr 12, 2012 at 5:38 PM, Walter Underwood > wrote: > > It is easy. Create two fields, text_exact and text_stem. Don't use the > stemmer in the first chain, do use the stemmer in the second. Give the > text_exact a bigger weight than text_stem. > > > > wunder > > > > On Apr 12, 2012, at 4:34 PM, Erick Erickson wrote: > > > >> No, I don't think there's an OOB way to make this happen. It's > >> a recurring theme, "make exact matches score higher than > >> stemmed matches". > >> > >> Best > >> Erick > >> > >> On Thu, Apr 12, 2012 at 5:18 AM, Kissue Kissue > wrote: > >>> Hi, > >>> > >>> I have a field in my index called itemDesc which i am applying > >>> EnglishMinimalStemFilterFactory to. So if i index a value to this field > >>> containing "Edges", the EnglishMinimalStemFilterFactory applies > stemming > >>> and "Edges" becomes "Edge". Now when i search for "Edges", documents > with > >>> "Edge" score better than documents with the actual search word - > "Edges". > >>> Is there a way i can make documents with the actual search word in this > >>> case "Edges" score better than document with "Edge"? > >>> > >>> I am using Solr 3.5. My field definition is shown below: > >>> > >>> positionIncrementGap="100"> > >>> > >>> > >>> >>> synonyms="index_synonyms.txt" ignoreCase="true" expand="false"/> > >>> >>> ignoreCase="true" > >>> words="stopwords_en.txt" > >>> enablePositionIncrements="true" > >>> > >>> > >>> > >>> > >>> > >>> > >>> synonyms="synonyms.txt" > >>> ignoreCase="true" expand="true"/> > >>> >>> ignoreCase="true" > >>> words="stopwords_en.txt" > >>> enablePositionIncrements="true" > >>> /> > >>> > >>> > >>> >>> protected="protwords.txt"/> > >>> > >>> > >>> > >>> > >>> Thanks. > > > > > > > > > > > --f46d040710abf4ea2504bd8bc3b2--