lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Kissue Kissue <kissue...@gmail.com>
Subject Re: Solr Scoring
Date Fri, 13 Apr 2012 09:04:52 GMT
Thanks a lot. I had already implemented Walter's solution and was wondering
if this was the right way to deal with it. This has now given me the
confidence to go with the solution.

Many thanks.

On Fri, Apr 13, 2012 at 1:04 AM, Erick Erickson <erickerickson@gmail.com>wrote:

> GAH! I had my head in "make this happen in one field" when I wrote my
> response, without being explicit. Of course Walter's solution is pretty
> much the standard way to deal with this.
>
> Best
> Erick
>
> On Thu, Apr 12, 2012 at 5:38 PM, Walter Underwood <wunder@wunderwood.org>
> wrote:
> > It is easy. Create two fields, text_exact and text_stem. Don't use the
> stemmer in the first chain, do use the stemmer in the second. Give the
> text_exact a bigger weight than text_stem.
> >
> > wunder
> >
> > On Apr 12, 2012, at 4:34 PM, Erick Erickson wrote:
> >
> >> No, I don't think there's an OOB way to make this happen. It's
> >> a recurring theme, "make exact matches score higher than
> >> stemmed matches".
> >>
> >> Best
> >> Erick
> >>
> >> On Thu, Apr 12, 2012 at 5:18 AM, Kissue Kissue <kissuenow@gmail.com>
> wrote:
> >>> Hi,
> >>>
> >>> I have a field in my index called itemDesc which i am applying
> >>> EnglishMinimalStemFilterFactory to. So if i index a value to this field
> >>> containing "Edges", the EnglishMinimalStemFilterFactory applies
> stemming
> >>> and "Edges" becomes "Edge". Now when i search for "Edges", documents
> with
> >>> "Edge" score better than documents with the actual search word -
> "Edges".
> >>> Is there a way i can make documents with the actual search word in this
> >>> case "Edges" score better than document with "Edge"?
> >>>
> >>> I am using Solr 3.5. My field definition is shown below:
> >>>
> >>> <fieldType name="text_en" class="solr.TextField"
> positionIncrementGap="100">
> >>>      <analyzer type="index">
> >>>        <tokenizer class="solr.StandardTokenizerFactory"/>
> >>>               <filter class="solr.SynonymFilterFactory"
> >>> synonyms="index_synonyms.txt" ignoreCase="true" expand="false"/>
> >>>             <filter class="solr.StopFilterFactory"
> >>>                ignoreCase="true"
> >>>                words="stopwords_en.txt"
> >>>                enablePositionIncrements="true"
> >>>             <filter class="solr.LowerCaseFilterFactory"/>
> >>>    <filter class="solr.EnglishPossessiveFilterFactory"/>
> >>>        <filter class="solr.EnglishMinimalStemFilterFactory"/>
> >>>      </analyzer>
> >>>      <analyzer type="query">
> >>>        <tokenizer class="solr.StandardTokenizerFactory"/>
> >>>        <filter class="solr.SynonymFilterFactory"
> synonyms="synonyms.txt"
> >>> ignoreCase="true" expand="true"/>
> >>>        <filter class="solr.StopFilterFactory"
> >>>                ignoreCase="true"
> >>>                words="stopwords_en.txt"
> >>>                enablePositionIncrements="true"
> >>>                />
> >>>        <filter class="solr.LowerCaseFilterFactory"/>
> >>>    <filter class="solr.EnglishPossessiveFilterFactory"/>
> >>>        <filter class="solr.KeywordMarkerFilterFactory"
> >>> protected="protwords.txt"/>
> >>>        <filter class="solr.EnglishMinimalStemFilterFactory"/>
> >>>      </analyzer>
> >>>    </fieldType>
> >>>
> >>> Thanks.
> >
> >
> >
> >
> >
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message