lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Zheng Lin Edwin Yeo <edwinye...@gmail.com>
Subject Re: Relevancy Score and Proximity Search
Date Fri, 29 May 2015 03:14:39 GMT
I've tried to use the site. I saw that when I search for Matex, it actually
only gives a boost of 0.8 to the word Latex as it is not the main word that
is search, but I still can't understand why the score can be so high?

This is what I get from the output explanation:

{

   -
   - "match": true,
   - "value": 2.3716807,
   - "description": "weight(text:latex^0.8 in 106449) [DefaultSimilarity],
   result of:",
   - "details":
   -
   [
      - -
      {
         -
         - "match": true,
         - "value": 2.3716807,
         - "description": "score(doc=106449,freq=1.0), product of:",
         - "details":
         -
         [
            - -
            {
               -
               - "match": true,
               - "value": 0.95434946,
               - "description": "queryWeight, product of:",
               - "details":
               -
               [
                  - -
                  {
                     -
                     - "match": true,
                     - "value": 0.8,
                     - "description": "boost"
                  },
                  - -
                  {
                     -
                     - "match": true,
                     - "value": 13.254017,
                     - "description": "idf(docFreq=1, maxDocs=419645)"
                  },
                  - -
                  {
                     -
                     - "match": true,
                     - "value": 0.09000568,
                     - "description": "queryNorm"
                  }
               ]
            },
            - -
            {
               -
               - "match": true,
               - "value": 2.4851282,
               - "description": "fieldWeight in 106449, product of:",
               - "details":
               -
               [
                  - -
                  {
                     -
                     - "match": true,
                     - "value": 1,
                     - "description": "tf(freq=1.0), with freq of:",
                     - "details":
                     -
                     [
                        - -
                        {
                           -
                           - "match": true,
                           - "value": 1,
                           - "description": "termFreq=1.0"
                        }
                     ]
                  },
                  - -
                  {
                     -
                     - "match": true,
                     - "value": 13.254017,
                     - "description": "idf(docFreq=1, maxDocs=419645)"
                  },
                  - -
                  {
                     -
                     - "match": true,
                     - "value": 0.1875,
                     - "description": "fieldNorm(doc=106449)"
                  }
               ]
            }
         ]
      }
   ]

}


For the record with Matex, here is the output explanation:

{

   -
   - "match": true,
   - "value": 0.18585733,
   - "description": "weight(text:matex in 163) [DefaultSimilarity], result
   of:",
   - "details":
   -
   [
      - -
      {
         -
         - "match": true,
         - "value": 0.18585733,
         - "description": "score(doc=163,freq=1.0), product of:",
         - "details":
         -
         [
            - -
            {
               -
               - "match": true,
               - "value": 0.2986924,
               - "description": "queryWeight, product of:",
               - "details":
               -
               [
                  - -
                  {
                     -
                     - "match": true,
                     - "value": 3.318595,
                     - "description": "idf(docFreq=41297, maxDocs=419645)"
                  },
                  - -
                  {
                     -
                     - "match": true,
                     - "value": 0.09000568,
                     - "description": "queryNorm"
                  }
               ]
            },
            - -
            {
               -
               - "match": true,
               - "value": 0.62223655,
               - "description": "fieldWeight in 163, product of:",
               - "details":
               -
               [
                  - -
                  {
                     -
                     - "match": true,
                     - "value": 1,
                     - "description": "tf(freq=1.0), with freq of:",
                     - "details":
                     -
                     [
                        - -
                        {
                           -
                           - "match": true,
                           - "value": 1,
                           - "description": "termFreq=1.0"
                        }
                     ]
                  },
                  - -
                  {
                     -
                     - "match": true,
                     - "value": 3.318595,
                     - "description": "idf(docFreq=41297, maxDocs=419645)"
                  },
                  - -
                  {
                     -
                     - "match": true,
                     - "value": 0.1875,
                     - "description": "fieldNorm(doc=163)"
                  }
               ]
            }
         ]
      }
   ]

}


Regards,
Edwin


On 28 May 2015 at 22:48, John Blythe <john@curvolabs.com> wrote:

> this site has been a great help to me in seeing how things shake out as far
> as the scores are concerned: http://splainer.io/
>
> --
> *John Blythe*
> Product Manager & Lead Developer
>
> 251.605.3071 | john@curvolabs.com
> www.curvolabs.com
>
> 58 Adams Ave
> Evansville, IN 47713
>
> On Thu, May 28, 2015 at 10:06 AM, Zheng Lin Edwin Yeo <
> edwinyeozl@gmail.com>
> wrote:
>
> > Hi,
> >
> > Does anyone knows how Solr does its scoring with a query that has
> proximity
> > search enabled.
> >
> > For example, when I issue a query q=Matex~1, the result with the top
> score
> > that came back was actually 'Latex', and with a score of 2.27. This is
> with
> > the fact that there are several documents in my index which contain the
> > exact word 'Matex'. All these results came back below, and only with a
> > score of 0.19.
> >
> > Shouldn't documents with the exact match supposed to have a higher score
> > then those which are found by proximity search?
> >
> > I'm using Solr 5.1 and have not change any settings. All my settings are
> > the default settings.
> >
> >
> > Regards,
> > Edwin
> >
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message