lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "S.L" <simpleliving...@gmail.com>
Subject Re: DirectSpellChecker not returning expected suggestions.
Date Mon, 02 Jun 2014 18:15:24 GMT
Thanks, you mean "wrangler" , has been stemmed to "wrangle" , if thats the
case then why does it not return any results for "wrangle" ?


On Mon, Jun 2, 2014 at 2:07 PM, david.w.smiley@gmail.com <
david.w.smiley@gmail.com> wrote:

> It appears to be stemmed.
>
> ~ David Smiley
> Freelance Apache Lucene/Solr Search Consultant/Developer
> http://www.linkedin.com/in/davidwsmiley
>
>
> On Mon, Jun 2, 2014 at 2:06 PM, S.L <simpleliving016@gmail.com> wrote:
>
> > OK, I just realized that "wrangle" is a proper english word, probably
> thats
> > why I dont get a suggestion for "wrangler" in this case. How ever in my
> > test index there is no "wrangle" present , so even though this is a
> proper
> > english word , since there is no occurence of it in the index should'nt
> > Solr suggest me "wrangler" ?
> >
> >
> > On Mon, Jun 2, 2014 at 2:00 PM, S.L <simpleliving016@gmail.com> wrote:
> >
> > > I do not get any suggestion (when I search for "wrangle") , however I
> > > correctly get the suggestion wrangler when I search for wranglr , I am
> > > using the Direct and WordBreak spellcheckers in combination, I have not
> > > tried using anything else.
> > >
> > > Is the distance calculation of Solr different than what Levestien
> > distance
> > > calculation ? I have set maxEdits to 1 , assuming that this corresponds
> > to
> > > the maxDistance.
> > >
> > > Thanks for your help!
> > >
> > >
> > > On Mon, Jun 2, 2014 at 1:54 PM, david.w.smiley@gmail.com <
> > > david.w.smiley@gmail.com> wrote:
> > >
> > >> What do you get then?  Suggestions, but not the one you’re looking
> for,
> > or
> > >> is it deemed correctly spelled?
> > >>
> > >> Have you tried another spellChecker impl, for troubleshooting
> purposes?
> > >>
> > >> ~ David Smiley
> > >> Freelance Apache Lucene/Solr Search Consultant/Developer
> > >> http://www.linkedin.com/in/davidwsmiley
> > >>
> > >>
> > >> On Sat, May 31, 2014 at 12:33 AM, S.L <simpleliving016@gmail.com>
> > wrote:
> > >>
> > >> > Hi All,
> > >> >
> > >> > I have a small test index of 400 documents , it happens to have an
> > entry
> > >> > for  "wrangler", When I search for "wranglr", I correctly get the
> > >> collation
> > >> > suggestion as "wrangler", however when I search for "wrangle" , I
do
> > not
> > >> > get a suggestion for "wrangler".
> > >> >
> > >> > The Levenstien distance between wrangle --> wrangler is same as
the
> > >> > Levestien distance between wranglr-->wrangler , I am just wondering
> > why
> > >> I
> > >> > do not get a suggestion for wrangle.
> > >> >
> > >> > Below is my Direct spell checker configuration.
> > >> >
> > >> >     <lst name="spellchecker">
> > >> >       <str name="name">direct</str>
> > >> >       <str name="field">suggestAggregate</str>
> > >> >       <str name="classname">solr.DirectSolrSpellChecker</str>
> > >> >       <!-- the spellcheck distance measure used, the default is
the
> > >> > internal levenshtein -->
> > >> >       <str name="distanceMeasure">internal</str>
> > >> >       <str name="comparatorClass">score</str>
> > >> >
> > >> >       <!-- minimum accuracy needed to be considered a valid
> spellcheck
> > >> > suggestion -->
> > >> >       <float name="accuracy">0.7</float>
> > >> >       <!-- the maximum #edits we consider when enumerating terms:
> can
> > >> be 1
> > >> > or 2 -->
> > >> >       <int name="maxEdits">1</int>
> > >> >       <!-- the minimum shared prefix when enumerating terms -->
> > >> >       <int name="minPrefix">3</int>
> > >> >       <!-- maximum number of inspections per result. -->
> > >> >       <int name="maxInspections">5</int>
> > >> >       <!-- minimum length of a query term to be considered for
> > >> correction
> > >> > -->
> > >> >       <int name="minQueryLength">4</int>
> > >> >       <!-- maximum threshold of documents a query term can appear
to
> > be
> > >> > considered for correction -->
> > >> >       <float name="maxQueryFrequency">0.01</float>
> > >> >       <!-- uncomment this to require suggestions to occur in 1%
of
> the
> > >> > documents -->
> > >> >       <!--
> > >> >       <float name="thresholdTokenFrequency">.01</float>
> > >> >       -->
> > >> >     </lst>
> > >> >
> > >>
> > >
> > >
> >
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message