lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From davidphilip cherian <davidphilipcher...@gmail.com>
Subject Re: understanding phonetic matching
Date Wed, 23 Mar 2016 07:01:56 GMT
The "phonetic_en" analyzer definition available in solr-schema does return
documents having "Jon", "JN", "John" when search term is "John". Checkout
screen shot here : http://imgur.com/0R6SvX2

This wiki page explains how phonetic matching works :
https://cwiki.apache.org/confluence/display/solr/Phonetic+Matching#PhoneticMatching-DoubleMetaphone


Hope that helps.



On Wed, Mar 23, 2016 at 11:18 AM, Alexandre Rafalovitch <arafalov@gmail.com>
wrote:

> I'd start by putting LowerCaseFF before the PhoneticFilter.
>
> But then, you say you were using Analysis screen and what? Do you get
> the matches when you put your sample text and the query text in the
> two boxes in the UI? I am not sure what "look at my solr data" means
> in this particular context.
>
> Regards,
>    Alex.
> ----
> Newsletter and resources for Solr beginners and intermediates:
> http://www.solr-start.com/
>
>
> On 23 March 2016 at 16:27, Jay Potharaju <jspotharaju@gmail.com> wrote:
> > Hi,
> > I am trying to do name matching using the phonetic filter factory. As
> part
> > of that I was analyzing the data using analysis screen in solr UI. If i
> > search for john, any documents containing john or jon should be found.
> >
> > Following is my definition of the custom field that I use for indexing
> the
> > data. When I look at my solr data I dont see any similar sounding names
> in
> > my solr data, even though I have set inject="true". Is that not how it is
> > supposed to work?
> > Can someone explain how phonetic matching works?
> >
> >  <fieldType name="text_phonetic" class="solr.TextField"
> positionIncrementGap
> > ="100">
> >
> >      <analyzer>
> >
> >         <tokenizer class="solr.StandardTokenizerFactory"/>
> >
> >         <filter class="solr.PhoneticFilterFactory"
> encoder="DoubleMetaphone"
> > inject="true" maxCodeLength="5"/>
> >
> >         <filter class="solr.LowerCaseFilterFactory"/>
> >
> >      </analyzer>
> >
> >     </fieldType>
> >
> > --
> > Thanks
> > Jay
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message