lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Erick Erickson <erickerick...@gmail.com>
Subject Re: understanding phonetic matching
Date Sun, 08 May 2016 18:43:12 GMT
Jay:

Here's what's currently available:

https://cwiki.apache.org/confluence/display/solr/Phonetic+Matching

Not sure what version of Solr some of them were added in....

Best,
Erick

On Sat, May 7, 2016 at 9:30 PM, Jay Potharaju <jspotharaju@gmail.com> wrote:
> Thanks will check it out.
>
>
> On Sat, May 7, 2016 at 7:05 PM, Susheel Kumar <susheel2777@gmail.com> wrote:
>
>> Jay,
>>
>> There are mainly three phonetics algorithms available in Solr i.e.
>> RefinedSoundex, DoubleMetaphone & BeiderMorse.  We did extensive comparison
>> considering various tests cases and found BeiderMorse to be the best among
>> those for finding sound like matches and it also supports multiple
>> languages.  We also customized Beider Morse extensively for our use case.
>>
>> So please take a closer look at Beider Morse and i am sure it will help you
>> out.
>>
>> Thanks,
>> Susheel
>>
>> On Sat, May 7, 2016 at 2:13 PM, Jay Potharaju <jspotharaju@gmail.com>
>> wrote:
>>
>> > Thanks for the feedback, I was getting correct results when searching for
>> > jon & john. But when I tried other names like 'khloe' it matched on
>> > 'collier' because the phonetic filter generated KL as the token.
>> > Is phonetic filter the best way to find similar sounding names?
>> >
>> >
>> > On Wed, Mar 23, 2016 at 12:01 AM, davidphilip cherian <
>> > davidphilipcherian@gmail.com> wrote:
>> >
>> > > The "phonetic_en" analyzer definition available in solr-schema does
>> > return
>> > > documents having "Jon", "JN", "John" when search term is "John".
>> Checkout
>> > > screen shot here : http://imgur.com/0R6SvX2
>> > >
>> > > This wiki page explains how phonetic matching works :
>> > >
>> > >
>> >
>> https://cwiki.apache.org/confluence/display/solr/Phonetic+Matching#PhoneticMatching-DoubleMetaphone
>> > >
>> > >
>> > > Hope that helps.
>> > >
>> > >
>> > >
>> > > On Wed, Mar 23, 2016 at 11:18 AM, Alexandre Rafalovitch <
>> > > arafalov@gmail.com>
>> > > wrote:
>> > >
>> > > > I'd start by putting LowerCaseFF before the PhoneticFilter.
>> > > >
>> > > > But then, you say you were using Analysis screen and what? Do you
get
>> > > > the matches when you put your sample text and the query text in the
>> > > > two boxes in the UI? I am not sure what "look at my solr data" means
>> > > > in this particular context.
>> > > >
>> > > > Regards,
>> > > >    Alex.
>> > > > ----
>> > > > Newsletter and resources for Solr beginners and intermediates:
>> > > > http://www.solr-start.com/
>> > > >
>> > > >
>> > > > On 23 March 2016 at 16:27, Jay Potharaju <jspotharaju@gmail.com>
>> > wrote:
>> > > > > Hi,
>> > > > > I am trying to do name matching using the phonetic filter factory.
>> As
>> > > > part
>> > > > > of that I was analyzing the data using analysis screen in solr
UI.
>> > If i
>> > > > > search for john, any documents containing john or jon should
be
>> > found.
>> > > > >
>> > > > > Following is my definition of the custom field that I use for
>> > indexing
>> > > > the
>> > > > > data. When I look at my solr data I dont see any similar sounding
>> > names
>> > > > in
>> > > > > my solr data, even though I have set inject="true". Is that not
how
>> > it
>> > > is
>> > > > > supposed to work?
>> > > > > Can someone explain how phonetic matching works?
>> > > > >
>> > > > >  <fieldType name="text_phonetic" class="solr.TextField"
>> > > > positionIncrementGap
>> > > > > ="100">
>> > > > >
>> > > > >      <analyzer>
>> > > > >
>> > > > >         <tokenizer class="solr.StandardTokenizerFactory"/>
>> > > > >
>> > > > >         <filter class="solr.PhoneticFilterFactory"
>> > > > encoder="DoubleMetaphone"
>> > > > > inject="true" maxCodeLength="5"/>
>> > > > >
>> > > > >         <filter class="solr.LowerCaseFilterFactory"/>
>> > > > >
>> > > > >      </analyzer>
>> > > > >
>> > > > >     </fieldType>
>> > > > >
>> > > > > --
>> > > > > Thanks
>> > > > > Jay
>> > > >
>> > >
>> >
>> >
>> >
>> > --
>> > Thanks
>> > Jay Potharaju
>> >
>>
>
>
>
> --
> Thanks
> Jay Potharaju

Mime
View raw message