lucene-solr-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Yonik Seeley" <yo...@apache.org>
Subject Re: dismax catenated token search
Date Thu, 19 Jul 2007 22:14:28 GMT
On 7/19/07, Mike Klaas <mike.klaas@gmail.com> wrote:
>
> On 19-Jul-07, at 2:49 PM, Yonik Seeley wrote:
>
> > Does anyone have a good idea how to go about searching for
> > concatenated tokens?
> >
> > Say that the index has "painkiller" and the user types in
> > "pain killer" (without the quotes).
> >
> > If one were using the standard request handler, the easiest would be
> > to have the client handle it by sending in both variants:
> > pain OR killer OR painkiller
> >  or a variant like
> > "pain killer" OR painkiller
> >
> > But is there any answer when using dismax?
> > Requiring the client to send in pain killer painkiller seems like it
> > may decrease relevance too much if you currently use "pf" (phrase
> > fields) since the phrase "pain killer painkiller" isn't going to match
> > anything.
> >
> > Thoughts?
>
> Yes, pf should be replaced by a word proximity query that doesn't
> require all words to match :)

Some other quick ideas:
1) client issues two separate queries... "pain killer" and
"painkiller" and merges
   results.
2) dismax parameter that throws word catenations into the MaxDisjunction:
   "a b c" would also search for ab and bc.

-Yonik

Mime
View raw message