lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Yonik Seeley" <yo...@apache.org>
Subject Re: Phrase query-like query that doesn't requre all the terms?
Date Fri, 14 Nov 2008 17:57:32 GMT
On Fri, Nov 14, 2008 at 12:05 PM, Teruhiko Kurosaka <Kuro@basistech.com> wrote:
> My problem with Phrase Query is that it requires
> existence of all the terms in documents.  I want them more
> permissible.  I want it to match with lower score.
> Does dismax also requires all the terms?

The mandatory part +(DEF GHI) selects documents with either term and
scores higher if both terms are present.  The sloppy phrase query part
"DEF GHI"~10^5 is optional and only contributes to the score when both
terms appear near each other (and scores higher the closer together
they are).

-Yonik


>> Solr's dismax parser can generate queries that do most of
>> this... it's a combination of term queries and sloppy phrase queries.
>>
>> Simplest example:
>> +(DEF GHI) "DEF GHI"~10^5
>>
>> The only thing that it doesn't work for is the terms out of
>> order (they will still be matched).  You could use span
>> queries if you really need that ordering, but sloppy phrase
>> queries already penalize the out-of-order since it's a bigger
>> edit distance (but it won't be a "severe" penalty).
>
> ----
> T. "Kuro" Kurosaka, Basis Technology
> San Francisco, California, U.S.A.
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
> For additional commands, e-mail: java-user-help@lucene.apache.org
>
>

---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org


Mime
View raw message