lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Jeroen Steggink (JIRA)" <j...@apache.org>
Subject [jira] [Created] (LUCENE-6493) MoreLikeThis minimum should match query
Date Wed, 20 May 2015 20:14:00 GMT
Jeroen Steggink created LUCENE-6493:
---------------------------------------

             Summary: MoreLikeThis minimum should match query
                 Key: LUCENE-6493
                 URL: https://issues.apache.org/jira/browse/LUCENE-6493
             Project: Lucene - Core
          Issue Type: Improvement
          Components: core/search
    Affects Versions: 5.x
            Reporter: Jeroen Steggink
            Priority: Minor


When performing MoreLikeThis queries we have a lot of parameters at hand in order to finetune
the results. However, one thing I personally missed, being able to set a minimum of matching
terms.

The advantage of this parameter is that Lucene will retrieve only the documents which have
at least the given number of matching terms and you won't get any results of just one or two
keyword matches when you are looking for documents that have more than a few matching interesting
keywords.

An example:

doc1: lucene
doc2: lucene release
doc3: apache
doc4: apache lucene
doc5: apache lucene release

Let's say the query is "apache lucene release" and you set the minimum number of matching
interesting terms to 2.

MoreLikeThis will return:
doc2, doc4 and doc5

Would you set it to 3, it would return just doc5.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


Mime
View raw message