lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Robert Muir (Commented) (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (LUCENE-2557) FuzzyQuery - fuzzy terms and misspellings are ranked higher than exact matches
Date Fri, 09 Mar 2012 14:26:57 GMT

    [ https://issues.apache.org/jira/browse/LUCENE-2557?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13226094#comment-13226094
] 

Robert Muir commented on LUCENE-2557:
-------------------------------------

In the example of the link, I'm still lost as to why its really useful.

If all queries are prefix queries, what is the use case for that? Maybe really what is needed
is 
suggester, stemming, decompounding, or some combination of those?

If its really some unique use case that isnt one of the above, and every query must be a prefix
query,
at least bake it in with edge-ngrams filter at index time for better performance. (and just
use termquery)
                
> FuzzyQuery - fuzzy terms and misspellings are ranked higher than exact matches
> ------------------------------------------------------------------------------
>
>                 Key: LUCENE-2557
>                 URL: https://issues.apache.org/jira/browse/LUCENE-2557
>             Project: Lucene - Java
>          Issue Type: Bug
>          Components: core/query/scoring
>    Affects Versions: 3.0.2
>            Reporter: Jingkei Ly
>         Attachments: LUCENE-2557.patch, idf-scoring-test-case.patch
>
>
> The FuzzyQuery often causes misspellings to be ranked higher than the exact match, which
seems to be an undesirable property generally. 
> For example, in an index of surnames, if I search using a FuzzyQuery for "smith", the
misspellings such as "smiith", or "smiht" would appear near the top of the search results
ahead of documents that match "smith".

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


Mime
View raw message