lucene-dev mailing list archives

From "Randy Puttick (JIRA)" <j...@apache.org>
Subject [jira] Created: (LUCENE-524) Current implementation of fuzzy and wildcard queries inappropriately implemented as Boolean query rewrites
Date Sat, 18 Mar 2006 00:47:42 GMT
Current implementation of fuzzy and wildcard queries inappropriately implemented as Boolean
query rewrites
----------------------------------------------------------------------------------------------------------

         Key: LUCENE-524
         URL: http://issues.apache.org/jira/browse/LUCENE-524
     Project: Lucene - Java
        Type: Improvement
  Components: Search  
    Versions: 1.9    
    Reporter: Randy Puttick


The implementation of MultiTermQuery in terms of BooleanQuery introduces several problems:

1) Collisions with the maximum clause limit on boolean queries, which cause an exception to
be thrown.  This is the most problematic issue because it is difficult to determine in advance
how many terms a fuzzy query or wildcard query will expand to.

2) Boolean disjunctive scoring is not appropriate for either fuzzy or wildcard queries.
In effect, the score is divided by the number of terms in the query, which has nothing to do
with the relevance of the results.

3) Performance of disjunctive boolean queries over large term sets is quite suboptimal.
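
To make problems 1 and 2 concrete, here is a minimal, self-contained Java sketch. It does not
use Lucene itself: the class WildcardRewriteSketch, its methods, and the synthetic term
dictionary are all hypothetical, and the cap of 1024 is an assumption chosen to resemble the
default boolean clause limit. It only models the idea of expanding a wildcard into one clause
per matching term and of scaling a score by the fraction of clauses matched.

```java
import java.util.ArrayList;
import java.util.List;

public class WildcardRewriteSketch {
    // Assumed clause cap for this toy model (not read from any Lucene configuration).
    static final int MAX_CLAUSES = 1024;

    /**
     * Models a rewrite of "prefix*" into one disjunctive clause per matching term.
     * Throws once the number of matching terms exceeds the assumed cap.
     */
    static List<String> rewriteAsClauses(String prefix, List<String> dictionary) {
        List<String> clauses = new ArrayList<>();
        for (String term : dictionary) {
            if (term.startsWith(prefix)) {
                if (clauses.size() >= MAX_CLAUSES) {
                    throw new IllegalStateException(
                        "too many clauses: limit is " + MAX_CLAUSES);
                }
                clauses.add(term);
            }
        }
        return clauses;
    }

    /**
     * Toy coordination-style factor: the fraction of clauses a document matched.
     * A document matching one expanded term out of N is scaled by 1/N.
     */
    static double coordFactor(int matchedClauses, int totalClauses) {
        return (double) matchedClauses / totalClauses;
    }

    public static void main(String[] args) {
        // Synthetic dictionary: term0 .. term1999.
        List<String> dictionary = new ArrayList<>();
        for (int i = 0; i < 2000; i++) {
            dictionary.add("term" + i);
        }

        // A narrow wildcard stays under the cap, but the score factor for a
        // document matching a single expanded term shrinks with the expansion size.
        List<String> narrow = rewriteAsClauses("term19", dictionary);
        System.out.println("clauses: " + narrow.size());
        System.out.println("factor for one match: " + coordFactor(1, narrow.size()));

        // A broader wildcard over the same dictionary exceeds the cap.
        try {
            rewriteAsClauses("term1", dictionary);
        } catch (IllegalStateException e) {
            System.out.println("rewrite failed: " + e.getMessage());
        }
    }
}
```

The caller cannot tell from the query string alone which of the two wildcards will fail; that
depends entirely on the contents of the term dictionary, which is the difficulty described in
problem 1.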

