lucene-dev mailing list archives

From "Robert Engels" <reng...@ix.netcom.com>
Subject RE: Fuzzy query with Jaro-Winkler distance
Date Tue, 20 Apr 2004 12:50:49 GMT
I think it is as simple as modifying FuzzyQuery.java to check an 'environment
variable', and then instantiate either a FuzzyTermEnum() or a
SpecialFuzzyTermEnum().

All of the logic of 'FuzzyTerm' is contained in FuzzyTermEnum.java. If it is
nothing more than a different distance calculation, then just check the
environment variable in FuzzyTermEnum.java, and call the appropriate distance
calculation routine.
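
A minimal sketch of that switch, kept independent of Lucene so it runs on its
own. Everything named here is an assumption for illustration: the class
FuzzyDistanceSketch, the system property "fuzzy.distance" (standing in for
the 'environment variable'), and the choice to normalize edit distance by the
longer string. It is not the code in FuzzyTermEnum.java.

```java
// Hypothetical stand-alone sketch; not part of Lucene.
class FuzzyDistanceSketch {

    // Picks the metric from a system property (name is an assumption),
    // e.g. java -Dfuzzy.distance=jaro-winkler ...
    static double similarity(String a, String b) {
        return similarity(System.getProperty("fuzzy.distance", "levenshtein"), a, b);
    }

    // Dispatches to the requested similarity routine; result is in [0, 1].
    static double similarity(String metric, String a, String b) {
        if ("jaro-winkler".equals(metric)) {
            return jaroWinkler(a, b);
        }
        int longer = Math.max(a.length(), b.length());
        return longer == 0 ? 1.0 : 1.0 - (double) levenshtein(a, b) / longer;
    }

    // Classic edit distance using two rolling rows.
    static int levenshtein(String s, String t) {
        int[] prev = new int[t.length() + 1];
        int[] curr = new int[t.length() + 1];
        for (int j = 0; j <= t.length(); j++) prev[j] = j;
        for (int i = 1; i <= s.length(); i++) {
            curr[0] = i;
            for (int j = 1; j <= t.length(); j++) {
                int cost = s.charAt(i - 1) == t.charAt(j - 1) ? 0 : 1;
                curr[j] = Math.min(Math.min(curr[j - 1] + 1, prev[j] + 1),
                                   prev[j - 1] + cost);
            }
            int[] tmp = prev; prev = curr; curr = tmp;
        }
        return prev[t.length()];
    }

    // Jaro similarity: matches within a sliding window, minus transpositions.
    static double jaro(String s, String t) {
        if (s.isEmpty() && t.isEmpty()) return 1.0;
        int window = Math.max(0, Math.max(s.length(), t.length()) / 2 - 1);
        boolean[] sMatched = new boolean[s.length()];
        boolean[] tMatched = new boolean[t.length()];
        int matches = 0;
        for (int i = 0; i < s.length(); i++) {
            int lo = Math.max(0, i - window);
            int hi = Math.min(t.length() - 1, i + window);
            for (int j = lo; j <= hi; j++) {
                if (!tMatched[j] && s.charAt(i) == t.charAt(j)) {
                    sMatched[i] = true;
                    tMatched[j] = true;
                    matches++;
                    break;
                }
            }
        }
        if (matches == 0) return 0.0;
        // Count matched characters that appear in a different order.
        int outOfOrder = 0;
        int k = 0;
        for (int i = 0; i < s.length(); i++) {
            if (!sMatched[i]) continue;
            while (!tMatched[k]) k++;
            if (s.charAt(i) != t.charAt(k)) outOfOrder++;
            k++;
        }
        double m = matches;
        return (m / s.length() + m / t.length() + (m - outOfOrder / 2.0) / m) / 3.0;
    }

    // Winkler adjustment: boost for a shared prefix of up to 4 characters,
    // using the conventional scaling factor 0.1.
    static double jaroWinkler(String s, String t) {
        double j = jaro(s, t);
        int limit = Math.min(4, Math.min(s.length(), t.length()));
        int prefix = 0;
        while (prefix < limit && s.charAt(prefix) == t.charAt(prefix)) prefix++;
        return j + prefix * 0.1 * (1.0 - j);
    }
}
```

With this shape, the term enumeration only ever calls similarity(a, b), so
adding another metric means adding one more branch and one more routine.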

If you create a new 'Query' class, then you have to modify the expression
language to add a new 'term' character, which could get messy eventually
(run out of characters).

The only issue with the current FuzzyTerm implementation is that it requires
a full linear search of all of the terms in the index.
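
To make the cost concrete, here is a rough stand-alone sketch of such a scan.
The term list is a plain in-memory stand-in for the index's terms, and the
class name, method names, and the normalization by the longer string are all
assumptions for illustration, not Lucene code. Every term is compared against
the query term, so the work grows with the number of terms in the index.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical stand-alone sketch; not part of Lucene.
class LinearTermScanSketch {

    // Visits every term once and keeps those at or above the threshold.
    static List<String> matchingTerms(List<String> terms, String query,
                                      double minSimilarity) {
        List<String> hits = new ArrayList<String>();
        for (String term : terms) {   // no way to skip terms: full scan
            if (similarity(term, query) >= minSimilarity) {
                hits.add(term);
            }
        }
        return hits;
    }

    // Edit distance scaled into [0, 1] by the longer string (an assumption).
    static double similarity(String a, String b) {
        int longer = Math.max(a.length(), b.length());
        return longer == 0 ? 1.0 : 1.0 - (double) editDistance(a, b) / longer;
    }

    // Classic edit distance using two rolling rows.
    static int editDistance(String s, String t) {
        int[] prev = new int[t.length() + 1];
        int[] curr = new int[t.length() + 1];
        for (int j = 0; j <= t.length(); j++) prev[j] = j;
        for (int i = 1; i <= s.length(); i++) {
            curr[0] = i;
            for (int j = 1; j <= t.length(); j++) {
                int cost = s.charAt(i - 1) == t.charAt(j - 1) ? 0 : 1;
                curr[j] = Math.min(Math.min(curr[j - 1] + 1, prev[j] + 1),
                                   prev[j - 1] + cost);
            }
            int[] tmp = prev; prev = curr; curr = tmp;
        }
        return prev[t.length()];
    }
}
```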

Robert

-----Original Message-----
From: Erik Hatcher [mailto:erik@ehatchersolutions.com]
Sent: Tuesday, April 20, 2004 4:44 AM
To: Lucene Developers List
Subject: Re: Fuzzy query with Jaro-Winkler distance


On Apr 20, 2004, at 5:11 AM, eks dev wrote:
> Hi All,
>    I would like to use Fuzzy Query with another
> type(s)  of string distance.

You will have to write your own Query (probably subclass
MultiTermQuery) to do this.  The FuzzyQuery calculations are buried
deep and not customizable (at least not currently).

	Erik


---------------------------------------------------------------------
To unsubscribe, e-mail: lucene-dev-unsubscribe@jakarta.apache.org
For additional commands, e-mail: lucene-dev-help@jakarta.apache.org

