lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Pete Lewis" <p...@uptima.co.uk>
Subject Re: Using MySpell iso the Snowball Analyzer
Date Thu, 09 Sep 2004 13:07:41 GMT
Hi Aad

Use the stemmed result as what you index, but then also remember to stem the
query terms as well - you need to do the same on the way out as on the way
in.

We don't use MySpell but we do use our own stemmer in this way, as there are
many examples where Snowball falls down like:

caught -> caught instead of catch
buses -> buse instead of bus

and Snowball gets worse for none-English languages like Dutch....

Cheers
Pete

----- Original Message ----- 
From: "Aad Nales" <aad.nales@rotterdam-cs.com>
To: <lucene-user@jakarta.apache.org>
Sent: Thursday, September 09, 2004 8:44 AM
Subject: Using MySpell iso the Snowball Analyzer


> For an eductational customer we have been requested to add spell
> checking to queries that enter lucene. The MySpell classes of
> Pietschmann seem to makes this more than feasible. What i wonder if
> somebody else has done this before? Any tips, questions or remarks?
>
> MySpell is the successor of ISpell and is used as the spellchecker in
> OpenOffice. It excutes a stemming algoritm in combination with a
> dictionary. My second question is if any has extracted the stemming
> result to be used in an index?
>
> Thanks for any or all feedback,
> cheers,
> Aad
>
>
> --
> Aad Nales
> aad.nales@rotterdam-cs.com, +31-(0)6 54 207 340
>
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org
> For additional commands, e-mail: lucene-user-help@jakarta.apache.org
>
>


---------------------------------------------------------------------
To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org
For additional commands, e-mail: lucene-user-help@jakarta.apache.org


Mime
View raw message