lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Andrzej Bialecki ...@getopt.org>
Subject Re: Writing a stemmer
Date Fri, 04 Jun 2004 21:48:36 GMT
Leo Galambos wrote:

> Erik Hatcher <erik@ehatchersolutions.com> wrote:
> __________
> 
> 
>>>How proficient must I be in a language for which I wish to write the 
>>>stemmer?
>>
>>I would venture to say you would need to be an expert in a language to 
>>write a decent stemmer.
> 
> 
> I'm sorry for a self-promo ;), but
> the stemmer of egothor project can be
> adapted to any language, and you needn't be
> a language expert. Moreover, the stemmer
> achieves better F-measure than Porter's stemmers.

No reason to be too modest, Leo.. I tested your stemmer on English, 
Swedish and Polish texts (including F-measure vs. training set size 
plots), and it works exceptionally well indeed. Highly recommended!

-- 
Best regards,
Andrzej Bialecki

-------------------------------------------------
Software Architect, System Integration Specialist
CEN/ISSS EC Workshop, ECIMF project chair
EU FP6 E-Commerce Expert/Evaluator
-------------------------------------------------
FreeBSD developer (http://www.freebsd.org)


---------------------------------------------------------------------
To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org
For additional commands, e-mail: lucene-user-help@jakarta.apache.org


Mime
View raw message