lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Andrzej Bialecki>
Subject Re: Writing a stemmer
Date Fri, 04 Jun 2004 21:48:36 GMT
Leo Galambos wrote:

> Erik Hatcher <> wrote:
> __________
>>>How proficient must I be in a language for which I wish to write the 
>>I would venture to say you would need to be an expert in a language to 
>>write a decent stemmer.
> I'm sorry for a self-promo ;), but
> the stemmer of egothor project can be
> adapted to any language, and you needn't be
> a language expert. Moreover, the stemmer
> achieves better F-measure than Porter's stemmers.

No reason to be too modest, Leo.. I tested your stemmer on English, 
Swedish and Polish texts (including F-measure vs. training set size 
plots), and it works exceptionally well indeed. Highly recommended!

Best regards,
Andrzej Bialecki

Software Architect, System Integration Specialist
CEN/ISSS EC Workshop, ECIMF project chair
EU FP6 E-Commerce Expert/Evaluator
FreeBSD developer (

To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message