lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Mark Miller <markrmil...@gmail.com>
Subject Re: [jira] Commented: (LUCENE-1029) Illegal character replacements in ISOLatin1AccentFilter
Date Fri, 19 Oct 2007 12:24:22 GMT

> If you are to compare with stemmers, consider that these creates unique tokens that does
not interfere with semantic meanings.
>   
Not starting anything here again, but it took me so darn long to find 
something that porter stems and kills the semantic meaning that I had to 
share. That damn algorithm is amazing...I was coming to the conclusion 
that it was absolutely perfect on the English language...until after a 
couple days of searching I found international goes to intern. Eureka! 
Though a hollow victory at best. That algorithm is pretty amazing...

---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-dev-help@lucene.apache.org


Mime
View raw message