lucene-dev mailing list archives

From Mark Miller
Subject Re: [jira] Commented: (LUCENE-1029) Illegal character replacements in ISOLatin1AccentFilter
Date Fri, 19 Oct 2007 12:24:22 GMT

> If you are to compare with stemmers, consider that these creates unique tokens that does
> not interfere with semantic meanings.
Not starting anything here again, but it took me so darn long to find
a word that the Porter stemmer stems in a way that kills the semantic
meaning that I had to share. That damn algorithm is amazing... I was
coming to the conclusion that it was absolutely perfect on the English
language, until after a couple of days of searching I found that
"international" goes to "intern". Eureka! Though a hollow victory at
best. That algorithm is pretty amazing...
