lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Andrew Stromnov <strom...@gmail.com>
Subject Problem with Russian stemmer in Solr 1.2
Date Mon, 09 Jul 2007 15:06:12 GMT

Tried on JDK1.6p2 on MS Vista and CentOS.

query analyser config:
...
<filter class="solr.SnowballPorterFilterFactory" language="French" />
<filter class="solr.SnowballPorterFilterFactory" language="Russian" />
<filter class="solr.SnowballPorterFilterFactory" language="Finnish" />
...


Query: списки	arrondissement	turvallisuuden

Analysis.jsp result:
org.apache.solr.analysis.WhitespaceTokenizerFactory {}
term position 	1	2	3
term text 	списки	arrondissement	turvallisuuden

org.apache.solr.analysis.SnowballPorterFilterFactory {language=French}
term position 	1	2	3
term text 	списки	arrond	turvallisuuden

org.apache.solr.analysis.SnowballPorterFilterFactory {language=Russian}
term position 	1	2	3
term text 	списки	arrond	turvallisuuden

org.apache.solr.analysis.SnowballPorterFilterFactory {language=Finnish}
term position 	1	2	3
term text 	списки	arrond	turvallisuud


All stemmers, except Russian, works. Standalone snowball stemmer works
perfect.
Stemmed form of "списки" must be "списк".
-- 
View this message in context: http://www.nabble.com/Problem-with-Russian-stemmer-in-Solr-1.2-tf4049948.html#a11503583
Sent from the Solr - User mailing list archive at Nabble.com.


Mime
View raw message