lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Robert Muir (JIRA)" <>
Subject [jira] [Updated] (LUCENE-4279) Regenerate Snowball code so its not so heavy
Date Wed, 01 Aug 2012 16:30:03 GMT


Robert Muir updated LUCENE-4279:

    Attachment: LUCENE-4279.patch

patch: no need to regenerate the ones from the website that arent in the package as they already
work this way (Irish/Basque/Catalan/Armenian)

I also added a thread safety test (just checkRandomData against all the languages).
> Regenerate Snowball code so its not so heavy
> --------------------------------------------
>                 Key: LUCENE-4279
>                 URL:
>             Project: Lucene - Core
>          Issue Type: Improvement
>            Reporter: Robert Muir
>             Fix For: 4.0, 5.0
>         Attachments: LUCENE-4279.patch
> Spinoff from LUCENE-3841 (and several threads on the list)
> Currently each SnowballStemmer is pretty heavy since each instance also contains a bunch
of Among objects (part of the stemming rules).
> This normally shouldnt be a problem, except it seems challenging
> for tomcat users to tune their threadpools (basically they are creating
> lots of tokenstreams, so lots of SnowballStemmers)
> Newer snowball just makes these static, and its easy enough to just
> regenerate so these aren't so heavy, it doesnt fix the real problem but it also doesn't

This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators:!default.jspa
For more information on JIRA, see:


To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message