lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Erick Erickson <erickerick...@gmail.com>
Subject Re: SnowballAnalyzer and StopAnalyzer.ENGLISH_STOP_WORDS_SET ?
Date Mon, 14 Dec 2009 17:34:33 GMT
Hmmm, that does seem a little confusing... Have you tried using
ENGLISH_STOP_WORDS_SET.toArray()?

WARNING: I haven't tried this myself and don't have access to
a test bed....

Erick

On Mon, Dec 14, 2009 at 7:55 AM, Nick Burch <nick@torchbox.com> wrote:

> Hi All
>
> I'm upgrading my code from 2.4 to 2.9, and I've hit an issue with
> deprecations.
>
> My old code was:
>        new SnowballAnalyzer("English", StopAnalyzer.ENGLISH_STOP_WORDS);
>
> Looking at the JavaDocs, I'd expected that the new format would be:
>        new SnowballAnalyzer(Version.LUCENE_CURRENT, "English",
> StopAnalyzer.ENGLISH_STOP_WORDS_SET);
>
> However, while StopAnalyzer has depcrecated ENGLISH_STOP_WORDS in favour of
> ENGLISH_STOP_WORDS_SET, and while SnowballAnalyzer seems to use a Set based
> stopwords list internally, there doesn't seem to be a constructor for
> SnowballAnalyzer that takes a set, only a String[] one
>
> Has anyone hit this before? Am I safe to make a patch to SnowballAnalyzer
> to create a matching constructor taking Set rather than String[] so I can
> use the non deprecated stopwords from StopAnalyzer, or am I barking up the
> wrong tree here?
>
> Thanks
> Nick
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
> For additional commands, e-mail: java-user-help@lucene.apache.org
>
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message