lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Erick Erickson" <>
Subject Re: Automatic analyzer resolving based on Locale
Date Tue, 08 May 2007 13:26:57 GMT
There is nothing canned that I know of. I'm also not sure how this
would be used. If you're using a single index, how are you going
to index, then search using these analyzers? Or is there some
other magic going on?

Consider your document with a field "text". If you index into this
field with different analyzers at different times (language based),
how can you trust your results no matter *what* analyzer is used
at search time?

Conversely, if you only use a single analyzer at index time, how can
you trust queries created with different languages against that

Or am I completely off base and you have no intention of
putting more than one language in an index <G>?


On 5/8/07, Geoffrey De Smet <> wrote:
> I have a use case, in which I need to select the Analyzer based on a
> Locale.
> For example:
> "nl" => DutchAnalyzer
> "nl_BE" => DutchAnalyzer
> "fr" => FrenchAnalyzer
> "foobar" => StandardAnalyzer (fallback)
> I was wondering if lucene has any sort of "AutomaticAnalyzerResolver"
> class that could do this for me?
> I 'll write it myself if it doesn't exist, but it might be very handy
> for the community to have this in Lucene's jar, as when new Analyzers
> are added, they could be hooked in automatically.
> --
> With kind regards,
> Geoffrey De Smet
> ---------------------------------------------------------------------
> To unsubscribe, e-mail:
> For additional commands, e-mail:

  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message