lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Erick Erickson <erickerick...@gmail.com>
Subject Re: StandardTokenizer and splitting on mixedcase strings
Date Mon, 26 Feb 2018 15:47:36 GMT
Dan:

The admin UI analysis page is invaluable for understanding exactly
what element of your analysis chain does what. So when you restructure
your analysis chain you can use it to see if the input transforms the
way you want it to.

Best,
Erick

On Mon, Feb 26, 2018 at 7:21 AM, Shawn Heisey <apache@elyograg.org> wrote:
> On 2/23/2018 10:55 AM, Rick Leir wrote:
>> Lowercase filter before the tokenizer?
>
> Unless somebody invents a lowercasing CharFilter, which I don't think
> exists currently, that's not possible.
>
> Groups of Solr analysis components always run in the following order:
>
> First CharFilter entries are run.
> Then the Tokenizer is run.
> Then Filter entries are run.
>
> Within each group, individual components run in the order they are
> configured, but the filters will always run after charfilters and the
> tokenizer.
>
> Thanks,
> Shawn
>

Mime
View raw message