lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Shawn Heisey <s...@elyograg.org>
Subject Re: Which Tokenizer to use at searching
Date Mon, 10 Mar 2014 12:43:29 GMT
On 3/10/2014 6:20 AM, abhishek jain wrote:
> <tokenizer class="solr.PatternTokenizerFactory" pattern="\s+" />
> <filter class="solr.PatternReplaceFilterFactory" pattern="([^-\w]+)"
> replacement=" punct " replace="all"/>

<snip>

> Is there a way i can tokenize after application of filter, please suggest i
> know i am missing something basic.

Use PatternReplaceCharFilterFactory instead.  CharFilters are performed
before tokenizers, regardless of where they are defined in the analysis
chain.

Thanks,
Shawn


Mime
View raw message