lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Erick Erickson <erickerick...@gmail.com>
Subject Re: WordDelimiter
Date Fri, 08 Aug 2014 22:53:24 GMT
You haven't really explained what you want to _do_. If you don't
want to split words up, just take WordDelimiterFilterFactory out.

Or do you want to split sometimes but not others?

Best,
Erick


On Fri, Aug 8, 2014 at 12:27 PM, EXTERNAL Taminidi Ravi (ETI,
Automotive-Service-Solutions) <external.Ravi.Taminidi@us.bosch.com> wrote:

> HI, I have a situation where I don't want to split the words, I am using
> the workdelimterfilter where it works good.
>
> For eg. If I send to analyszer for 100-001 , it is not splitting the
> keyword, but if I send 100-R8989 then the worddelimiter filter to 100 |
> R9889, below is the filed analyzer and filter. Same thing using for Query
> time.
>
> Let me know if I am missing something here.
>
> <analyzer type="index">
>
>                                   <charFilter
> class="solr.HTMLStripCharFilterFactory" />
>                                   <tokenizer
> class="solr.WhitespaceTokenizerFactory"/>
>
>                                   <filter class="solr.StopFilterFactory"
> ignoreCase="true" words="stopwords.txt" />
>
>                                   <filter
> class="solr.LowerCaseFilterFactory"/>
>                                   <filter class="solr.KStemFilterFactory"/>
>
>                                 <filter
> class="solr.WordDelimiterFilterFactory" generateWordParts="0"
> generateNumberParts="0" splitOnCaseChange="0"
>
> splitOnNumerics="0" stemEnglishPossessive="0" catenateWords="1"
> catenateNumbers="1"
>
> catenateAll="1" preserveOriginal="0"/>
>
>                                   <filter
> class="solr.SynonymFilterFactory" synonyms="synonyms.txt" ignoreCase="true"
> expand="true"/>
>
>                   </analyzer>
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message