lucene-solr-user mailing list archives

From Aloke Ghoshal <alghos...@gmail.com>
Subject Re: struggling with solr.WordDelimiterFilterFactory
Date Fri, 16 Aug 2013 11:30:18 GMT
Hi,

Given your WhitespaceTokenizerFactory, and because of the
LowerCaseFilterFactory, the terms actually indexed are:
speed, post, speedpost

You should get results for: q=Content:speedpost

So either remove the LowerCaseFilterFactory, or add the
LowerCaseFilterFactory to the query-time analyzer as well.
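
As a rough illustration of the second option, here is a minimal sketch of the
same field type with separate index-time and query-time analyzers. The
type="index" / type="query" split and every query-side attribute value are
assumptions for illustration, not settings taken from your schema, so check
the result against your own data before relying on it.

```xml
<!-- Sketch only: the index/query split and the query-side values below
     are assumptions, not part of the original schema. -->
<fieldType name="text" class="solr.TextField" positionIncrementGap="100">
  <!-- Index side: same filter chain as in the original definition. -->
  <analyzer type="index">
    <tokenizer class="solr.WhitespaceTokenizerFactory"/>
    <filter class="solr.WordDelimiterFilterFactory"
            splitOnCaseChange="1" generateWordParts="1"
            generateNumberParts="1" catenateWords="1"
            catenateNumbers="1" catenateAll="1"
            preserveOriginal="1"/>
    <filter class="solr.LowerCaseFilterFactory"/>
  </analyzer>
  <!-- Query side: lowercasing is applied here too. Assumption:
       splitOnCaseChange="0" and the catenate options off, so a query
       term such as speedPost stays one token and is lowercased. -->
  <analyzer type="query">
    <tokenizer class="solr.WhitespaceTokenizerFactory"/>
    <filter class="solr.WordDelimiterFilterFactory"
            splitOnCaseChange="0" generateWordParts="1"
            generateNumberParts="1" catenateWords="0"
            catenateNumbers="0" catenateAll="0"/>
    <filter class="solr.LowerCaseFilterFactory"/>
  </analyzer>
</fieldType>
```

The Analysis screen in the Solr admin UI shows the tokens each chain produces
for a given input, which is a quick way to compare the index and query sides.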

Regards,
Aloke




On Fri, Aug 16, 2013 at 4:53 PM, vicky desai <vicky.desai@germinait.com> wrote:

> Hi All,
>
> I have a query regarding the use of WordDelimiterFilterFactory. My schema
> definition for the text field is as follows:
>
> <fieldType name="text" class="solr.TextField"
>            positionIncrementGap="100">
>   <analyzer>
>     <tokenizer class="solr.WhitespaceTokenizerFactory" />
>     <filter class="solr.WordDelimiterFilterFactory"
>             splitOnCaseChange="1" generateWordParts="1"
>             generateNumberParts="1" catenateWords="1"
>             catenateNumbers="1" catenateAll="1"
>             preserveOriginal="1"/>
>     <filter class="solr.LowerCaseFilterFactory" />
>   </analyzer>
> </fieldType>
>
> <field name="Content" type="text" indexed="true" stored="true"
>        multiValued="false"/>
>
> If I make the following query: q=Content:speedPost
>
> then docs having Content *speed post* are matched, which is as expected, but
> docs having Content *speedpost* do not match.
>
> Can anybody please point out where I am going wrong?
>
>
>
> --
> View this message in context:
> http://lucene.472066.n3.nabble.com/struggling-with-solr-WordDelimiterFilterFactory-tp4085021.html
> Sent from the Solr - User mailing list archive at Nabble.com.
>
