lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From vit <bulgako...@yahoo.com>
Subject matching shingles issue
Date Mon, 24 Nov 2014 18:46:18 GMT
I have Solr 4.2.1
I am using the following analyser:
		<fieldType name="text_shingle" class="solr.TextField"
positionIncrementGap="100">
   			<analyzer type="index">
     			<tokenizer class="solr.WhitespaceTokenizerFactory"/>
     			<filter class="solr.ShingleFilterFactory" minShingleSize="2"
maxShingleSize="5"
             			outputUnigrams="true" outputUnigramsIfNoShingles="false"
tokenSeparator=" "/>
   			</analyzer>
   			<analyzer type="query">
     			<tokenizer class="solr.WhitespaceTokenizerFactory"/>
     			<filter class="solr.ShingleFilterFactory" minShingleSize="2"
maxShingleSize="5"
             			outputUnigrams="false" outputUnigramsIfNoShingles="true"
tokenSeparator=" "/>
   			</analyzer>
 		</fieldType>



for Query: 
description_shingle:Highest quality

I am getting Result:
<arr name="description_shingle">
      <str>Highest standards of quality installations!</str>
</arr>

So the result does not have shingle "Highest quality"
Instead it has 
"Highest standards of quality"

The question is why I am getting this match



--
View this message in context: http://lucene.472066.n3.nabble.com/matching-shingles-issue-tp4170685.html
Sent from the Solr - User mailing list archive at Nabble.com.

Mime
View raw message