lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Anupam Bhattacharya <anupam...@gmail.com>
Subject Re: No Suggestions from SpellCheck when _text_ field tokenizer set to solr.NGramTokenizerFactory
Date Sun, 04 Feb 2018 13:23:23 GMT
Pls. help to understand the root cause of this behavior.

Even though text_spellcheck fieldType is using solr.StandardTokenizerFactory
& doesnt have any relation with text_general field which is using
solr.NGramTokenizerFactory
tokenizer why the Solr Spell check services is not working as expected.

Regards,
Anupam

On Fri, Feb 2, 2018 at 1:04 PM, Anupam Bhattacharya <anupamb82@gmail.com>
wrote:

> I have configured Solr Managed-schema as following
>
> Below configuration is for Full Text Search:
>
> <fieldType name="text_general" class="solr.TextField"
> positionIncrementGap="100">
>     <analyzer type="index">
>       <!-- <tokenizer class="solr.StandardTokenizerFactory"/> -->
>       <tokenizer class="solr.NGramTokenizerFactory" minGramSize="2"
> maxGramSize="10"/>
>       <filter class="solr.StopFilterFactory" words="stopwords.txt"
> ignoreCase="true"/>
>       <filter class="solr.LowerCaseFilterFactory"/>
>     </analyzer>
>     <analyzer type="query">
>       <!-- <tokenizer class="solr.StandardTokenizerFactory"/> -->
>       <tokenizer class="solr.NGramTokenizerFactory" minGramSize="2"
> maxGramSize="10"/>
>       <filter class="solr.StopFilterFactory" words="stopwords.txt"
> ignoreCase="true"/>
>       <filter class="solr.SynonymGraphFilterFactory" expand="true"
> ignoreCase="true" synonyms="synonyms.txt"/>
>       <filter class="solr.LowerCaseFilterFactory"/>
>     </analyzer>
>   </fieldType>
>
> Following is the configuration for Spell check type of field.
>
> <fieldType name="text_spellcheck" class="solr.TextField"
> positionIncrementGap="100">
>     <analyzer>
>       <tokenizer class="solr.StandardTokenizerFactory"/>
>       <filter class="solr.StandardFilterFactory"/>
>       <filter class="solr.StopFilterFactory" words="stopwords.txt"
> ignoreCase="true"/>
>       <filter class="solr.LowerCaseFilterFactory"/>
>     </analyzer>
>   </fieldType>
>
> Below is the field on which I do spell check
>
> <field name="title_txt_spellcheck_ja" type="text_spellcheck"
> omitNorms="true" indexed="true" stored="true"/>
>
> Below is the full text field
> <field name="_text_" type="text_general" multiValued="true" indexed="true"
> stored="false"/>
>
> I copy text from another field for suggestions.
>
> <copyField source="title_txt_ja" dest="title_txt_spellcheck_ja"/>
>
> /spell?fl=id,title_txt_spellcheck_ja&wt=json&defType=
> edismax&q=te&spellcheck=on&spellcheck.count=10&spellcheck.collate=true&
> spellcheck.dictionary=title_txt_spellcheck_ja&spellcheck.
> collateExtendedResults=true&spellcheck.maxCollations=3
>
> All the configurations were working fine till i changed
> <tokenizer class="solr.StandardTokenizerFactory"/>  to
> <tokenizer class="solr.NGramTokenizerFactory" minGramSize="2"
> maxGramSize="10"/>
> in text_general field type.
>
> Any clues ?
>
> Regards
> Anupam Bhattacharya
>
>


-- 
Thanks & Regards
Anupam Bhattacharya

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message