lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Shri Kanishka (JIRA)" <j...@apache.org>
Subject [jira] [Created] (SOLR-3701) Solr Spellcheck for words with apostrophe
Date Thu, 02 Aug 2012 11:11:02 GMT
Shri Kanishka created SOLR-3701:
-----------------------------------

             Summary: Solr Spellcheck for words with apostrophe
                 Key: SOLR-3701
                 URL: https://issues.apache.org/jira/browse/SOLR-3701
             Project: Solr
          Issue Type: Bug
          Components: spellchecker
    Affects Versions: 3.5
         Environment: All
            Reporter: Shri Kanishka


Solr Spellcheck incorrect for words with Apostrophe.

http://10.224.64.10/solr5/select?q=pandora's star &spellcheck=true&spellcheck.collate=true&spellcheck.count=5

The result is

- <lst name="spellcheck">
- <lst name="suggestions">
- <lst name="pandora">
  <int name="numFound">2</int> 
  <int name="startOffset">6</int> 
  <int name="endOffset">13</int> 
- <arr name="suggestion">
  <str>pandora's</str> 
  <str>sandra</str> 
  </arr>
  </lst>
  <str name="collation">spell:pandora's's star</str> 
  </lst>
  </lst>

textSpell configuration in schema is as below

   <fieldType name="textSpell" class="solr.TextField" positionIncrementGap="100" omitNorms="true">
    <analyzer type="index">
	   <tokenizer class="solr.WhitespaceTokenizerFactory"/>

    	<!-- <tokenizer class="solr.StandardTokenizerFactory"/> -->
    	<filter class="solr.StopFilterFactory" ignoreCase="true" words="stopwords_selma.txt"/>
    	<filter class="solr.StandardFilterFactory"/>
    	<filter class="solr.LowerCaseFilterFactory"/> 
    	<filter class="solr.RemoveDuplicatesTokenFilterFactory"/>
  	</analyzer>
  <analyzer type="query">
        <tokenizer class="solr.WhitespaceTokenizerFactory"/>

    	<!--<tokenizer class="solr.StandardTokenizerFactory"/> -->
    	<filter class="solr.StopFilterFactory" ignoreCase="true" words="stopwords_selma.txt"/>
    	<filter class="solr.StandardFilterFactory"/>
    	<filter class="solr.LowerCaseFilterFactory"/>
    	<filter class="solr.RemoveDuplicatesTokenFilterFactory"/>
  	</analyzer>
   </fieldType>

But the same when given in &spellcheck.q paramter , it works,
http://10.224.64.10/solr5/select?q=spell:pandora's star&spellcheck=true&spellcheck.collate=true&spellcheck.q=pandora's
star

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


Mime
View raw message