lucene-solr-dev mailing list archives

From "Hoss Man (JIRA)" <j...@apache.org>
Subject [jira] Commented: (SOLR-630) Spellchecker should not be case-sensitive and should be stopwords-aware
Date Wed, 26 Aug 2009 00:55:59 GMT

    [ https://issues.apache.org/jira/browse/SOLR-630?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12747724#action_12747724 ]

Hoss Man commented on SOLR-630:
-------------------------------

bq. P.S. I believe that one can avoid case-sensitive issue by configuring properly the analyzers
(e.g. for the spellchecker field).

yeah ... without a concrete example of what kind of config can produce these bugs, my gut
assumption is that with *some* config for spellchecker this problem doesn't exist.

at which point this bug really just becomes a question of whether our current example/documentation
is advocating the best solution.
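
For illustration only, here is a minimal Python sketch (not Solr code) of why analyzer
configuration matters for the case-sensitivity symptom: when the query goes through the same
normalization as the dictionary terms, the mismatch disappears. The `analyze` helper and the
term list are hypothetical stand-ins for an analyzer chain and a spellcheck dictionary.

```python
def analyze(text, lowercase=True):
    """Toy stand-in for an analyzer chain: whitespace tokenizer plus optional lowercasing."""
    tokens = text.split()
    return [t.lower() for t in tokens] if lowercase else tokens


# Hypothetical dictionary terms, lowercased as if the spellcheck field
# applied a lowercasing filter at index time.
dictionary = set(analyze("United States America Americaa oft"))


def unknown_terms(query, lowercase_query):
    """Return the query tokens that are missing from the dictionary."""
    tokens = analyze(query, lowercase=lowercase_query)
    return [t for t in tokens if t not in dictionary]


# Query analyzed differently from the dictionary: "America" looks misspelled.
mismatched = unknown_terms("united states America", lowercase_query=False)

# Query analyzed the same way as the dictionary: nothing is flagged.
matched = unknown_terms("united states America", lowercase_query=True)

print(mismatched)
print(matched)
```

In this toy model the only difference between the two calls is whether query-time analysis
matches index-time analysis, which is the kind of config difference suspected above.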




> Spellchecker should not be case-sensitive and should be stopwords-aware
> -----------------------------------------------------------------------
>
>                 Key: SOLR-630
>                 URL: https://issues.apache.org/jira/browse/SOLR-630
>             Project: Solr
>          Issue Type: Bug
>          Components: spellchecker
>            Reporter: Otis Gospodnetic
>            Priority: Minor
>             Fix For: 1.5
>
>
> Here are 2 more bugs:
> 1)
> Search for:
>   united states of America
> Suggests:
>  united states oft America
> It looks like the SC doesn't check stopwords, and "of" is a stopword.  Thus, it does not exist in the index,
> but "oft" does, so SC suggests "oft" and thinks "of" is misspelled.  I think the SC component should check the list of
> stopwords, too, no?
> 2)
> Search for:
>  united states of America
> Suggests:
>  united states oft Americaa
> The of->oft is described above.  But note how SC suggested America->Americaa, but it didn't do that for "america".
> This looks like a case-sensitivity problem.  Shouldn't the SC be case-insensitive?
> I can't produce a patch now (no src handy), so I'm hoping Grant or somebody else can do it based on this report.
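
The stopword behaviour described in the report can be reproduced with a small Python sketch.
This is a toy model under stated assumptions, not the Solr spellchecker: the stopword list,
the index terms, and the `suggest` function are all hypothetical, and `difflib` stands in for
whatever distance measure the real component uses.

```python
import difflib

STOPWORDS = {"of", "the", "and"}                        # hypothetical stopword list
INDEX_TERMS = ["united", "states", "america", "oft"]    # "of" was dropped at index time


def suggest(query, stopword_aware):
    """Replace each unknown token with its closest index term, if any."""
    out = []
    for token in query.lower().split():
        known = token in INDEX_TERMS
        skipped = stopword_aware and token in STOPWORDS
        if known or skipped:
            out.append(token)
            continue
        close = difflib.get_close_matches(token, INDEX_TERMS, n=1)
        out.append(close[0] if close else token)
    return " ".join(out)


# Without a stopword check, "of" is missing from the index and gets "corrected".
naive = suggest("united states of America", stopword_aware=False)

# With a stopword check, "of" is left alone.
aware = suggest("united states of America", stopword_aware=True)

print(naive)
print(aware)
```

The sketch lowercases the query before lookup, so it isolates the stopword issue from the
case-sensitivity issue.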

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

