lucene-solr-dev mailing list archives

From Grant Ingersoll <gsing...@apache.org>
Subject Re: Spell checking ?'s
Date Sat, 23 Feb 2008 14:00:13 GMT

On Feb 23, 2008, at 3:05 AM, Otis Gospodnetic wrote:

> Aha, good example, Sean.  What's the explanation?  Note that doing:
>    http://www.google.com/search?q=abdur+choudhury
> offers this alternative:
>    http://www.google.com/search?q=abdur+chowdhury
>
> And that the number of hits is approximately the same in both cases  
> and that Google is smart enough to search for and highlight  
> chowdhury even when the search was for choudhury.
>
> Google's spelling corrections/suggestions are driven off of massive  
> query (refinement) logs.  Solr's suggestions are based on the index  
> field content.

They probably are based off both the logs and index, right?  One thing  
that might be of interest is Karl Wettin's adaptive spell checker  
patch in Lucene, which I think attempts to learn as it goes, plus uses  
the index, but I haven't looked at it at a low level.
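
A minimal sketch of how the two signals discussed above (query refinement logs and index content) might be combined. This is an illustration only, not Karl Wettin's patch and not Solr's or Google's implementation. The names `record_refinement`, `suggest`, `index_freq`, and `refinements` are hypothetical, and plain string similarity from Python's `difflib` stands in for whatever distance measure a real spell checker would use.

```python
from collections import Counter
from difflib import get_close_matches


def record_refinement(refinements, typed, retyped):
    """Learn from a query log entry: a user typed `typed`, then retyped it as `retyped`."""
    refinements.setdefault(typed, Counter())[retyped] += 1


def suggest(term, index_freq, refinements, cutoff=0.8):
    """Return a suggested spelling for `term`, or None if there is nothing better.

    index_freq:  dict of indexed term -> document frequency (assumed input)
    refinements: dict of typed term -> Counter of later corrections (assumed input)
    """
    # 1. Log evidence first: the most common correction users made themselves.
    logged = refinements.get(term)
    if logged:
        return logged.most_common(1)[0][0]

    # 2. Fall back to the index: similar terms, excluding the term itself.
    candidates = [
        c for c in get_close_matches(term, list(index_freq), n=5, cutoff=cutoff)
        if c != term
    ]
    if not candidates:
        return None

    # Prefer the most frequent similar term, but only if it is more
    # common in the index than what the user typed.
    best = max(candidates, key=lambda c: index_freq[c])
    if index_freq.get(term, 0) >= index_freq[best]:
        return None
    return best


if __name__ == "__main__":
    index_freq = {"chowdhury": 120, "choudhury": 3, "church": 50}  # made-up counts
    refinements = {}
    print(suggest("choudhury", index_freq, refinements))
    record_refinement(refinements, "choudhry", "chowdhury")
    print(suggest("choudhry", index_freq, refinements))
```

With the made-up counts above, the index alone is enough to prefer the more frequent spelling, while a term that never appears in the index can still be corrected once a refinement for it has been logged.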

-Grant

