lucene-solr-dev mailing list archives

From "Mike Krimerman (JIRA)" <>
Subject [jira] Commented: (SOLR-395) Spell-check should return frequencies of word and suggestions
Date Fri, 02 Nov 2007 18:02:51 GMT


Mike Krimerman commented on SOLR-395:

With extendedResults=true, a query for pithon+progremming produces the following output in the new format:
    <lst name="responseHeader">
        <int name="status">0</int>
        <int name="QTime">173</int>
    </lst>
    <lst name="result">
        <lst name="pithon">
            <int name="frequency">5</int>
            <lst name="suggestions">
                <lst name="python">
                    <int name="frequency">18785</int>
                </lst>
            </lst>
        </lst>
        <lst name="progremming">
            <int name="frequency">0</int>
            <lst name="suggestions">
                <lst name="programming">
                    <int name="frequency">70997</int>
                </lst>
                <lst name="progressing">
                    <int name="frequency">1930</int>
                </lst>
                <lst name="programing">
                    <int name="frequency">597</int>
                </lst>
                <lst name="progamming">
                    <int name="frequency">113</int>
                </lst>
                <lst name="reprogramming">
                    <int name="frequency">344</int>
                </lst>
            </lst>
        </lst>
    </lst>
In this example the best suggestions come first. Some queries, however, may return a suggestion
that is very close to the query word but has a relatively low frequency (Lucene sorts results
by distance first). In that case, a suggestion that is somewhat farther away but much more
frequent should be chosen.
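
The trade-off between distance and frequency described above could be sketched on the client side roughly as follows. This is only an illustration, not part of the patch: the function name pick_suggestion and the min_ratio threshold are hypothetical, and the input is assumed to be the list of (suggestion, frequency) pairs in the order the response lists them (closest first).

```python
def pick_suggestion(suggestions, min_ratio=10.0):
    """Pick one suggestion from (word, frequency) pairs ordered closest-first.

    The closest suggestion is kept unless a farther one is at least
    `min_ratio` times more frequent. `min_ratio` is a hypothetical
    threshold chosen for illustration only.
    """
    if not suggestions:
        return None
    best_word, best_freq = suggestions[0]
    for word, freq in suggestions[1:]:
        # Treat a zero frequency as 1 so the ratio test stays meaningful.
        if freq >= min_ratio * max(best_freq, 1):
            best_word, best_freq = word, freq
    return best_word


# Suggestions for "progremming", in the order shown in the response above.
progremming = [
    ("programming", 70997),
    ("progressing", 1930),
    ("programing", 597),
    ("progamming", 113),
    ("reprogramming", 344),
]
chosen = pick_suggestion(progremming)
```

With the frequencies from the example, the first suggestion is kept because no later one is ten times more frequent. A rare but close first suggestion would be replaced by a much more frequent later one.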

> Spell-check should return frequencies of word and suggestions
> -------------------------------------------------------------
>                 Key: SOLR-395
>                 URL:
>             Project: Solr
>          Issue Type: Improvement
>          Components: spellchecker
>    Affects Versions: 1.3
>            Reporter: Mike Krimerman
>            Assignee: Mike Klaas
>            Priority: Minor
>             Fix For: 1.3
>         Attachments: extended_results.diff, returnFrequencies.patch
> When issuing a spell-check, the word being searched for might be present in the index
> with a very low frequency (e.g. a misspelling that made its way into the index). It might
> therefore be helpful if the client receives the frequency of the word plus the frequencies
> of each of the suggestions.
> This feature should be optional (using a URL param).
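
As a rough sketch of how a client might use the returned frequencies, the following parses a well-formed "result" fragment in the extended format and flags a query word as a likely misspelling when a suggestion is far more frequent. The helper names parse_result and looks_misspelled and the min_ratio threshold are assumptions made for illustration; they are not part of Solr or of the attached patches.

```python
import xml.etree.ElementTree as ET

# A well-formed fragment in the extended format, using the "pithon" entry.
SAMPLE = """
<lst name="result">
  <lst name="pithon">
    <int name="frequency">5</int>
    <lst name="suggestions">
      <lst name="python">
        <int name="frequency">18785</int>
      </lst>
    </lst>
  </lst>
</lst>
"""


def parse_result(xml_text):
    """Return {word: (frequency, [(suggestion, frequency), ...])}."""
    root = ET.fromstring(xml_text)
    parsed = {}
    for word_el in root.findall("lst"):
        freq = int(word_el.find("int[@name='frequency']").text)
        suggestions = []
        sugg_el = word_el.find("lst[@name='suggestions']")
        if sugg_el is not None:
            for s in sugg_el.findall("lst"):
                s_freq = int(s.find("int[@name='frequency']").text)
                suggestions.append((s.get("name"), s_freq))
        parsed[word_el.get("name")] = (freq, suggestions)
    return parsed


def looks_misspelled(word_freq, suggestions, min_ratio=100.0):
    """True if some suggestion is at least `min_ratio` times more frequent.

    `min_ratio` is a hypothetical threshold, not a Solr setting.
    """
    top = max((f for _, f in suggestions), default=0)
    return top >= min_ratio * max(word_freq, 1)


parsed = parse_result(SAMPLE)
word_freq, suggestions = parsed["pithon"]
flagged = looks_misspelled(word_freq, suggestions)
```

Here "pithon" occurs in the index but is flagged, since "python" is several thousand times more frequent. A word whose frequency is comparable to that of its suggestions would be left alone.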

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.
