lucene-solr-dev mailing list archives

From "Mike Krimerman (JIRA)" <j...@apache.org>
Subject [jira] Commented: (SOLR-395) Spell-check should return frequencies of word and suggestions
Date Fri, 02 Nov 2007 18:02:51 GMT

    [ https://issues.apache.org/jira/browse/SOLR-395?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12539653 ]

Mike Krimerman commented on SOLR-395:
-------------------------------------

With the new format, a query for pithon+progremming with extendedResults=true produces the following output:
{code:xml} 
<response>
    <lst name="responseHeader">
        <int name="status">0</int>
        <int name="QTime">173</int>
    </lst>
    <lst name="result">
        <lst name="pithon">
            <int name="frequency">5</int>
            <lst name="suggestions">
                <lst name="python">
                    <int name="frequency">18785</int>
                </lst>
            </lst>
        </lst>
        <lst name="progremming">
            <int name="frequency">0</int>
            <lst name="suggestions">
                <lst name="programming">
                    <int name="frequency">70997</int>
                </lst>
                <lst name="progressing">
                    <int name="frequency">1930</int>
                </lst>
                <lst name="programing">
                    <int name="frequency">597</int>
                </lst>
                <lst name="progamming">
                    <int name="frequency">113</int>
                </lst>
                <lst name="reprogramming">
                    <int name="frequency">344</int>
                </lst>
            </lst>
        </lst>
    </lst>
</response>
{code}
In this example the best suggestion for each word is the first one listed. That will not always
be the case: Lucene sorts results by distance first, so some queries may return a suggestion
that is very close to the query word but has a relatively low frequency. In that case a
suggestion that is somewhat farther away but has a much higher frequency should be chosen.
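
A client could apply that rule roughly as follows. This is only a sketch: the function names (parse_extended_results, pick_suggestion) and the min_ratio threshold of 10 are hypothetical and not part of the patch, and the XML is a trimmed copy of the response shown above. It assumes suggestions arrive closest-first, as described.

```python
import xml.etree.ElementTree as ET

# Trimmed copy of the extendedResults response from this comment.
SAMPLE = """<response>
    <lst name="result">
        <lst name="pithon">
            <int name="frequency">5</int>
            <lst name="suggestions">
                <lst name="python"><int name="frequency">18785</int></lst>
            </lst>
        </lst>
        <lst name="progremming">
            <int name="frequency">0</int>
            <lst name="suggestions">
                <lst name="programming"><int name="frequency">70997</int></lst>
                <lst name="progressing"><int name="frequency">1930</int></lst>
                <lst name="programing"><int name="frequency">597</int></lst>
                <lst name="progamming"><int name="frequency">113</int></lst>
                <lst name="reprogramming"><int name="frequency">344</int></lst>
            </lst>
        </lst>
    </lst>
</response>"""


def parse_extended_results(xml_text):
    """Return {word: (frequency, [(suggestion, frequency), ...])}.

    Suggestions keep the order in which the response lists them.
    """
    root = ET.fromstring(xml_text)
    result = root.find("lst[@name='result']")
    parsed = {}
    for word in result.findall("lst"):
        freq = int(word.find("int[@name='frequency']").text)
        suggestions = [
            (s.get("name"), int(s.find("int[@name='frequency']").text))
            for s in word.findall("lst[@name='suggestions']/lst")
        ]
        parsed[word.get("name")] = (freq, suggestions)
    return parsed


def pick_suggestion(suggestions, min_ratio=10.0):
    """Prefer the first (closest) suggestion unless the most frequent one
    is at least min_ratio times as frequent. min_ratio is an arbitrary
    illustrative threshold, not something defined by the patch."""
    if not suggestions:
        return None
    first = suggestions[0]
    most_frequent = max(suggestions, key=lambda s: s[1])
    # max(..., 1) avoids a zero threshold when the closest hit has frequency 0.
    if most_frequent[1] >= min_ratio * max(first[1], 1):
        return most_frequent[0]
    return first[0]


if __name__ == "__main__":
    for word, (freq, suggestions) in parse_extended_results(SAMPLE).items():
        print(word, freq, pick_suggestion(suggestions))
```

For the response above this keeps the first suggestion for both words. The farther suggestion only wins when the closest one is comparatively rare, for example a close match with frequency 3 listed ahead of "python" with 18785.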


> Spell-check should return frequencies of word and suggestions
> -------------------------------------------------------------
>
>                 Key: SOLR-395
>                 URL: https://issues.apache.org/jira/browse/SOLR-395
>             Project: Solr
>          Issue Type: Improvement
>          Components: spellchecker
>    Affects Versions: 1.3
>            Reporter: Mike Krimerman
>            Assignee: Mike Klaas
>            Priority: Minor
>             Fix For: 1.3
>
>         Attachments: extended_results.diff, returnFrequencies.patch
>
>
> When issuing a spell-check, the word being searched for might be present in the index
> with a very low frequency (e.g. a misspelling that made its way into the index). It might
> therefore be helpful if the client receives the frequency of the word plus the frequencies
> of each of the suggestions.
> This feature should be optional (using a URL param).

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

