lucene-solr-dev mailing list archives

From "Shalin Shekhar Mangar (JIRA)" <j...@apache.org>
Subject [jira] Updated: (SOLR-785) Distributed SpellCheckComponent
Date Mon, 07 Dec 2009 19:53:18 GMT

     [ https://issues.apache.org/jira/browse/SOLR-785?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Shalin Shekhar Mangar updated SOLR-785:
---------------------------------------

    Attachment: SOLR-785.patch

Thanks Simon!

# Overrides modifyRequest to enable spellcheck only for GET_TOP_IDS requests (otherwise the
spellcheck would run for every shard request)
# Requests at least 5 suggestions from each shard for more accurate merged results
# Re-enables stress testing (requires LUCENE-2108)
# Uses LinkedHashMap for the "origVsSuggested" map to preserve the order of tokens, because the
collation algorithm only works in order.
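
To illustrate the last point, here is a minimal, self-contained sketch of why insertion order matters for collation. It is not the code from the patch: the class name CollationSketch and the method collate are hypothetical, and the left-to-right replacement shown is only an assumed simplification of what an in-order collation does. The one thing taken from the patch notes is the use of a LinkedHashMap keyed by original token.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical illustration only; not the SpellCheckComponent implementation.
final class CollationSketch {

    /**
     * Builds a collated query by replacing each original token with its
     * suggestion, scanning the query strictly left to right. The map must
     * iterate in the order the tokens appear in the query, which is what a
     * LinkedHashMap filled in token order guarantees.
     */
    static String collate(String query, Map<String, String> origVsSuggested) {
        StringBuilder out = new StringBuilder();
        int cursor = 0; // position in the query up to which text has been emitted
        for (Map.Entry<String, String> e : origVsSuggested.entrySet()) {
            int at = query.indexOf(e.getKey(), cursor);
            if (at < 0) {
                continue; // token not found at or after the cursor, so it is skipped
            }
            out.append(query, cursor, at).append(e.getValue());
            cursor = at + e.getKey().length();
        }
        return out.append(query.substring(cursor)).toString();
    }

    /** Convenience helper: builds an order-preserving map from (original, suggestion) pairs. */
    static Map<String, String> orderedMap(String... pairs) {
        Map<String, String> m = new LinkedHashMap<>();
        for (int i = 0; i + 1 < pairs.length; i += 2) {
            m.put(pairs[i], pairs[i + 1]);
        }
        return m;
    }
}
```

With the tokens inserted in query order, collate("helo wrld", orderedMap("helo", "hello", "wrld", "world")) yields "hello world". If the map iterated "wrld" first (as an unordered HashMap might), the scan would already be past "helo" when it reached that entry and the first correction would be lost, which is the failure mode the LinkedHashMap avoids.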

All tests pass.

This can definitely be optimized further. The four maps can be removed if we collect the results
by iterating over the tokens in the original query (we may still need one map to cache
SpellCheckResponse). However, I have bigger itches to scratch, so I'll leave this here.

I'll commit this after a day or two in case somebody wants to review the patch (or improve
it).

> Distributed SpellCheckComponent
> -------------------------------
>
>                 Key: SOLR-785
>                 URL: https://issues.apache.org/jira/browse/SOLR-785
>             Project: Solr
>          Issue Type: Improvement
>          Components: spellchecker
>            Reporter: Shalin Shekhar Mangar
>            Assignee: Shalin Shekhar Mangar
>             Fix For: 1.5
>
>         Attachments: SOLR-785.patch, SOLR-785.patch, SOLR-785.patch, spelling-shard.patch
>
>
> Enhance the SpellCheckComponent to run in a distributed (sharded) environment.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

