Return-Path: X-Original-To: apmail-lucene-dev-archive@www.apache.org Delivered-To: apmail-lucene-dev-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 90FE0B9BE for ; Tue, 10 Jan 2012 18:45:03 +0000 (UTC) Received: (qmail 62820 invoked by uid 500); 10 Jan 2012 18:45:02 -0000 Delivered-To: apmail-lucene-dev-archive@lucene.apache.org Received: (qmail 62710 invoked by uid 500); 10 Jan 2012 18:45:01 -0000 Mailing-List: contact dev-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@lucene.apache.org Delivered-To: mailing list dev@lucene.apache.org Received: (qmail 62698 invoked by uid 99); 10 Jan 2012 18:45:01 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 10 Jan 2012 18:45:01 +0000 X-ASF-Spam-Status: No, hits=-2000.0 required=5.0 tests=ALL_TRUSTED,T_RP_MATCHES_RCVD X-Spam-Check-By: apache.org Received: from [140.211.11.116] (HELO hel.zones.apache.org) (140.211.11.116) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 10 Jan 2012 18:45:00 +0000 Received: from hel.zones.apache.org (hel.zones.apache.org [140.211.11.116]) by hel.zones.apache.org (Postfix) with ESMTP id BABE714226D for ; Tue, 10 Jan 2012 18:44:39 +0000 (UTC) Date: Tue, 10 Jan 2012 18:44:39 +0000 (UTC) From: "Okke Klein (Commented) (JIRA)" To: dev@lucene.apache.org Message-ID: <966968173.26403.1326221079766.JavaMail.tomcat@hel.zones.apache.org> In-Reply-To: <181280048.51968.1325180257928.JavaMail.tomcat@hel.zones.apache.org> Subject: [jira] [Commented] (SOLR-2993) Integrate WordBreakSpellChecker with Solr MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/SOLR-2993?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13183456#comment-13183456 ] Okke Klein commented on SOLR-2993: ---------------------------------- Thanks for the explanation. I experimented with onlyMorePopular and it worked a few times. Unfortunately it also showed unwanted behavior as expected. So https://issues.apache.org/jira/browse/SOLR-2585 would be a next step to see if it provides the behavior I'm looking for. For the English language this feature might not be very important, but for languages like Dutch and German that have a lot of compounded words, a spellchecker that also combines word parts even if one of them has a typo (like Google does) would be extremely useful. Unfortunately I'm not a programmer, but I'll gladly test anything you throw at me :) > Integrate WordBreakSpellChecker with Solr > ----------------------------------------- > > Key: SOLR-2993 > URL: https://issues.apache.org/jira/browse/SOLR-2993 > Project: Solr > Issue Type: Improvement > Components: SolrCloud, spellchecker > Affects Versions: 4.0 > Reporter: James Dyer > Priority: Minor > Fix For: 4.0 > > Attachments: SOLR-2993.patch > > > A SpellCheckComponent enhancement, leveraging the WordBreakSpellChecker from LUCENE-3523: > - Detect spelling errors resulting from misplaced whitespace without the use of shingle-based dictionaries. > - Seamlessly integrate word-break suggestions with single-word spelling corrections from the existing FileBased-, IndexBased- or Direct- spell checkers. > - Provide collation support for word-break errors including cases where the user has a mix of single-word spelling errors and word-break errors in the same query. > - Provide shard support. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org For additional commands, e-mail: dev-help@lucene.apache.org