Return-Path: Delivered-To: apmail-lucene-solr-dev-archive@locus.apache.org Received: (qmail 38317 invoked from network); 3 Jun 2008 19:53:41 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 3 Jun 2008 19:53:41 -0000 Received: (qmail 97997 invoked by uid 500); 3 Jun 2008 19:53:43 -0000 Delivered-To: apmail-lucene-solr-dev-archive@lucene.apache.org Received: (qmail 97962 invoked by uid 500); 3 Jun 2008 19:53:43 -0000 Mailing-List: contact solr-dev-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: solr-dev@lucene.apache.org Delivered-To: mailing list solr-dev@lucene.apache.org Received: (qmail 97951 invoked by uid 99); 3 Jun 2008 19:53:43 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 03 Jun 2008 12:53:43 -0700 X-ASF-Spam-Status: No, hits=3.5 required=10.0 tests=HTML_MESSAGE,SPF_PASS,WEIRD_PORT X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of shalinmangar@gmail.com designates 64.233.170.185 as permitted sender) Received: from [64.233.170.185] (HELO rn-out-0910.google.com) (64.233.170.185) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 03 Jun 2008 19:52:55 +0000 Received: by rn-out-0910.google.com with SMTP id j71so386782rne.4 for ; Tue, 03 Jun 2008 12:53:11 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:received:received:message-id:date:from:to:subject:in-reply-to:mime-version:content-type:references; bh=KepjmodpruKXSfkiZDJu1DpQWIWYvqlRzUCZZsTwOGI=; b=vGY7mxnc62QtNL3rOdPbF/84lGpLR2NABpJnjbhaR/fmBsPRf8+ojzstkLhRq0xFxjg1gRpLoayzq++icbZfzrtl+lLHxyWZh6aTd/0J90accvBs+6/vljYyo2jhhfcn42EaR2fh4bG8TW29/L/02V5KD+G87CB8UICp6wCp1OM= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=message-id:date:from:to:subject:in-reply-to:mime-version:content-type:references; b=aoBH0RCGjo165uCd0uGoAB+PRQ/od5am9fEKNEMC1LW5Saj1fzRSgebZQw1qxBUrGto87TP8Wdf6H4bmFwJ2gvBBiFhxhL4p4gwis/Cbo1LE0qq9AnOvP7yyYDavQjPdqonNY5I/OctXMhpjYi6koiWwnvgRD1ScYlBocxuLGRY= Received: by 10.143.2.19 with SMTP id e19mr4295299wfi.90.1212522791041; Tue, 03 Jun 2008 12:53:11 -0700 (PDT) Received: by 10.142.212.5 with HTTP; Tue, 3 Jun 2008 12:53:10 -0700 (PDT) Message-ID: <69de18140806031253v3e934fa8w6eccbe7772d08310@mail.gmail.com> Date: Wed, 4 Jun 2008 01:23:10 +0530 From: "Shalin Shekhar Mangar" To: solr-dev@lucene.apache.org Subject: Re: [jira] Commented: (SOLR-572) Spell Checker as a Search Component In-Reply-To: <449044530.1212182085487.JavaMail.jira@brutus> MIME-Version: 1.0 Content-Type: multipart/alternative; boundary="----=_Part_4680_29852879.1212522791025" References: <1371618452.1210573795577.JavaMail.jira@brutus> <449044530.1212182085487.JavaMail.jira@brutus> X-Virus-Checked: Checked by ClamAV on apache.org ------=_Part_4680_29852879.1212522791025 Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit Content-Disposition: inline The current patch has been broken for some days now and implementing a correct query parsing logic may take time to get right. Let's not aim for everything to get into the 1.3 release. I would like to cut down the scope of this issue to a implementation that indexes files and Lucene indices (both Solr and arbitary) and gives suggestions while using the correct analyzer for multi-word queries. Let's get a spell checker working and commit it. We can deal with more enhancements like abstractions for custom spellcheckers and query parsing etc. in another issue which can be dealt with separately (in 1.3 or after). Thoughts? If there is a general consensus, I can give a new patch which can be good enough to go in. On Sat, May 31, 2008 at 2:44 AM, Oleg Gnatovskiy (JIRA) wrote: > > [ > https://issues.apache.org/jira/browse/SOLR-572?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12601256#action_12601256] > > Oleg Gnatovskiy commented on SOLR-572: > -------------------------------------- > > I installed the latest patch. Still getting a NPE. Here is my config: > > class="org.apache.solr.handler.component.SpellCheckComponent"> > > > false > > false > > 1 > > > > name="classname">org.apache.solr.spelling.FileBasedSpellChecker > external > spellings.txt > UTF-8 > text_ws > name="indexDir">/usr/local/apache/lucene/solr2home/solr/data/spellIndex > > > > > Here is the URL I am hitting: > http://localhost:8983/solr/select/?q=pizza&spellcheck=true&spellcheck.dictionary=external&spellcheck.build=true > > Here is the error: > > HTTP Status 500 - null java.lang.NullPointerException at > org.apache.lucene.index.Term.(Term.java:39) at > org.apache.lucene.index.Term.(Term.java:36) at > org.apache.lucene.search.spell.SpellChecker.suggestSimilar(SpellChecker.java:228) > at > org.apache.solr.spelling.AbstractLuceneSpellChecker.getSuggestions(AbstractLuceneSpellChecker.java:71) > at > org.apache.solr.handler.component.SpellCheckComponent.process(SpellCheckComponent.java:177) > at > org.apache.solr.handler.component.SearchHandler.handleRequestBody(SearchHandler.java:153) > at > org.apache.solr.handler.RequestHandlerBase.handleRequest(RequestHandlerBase.java:125) > at org.apache.solr.core.SolrCore.execute(SolrCore.java:965) at > org.apache.solr.servlet.SolrDispatchFilter.execute(SolrDispatchFilter.java:339) > at > org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:274) > at > org.apache.catalina.core.ApplicationFilterChain.internalDoFilter(ApplicationFilterChain.java:235) > at > org.apache.catalina.core.ApplicationFilterChain.doFilter(ApplicationFilterChain.java:206) > at > org.apache.catalina.core.StandardWrapperValve.invoke(StandardWrapperValve.java:233) > at > org.apache.catalina.core.StandardContextValve.invoke(StandardContextValve.java:175) > at > org.apache.catalina.core.StandardHostValve.invoke(StandardHostValve.java:128) > at > org.apache.catalina.valves.ErrorReportValve.invoke(ErrorReportValve.java:102) > at > org.apache.catalina.core.StandardEngineValve.invoke(StandardEngineValve.java:109) > at > org.apache.catalina.connector.CoyoteAdapter.service(CoyoteAdapter.java:286) > at > org.apache.coyote.http11.Http11Processor.process(Http11Processor.java:844) > at > org.apache.coyote.http11.Http11Protocol$Http11ConnectionHandler.process(Http11Protocol.java:583) > at org.apache.tomcat.util.net.JIoEndpoint$Worker.run(JIoEndpoint.java:447) > at java.lang.Thread.run(Thread.java:619) > > spelling.txt is in my solr/home/conf. > > > Spell Checker as a Search Component > > ----------------------------------- > > > > Key: SOLR-572 > > URL: https://issues.apache.org/jira/browse/SOLR-572 > > Project: Solr > > Issue Type: New Feature > > Components: spellchecker > > Affects Versions: 1.3 > > Reporter: Shalin Shekhar Mangar > > Assignee: Grant Ingersoll > > Priority: Minor > > Fix For: 1.3 > > > > Attachments: SOLR-572.patch, SOLR-572.patch, SOLR-572.patch, > SOLR-572.patch, SOLR-572.patch, SOLR-572.patch, SOLR-572.patch, > SOLR-572.patch, SOLR-572.patch, SOLR-572.patch, SOLR-572.patch, > SOLR-572.patch, SOLR-572.patch, SOLR-572.patch, SOLR-572.patch > > > > > > Expose the Lucene contrib SpellChecker as a Search Component. Provide the > following features: > > * Allow creating a spell index on a given field and make it possible to > have multiple spell indices -- one for each field > > * Give suggestions on a per-field basis > > * Given a multi-word query, give only one consistent suggestion > > * Process the query with the same analyzer specified for the source field > and process each token separately > > * Allow the user to specify minimum length for a token (optional) > > Consistency criteria for a multi-word query can consist of the following: > > * Preserve the correct words in the original query as it is > > * Never give duplicate words in a suggestion > > -- > This message is automatically generated by JIRA. > - > You can reply to this email to add a comment to the issue online. > > -- Regards, Shalin Shekhar Mangar. ------=_Part_4680_29852879.1212522791025--