Return-Path: Delivered-To: apmail-lucene-solr-user-archive@locus.apache.org Received: (qmail 29869 invoked from network); 9 Jul 2007 10:21:07 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 9 Jul 2007 10:21:07 -0000 Received: (qmail 87925 invoked by uid 500); 9 Jul 2007 10:21:08 -0000 Delivered-To: apmail-lucene-solr-user-archive@lucene.apache.org Received: (qmail 87416 invoked by uid 500); 9 Jul 2007 10:21:07 -0000 Mailing-List: contact solr-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: solr-user@lucene.apache.org Delivered-To: mailing list solr-user@lucene.apache.org Received: (qmail 87407 invoked by uid 99); 9 Jul 2007 10:21:07 -0000 Received: from herse.apache.org (HELO herse.apache.org) (140.211.11.133) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 09 Jul 2007 03:21:07 -0700 X-ASF-Spam-Status: No, hits=2.0 required=10.0 tests=HTML_MESSAGE,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (herse.apache.org: domain of climbingrose@gmail.com designates 64.233.162.228 as permitted sender) Received: from [64.233.162.228] (HELO nz-out-0506.google.com) (64.233.162.228) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 09 Jul 2007 03:21:03 -0700 Received: by nz-out-0506.google.com with SMTP id i28so546856nzi for ; Mon, 09 Jul 2007 03:20:42 -0700 (PDT) DKIM-Signature: a=rsa-sha1; c=relaxed/relaxed; d=gmail.com; s=beta; h=domainkey-signature:received:received:message-id:date:from:to:subject:in-reply-to:mime-version:content-type:references; b=QHgNzVSk08f588Xl/p8E9UPgq/8iJtUh57AacEgQAp9C82XH1h6J4sIwJ6YVwwcRFq2RAqerpP2a1OEHQM7WG4CdtkWl3FSe55RMUyD55psIye3TE1CGfOsdKk5OJ4xymTFJpvhHeBLsy8cPFkOz2IvdQHOCcGVGkYiMRBmRpwM= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=beta; h=received:message-id:date:from:to:subject:in-reply-to:mime-version:content-type:references; b=RfYYWmIv65hRllgo0t5wj3PJ8oyBIGkTgOriM+/ufYROtshDWVsE/CZFo3GjpKbNBB1GbwNTM/ynrRZSfeknQmg6giNcMFJvxV1z56984tPqfMX3+hdUK/Z8eWjnPUoKbYsJXGtkb5RRBMTZi14WXkIy+uottffTlV0aqg0h8gI= Received: by 10.142.101.17 with SMTP id y17mr223005wfb.1183976442047; Mon, 09 Jul 2007 03:20:42 -0700 (PDT) Received: by 10.142.84.1 with HTTP; Mon, 9 Jul 2007 03:20:42 -0700 (PDT) Message-ID: <3e7716cd0707090320j1682884du9a26d7e8de132038@mail.gmail.com> Date: Mon, 9 Jul 2007 20:20:42 +1000 From: climbingrose To: solr-user@lucene.apache.org Subject: Re: Spell Check Handler In-Reply-To: MIME-Version: 1.0 Content-Type: multipart/alternative; boundary="----=_Part_147881_30441947.1183976442016" References: <423642.58958.qm@web50308.mail.re2.yahoo.com> <3e7716cd0707090032g40a7c532p47f319c9d06ab911@mail.gmail.com> X-Virus-Checked: Checked by ClamAV on apache.org ------=_Part_147881_30441947.1183976442016 Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit Content-Disposition: inline Thanks for the quick reply. However, I'm still not able to setup spellchecker. Solr does create spell directory under data but doesn't seem to build the spellchecker index. Here are snippets of my schema.xml: 1 0.5 spell title I tried this url: http://localhost:8984/solr/select/?q=Accountent&qt=spellchecker&cmd=rebuildand receive this: 0 2 rebuild On 7/9/07, Tristan Vittorio wrote: > > The spellchecker should be available in 1.2 release, your query is > incorrect, try the following: > > > http://localhost:8984/solr/select/?q=java&qt=spellchecker&termSourceField=title_text&cmd=rebuild > > the 'q' parameter must only contain the word being checked; you must > specify > the field separately. You can set "termSourceField" in your > solrconfig.xmlfile so you do not need to explicitly set it each time > you want to run a > spell check query. Also make sure your field isn't heavily processed (i.e. > with porter stemmer analyzers) otherwise the suggestions will look a bit > weird / mangled. Take a look at the wiki page for more info: > > http://wiki.apache.org/solr/SpellCheckerRequestHandler > > cheers, > Tristan > > > > On 7/9/07, climbingrose wrote: > > > > Hi Tristan, > > > > Is this spellchecker available in 1.2 release or I have to build the > > trunk. > > I tried your instructions but Solr returns nothing: > > > > > > > http://localhost:8984/solr/select/?q=title_text:java&qt=spellchecker&cmd=rebuild > > > > Result: > > > > > > > > 0 > > 3 > > > > rebuild > > > > > > > > Thanks. > > > > > > On 7/8/07, Tristan Vittorio wrote: > > > > > > Hi Otis, > > > > > > I have written a draft wiki entry for the spell checker: > > > http://wiki.apache.org/solr/SpellCheckerRequestHandler > > > > > > I've learned that my initial observation about the suggestion ordering > > was > > > incorrect, it does in fact order the results by popularity (or term > > > frequency) of the word in the termSourceField, the problem I > experienced > > > was > > > caused by setting termSourceField to a field of type "text", which > > heavily > > > stemmed and analyzed the words. I found that using the > > StandardTokenizer > > > and StandardFilter and removing the PorterStemmer and LowerCaseFilter > > from > > > the field schema really improved the spell checker performance. > > > > > > I haven't included this info on the wiki page yet, I'll try to update > it > > > soon when I have a bit more time. > > > > > > cheers, > > > Tristan > > > > > > > > > > > > On 7/8/07, Otis Gospodnetic wrote: > > > > > > > > Tristan - good summary - want to copy that to the Solr Wiki? > > > > > > > > Thanks, > > > > Otis > > > > > > > > . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . > > > > Simpy -- http://www.simpy.com/ - Tag - Search - Share > > > > > > > > ----- Original Message ---- > > > > From: Tristan Vittorio > > > > To: solr-user@lucene.apache.org > > > > Sent: Saturday, July 7, 2007 1:51:15 AM > > > > Subject: Re: Spell Check Handler > > > > > > > > I couldn't find any documention on the spell check handler either > but > > > > found > > > > enough information from the solrconfig.xml file, simply search for > > > > "SpellCheckerRequestHandler" (online version here): > > > > > > > > > > > > > > http://svn.apache.org/repos/asf/lucene/solr/trunk/example/solr/conf/solrconfig.xml > > > > > > > > You can view the original development discussion from JIRA (not sure > > how > > > > helpful that will be for you though): > > > > https://issues.apache.org/jira/browse/SOLR-81 > > > > > > > > In a nutshell, the configuration parameters available are:: > > > > > > > > suggestionCount: determines how many spelling suggestions are > > returned. > > > > accuracy: a float value between 1.0 and 0.0 on how close the > suggested > > > > words > > > > should match the original word being checked. > > > > spellcheckerIndexDir and termSourceField: check solrconfig.xml for > a > > > full > > > > explanation. > > > > > > > > In order to use the spell checking hander for the first time, you > need > > > to > > > > explicitly build the spelling index with a sample query something > like > > > > this: > > > > > > > > > > > > > > http://localhost:8080/solr/select/?q=macrosoft&qt=spellchecker&cmd=rebuild > > > > > > > > Depending on how large you main index is, this rebuild operation > could > > > > take > > > > a while. Subsequent queries can omit '&cmd=rebuild' and will return > > > > results > > > > much faster: > > > > > > > > http://localhost:8080/solr/select/?q=macrosoft&qt=spellchecker > > > > > > > > The order of the suggestions returned seems to be based on the > > accuracy > > > > figure (i.e. how close it matches the original word). it would be > > great > > > to > > > > be able to sort these suggested results based on term frequency / > > > document > > > > frequency of the suggested word in the main index, since the most > > > accurate > > > > suggestion may not always be the most relevant. > > > > > > > > As far as I can tell there is currently no way of doing this using > the > > > > spellchecker handler alone (you could always run seperate standard > > > queries > > > > on each word suggestion and order by numDocs, but that would be very > > > > inefficient), has anybody else tried to achieve this? > > > > > > > > cheers, > > > > Tristan > > > > > > > > > > > > > > > > On 7/7/07, Andrew Nagy wrote: > > > > > > > > > > Hello, is there any documentation on how to use the new spell > check > > > > > module? > > > > > > > > > > Thanks > > > > > Andrew > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > -- > > Regards, > > > > Cuong Hoang > > > -- Regards, Cuong Hoang ------=_Part_147881_30441947.1183976442016--