Return-Path:
Delivered-To: apmail-lucene-dev-archive@www.apache.org
Received: (qmail 14941 invoked from network); 17 Sep 2010 20:42:17 -0000
Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 17 Sep 2010 20:42:17 -0000
Received: (qmail 97674 invoked by uid 500); 17 Sep 2010 20:42:16 -0000
Delivered-To: apmail-lucene-dev-archive@lucene.apache.org
Received: (qmail 97544 invoked by uid 500); 17 Sep 2010 20:42:16 -0000
Mailing-List: contact dev-help@lucene.apache.org; run by ezmlm
Precedence: bulk
List-Help:
List-Unsubscribe:
List-Post:
List-Id:
Reply-To: dev@lucene.apache.org
Delivered-To: mailing list dev@lucene.apache.org
Received: (qmail 97537 invoked by uid 99); 17 Sep 2010 20:42:15 -0000
Received: from Unknown (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 17 Sep 2010 20:42:15 +0000
X-ASF-Spam-Status: No, hits=-2000.0 required=10.0 tests=ALL_TRUSTED
X-Spam-Check-By: apache.org
Received: from [140.211.11.22] (HELO thor.apache.org) (140.211.11.22) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 17 Sep 2010 20:42:14 +0000
Received: from thor (localhost [127.0.0.1]) by thor.apache.org (8.13.8+Sun/8.13.8) with ESMTP id o8HKfp1U019898 for ; Fri, 17 Sep 2010 20:41:52 GMT
Message-ID: <9599478.259861284756111765.JavaMail.jira@thor>
Date: Fri, 17 Sep 2010 16:41:51 -0400 (EDT)
From: "John Beck (JIRA)"
To: dev@lucene.apache.org
Subject: [jira] Commented: (SOLR-1316) Create autosuggest component
MIME-Version: 1.0
Content-Type: text/plain; charset=utf-8
Content-Transfer-Encoding: 7bit
X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394
X-Virus-Checked: Checked by ClamAV on apache.org

[ https://issues.apache.org/jira/browse/SOLR-1316?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12910777#action_12910777 ]

John Beck commented on SOLR-1316:
---------------------------------

Hey guys, really nice work on this, it is extremely fast! In production right now we're seeing 200-700ms latency, and with this I typically get between 1ms and 10ms.

I did find one issue though. I'm going to use this with a dictionary of 150k medical terms, and it works, except when my query happens to start with a popular word.

If I use this as a dictionary,

{noformat}
Hepatitis B Viruses, Duck
Hepatitis B e Antigens
Hepatitis B virus
Hepatitis B, Chronic
Hepatitis Be Antigens
Hepatitis C
Hepatitis C Antibodies
Hepatitis C Antigen
{noformat}

And then search for Hepatitis C,

{noformat}
curl "http://localhost:8982/solr/suggest/?spellcheck=true&spellcheck.dictionary=suggest&spellcheck.extendedResults=true&spellcheck.count=5&q=Hepatitis%20C"

01build509
hepatitis b e antigens
hepatitis b virus
hepatitis b viruses, duck
hepatitis b, chronic
hepatitis be antigens
{noformat}

You can see it never makes it to Hepatitis C, since that is #6 in the dictionary and I'm limiting the results to 5. When I bump spellcheck.count to 6, I get the very first Hepatitis C result but not the rest.

The same thing happens at a larger scale: about 2500 terms start with "Receptor", and I don't want to have to bump the count to 3000 results. Is there anything else that can be done?
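
The behaviour reported above can be reproduced outside Solr with a small sketch. This is not the suggester code from the patch. It is a plain sorted-list prefix lookup, and it rests on one assumption that this message does not confirm: that the lookup only receives the first whitespace-separated token of the query ("Hepatitis") as its prefix. The names TERMS and suggest are made up for this illustration.

```python
from bisect import bisect_left

# Dictionary from the comment, lowercased and sorted.
# Lowercasing and sorting are assumptions based on the response shown above.
TERMS = sorted(
    term.lower()
    for term in [
        "Hepatitis B Viruses, Duck",
        "Hepatitis B e Antigens",
        "Hepatitis B virus",
        "Hepatitis B, Chronic",
        "Hepatitis Be Antigens",
        "Hepatitis C",
        "Hepatitis C Antibodies",
        "Hepatitis C Antigen",
    ]
)


def suggest(prefix, count):
    """Return up to `count` terms that start with `prefix`, in sorted order."""
    prefix = prefix.lower()
    # Binary search for the first term that is >= the prefix.
    start = bisect_left(TERMS, prefix)
    matches = []
    for term in TERMS[start:]:
        if not term.startswith(prefix) or len(matches) == count:
            break
        matches.append(term)
    return matches


# Prefix cut at the first space (the assumed behaviour): only "B" terms fit in 5.
print(suggest("Hepatitis C".split()[0], 5))
# Whole phrase used as the prefix: the three "Hepatitis C" terms are returned.
print(suggest("Hepatitis C", 5))
```

Under that assumption the first call returns the same five "hepatitis b..." entries as the curl response, and the second call returns only the "hepatitis c" entries. If the real cause is different, the sketch still shows why raising spellcheck.count is the only way to reach later entries when the prefix is too short.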

> Create autosuggest component
> ----------------------------
>
> Key: SOLR-1316
> URL: https://issues.apache.org/jira/browse/SOLR-1316
> Project: Solr
> Issue Type: New Feature
> Components: search
> Affects Versions: 1.4
> Reporter: Jason Rutherglen
> Assignee: Andrzej Bialecki
> Priority: Minor
> Fix For: Next
>
> Attachments: SOLR-1316.patch, SOLR-1316.patch, SOLR-1316.patch, SOLR-1316.patch, SOLR-1316.patch, SOLR-1316.patch, SOLR-1316.patch, SOLR-1316.patch, SOLR-1316.patch, SOLR-1316.patch, SOLR-1316.patch, SOLR-1316_3x-2.patch, SOLR-1316_3x.patch, suggest.patch, suggest.patch, suggest.patch, TST.zip
>
> Original Estimate: 96h
> Remaining Estimate: 96h
>
> Autosuggest is a common search function that can be integrated
> into Solr as a SearchComponent. Our first implementation will
> use the TernaryTree found in Lucene contrib.
> * Enable creation of the dictionary from the index or via Solr's
> RPC mechanism
> * What types of parameters and settings are desirable?
> * Hopefully in the future we can include user click through
> rates to boost those terms/phrases higher

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org