Return-Path: X-Original-To: apmail-lucene-dev-archive@www.apache.org Delivered-To: apmail-lucene-dev-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id D504D9305 for ; Wed, 8 Feb 2012 11:25:22 +0000 (UTC) Received: (qmail 24299 invoked by uid 500); 8 Feb 2012 11:25:21 -0000 Delivered-To: apmail-lucene-dev-archive@lucene.apache.org Received: (qmail 24253 invoked by uid 500); 8 Feb 2012 11:25:20 -0000 Mailing-List: contact dev-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@lucene.apache.org Delivered-To: mailing list dev@lucene.apache.org Received: (qmail 24245 invoked by uid 99); 8 Feb 2012 11:25:20 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 08 Feb 2012 11:25:20 +0000 X-ASF-Spam-Status: No, hits=-2000.0 required=5.0 tests=ALL_TRUSTED,T_RP_MATCHES_RCVD X-Spam-Check-By: apache.org Received: from [140.211.11.116] (HELO hel.zones.apache.org) (140.211.11.116) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 08 Feb 2012 11:25:18 +0000 Received: from hel.zones.apache.org (hel.zones.apache.org [140.211.11.116]) by hel.zones.apache.org (Postfix) with ESMTP id 2D1CC1AAED4 for ; Wed, 8 Feb 2012 11:24:57 +0000 (UTC) Date: Wed, 8 Feb 2012 11:24:57 +0000 (UTC) From: "Michael McCandless (Commented) (JIRA)" To: dev@lucene.apache.org Message-ID: <675400261.14274.1328700297186.JavaMail.tomcat@hel.zones.apache.org> In-Reply-To: <284712554.5923.1328578859871.JavaMail.tomcat@hel.zones.apache.org> Subject: [jira] [Commented] (SOLR-3104) Bad performance with distributed search when sort contains relevancy queries MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 X-Virus-Checked: Checked by ClamAV on apache.org [ https://issues.apache.org/jira/browse/SOLR-3104?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13203487#comment-13203487 ] Michael McCandless commented on SOLR-3104: ------------------------------------------ Shouldn't we commit this fix on 3.x as well...? Is there any reason not to? Sounds like a nasty bug, and it sounds like XJ has applied the patch on 3.x successfully. > Bad performance with distributed search when sort contains relevancy queries > ---------------------------------------------------------------------------- > > Key: SOLR-3104 > URL: https://issues.apache.org/jira/browse/SOLR-3104 > Project: Solr > Issue Type: Improvement > Components: search > Affects Versions: 3.6 > Reporter: XJ Wang > Priority: Critical > Fix For: 4.0 > > Attachments: SOLR-3104.patch > > > So I found this issue when trying out distributed search with solr 3.5 and noticed big performance degradation for some queries comparing to the single box search. > After some query analysis and comparison, it turns out that shard queries with "fsv=true" are much slower than the same queries w/o "fsv=true". Some examples are like 1200ms vs 200ms (start=0, rows=30, hits<100). > From the discussions with Yonik Seeley on solr mailing list, it may due to fact that I'm using lot of relevancy queries in sorting. But Solr is not retrieving those sort values efficiently . > This is critical for us and prevents us from moving to distributed search. I believe users like our scenarios will also suffer from this issue. Any patch/idea is welcomed. > Quote from Yonik Seeley on solr-user mailing list: > "OK, so basically it's slow because functions with embedded relevancy > queries are "forward only" - if you request the value for a docid > previous to the last, we need to reboot the query (re-weight, ask for > the scorer, etc). This means that for your 30 documents, that will > require rebooting the query about 15 times (assuming that roughly half > of the time the next docid will be less than the previous one). > Unfortunately there's not much you can do externally... we need to > implement optimizations at the Solr level for this." -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org For additional commands, e-mail: dev-help@lucene.apache.org