Return-Path: Delivered-To: apmail-lucene-java-user-archive@www.apache.org Received: (qmail 15906 invoked from network); 25 Nov 2010 08:55:39 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 25 Nov 2010 08:55:39 -0000 Received: (qmail 80989 invoked by uid 500); 25 Nov 2010 08:55:37 -0000 Delivered-To: apmail-lucene-java-user-archive@lucene.apache.org Received: (qmail 80742 invoked by uid 500); 25 Nov 2010 08:55:36 -0000 Mailing-List: contact java-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-user@lucene.apache.org Delivered-To: mailing list java-user@lucene.apache.org Received: (qmail 80734 invoked by uid 99); 25 Nov 2010 08:55:36 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 25 Nov 2010 08:55:36 +0000 X-ASF-Spam-Status: No, hits=0.7 required=10.0 tests=FREEMAIL_FROM,MIME_QP_LONG_LINE,RCVD_IN_DNSWL_NONE,RFC_ABUSE_POST,SPF_NEUTRAL,UNPARSEABLE_RELAY X-Spam-Check-By: apache.org Received-SPF: neutral (nike.apache.org: local policy) Received: from [121.101.151.225] (HELO nm3.bullet.mail.in.yahoo.com) (121.101.151.225) by apache.org (qpsmtpd/0.29) with SMTP; Thu, 25 Nov 2010 08:55:28 +0000 Received: from [121.101.151.236] by nm3.bullet.mail.in.yahoo.com with NNFMP; 25 Nov 2010 08:55:08 -0000 Received: from [121.101.151.235] by tm1.bullet.mail.in.yahoo.com with NNFMP; 25 Nov 2010 08:55:11 -0000 Received: from [127.0.0.1] by omp1004.mail.in.yahoo.com with NNFMP; 25 Nov 2010 08:55:35 -0000 X-Yahoo-Newman-Id: 559083.84094.bm@omp1004.mail.in.yahoo.com Received: (qmail 87797 invoked from network); 25 Nov 2010 08:55:06 -0000 DomainKey-Signature: a=rsa-sha1; q=dns; c=nofws; s=s1024; d=yahoo.co.in; h=DKIM-Signature:Received:X-Yahoo-SMTP:X-YMail-OSG:X-Yahoo-Newman-Property:Message-ID:From:To:References:Subject:Date:MIME-Version:Content-Type:Content-Transfer-Encoding:X-Priority:X-MSMail-Priority:X-Mailer:X-MimeOLE; b=qivFEj02MG6CfiGt4t50zQT1BMZUSQaZ4cXOH9XgwGDOvgTTfZY29DrS6Os1cBlTvNjU2jweJPNvgFcEFdrF/Ihv8UtPoXmxxXKF9I+7doKQmGDi0x2hLl4FvXUzqj0rvNAqqKHLqpX/IsfUufqYsAFrRxff07Fx4DudlX6k/aI= ; DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=yahoo.co.in; s=s1024; t=1290675306; bh=HZDmRU5EEySoJESNahg+6k3gtnYPEFgHv6WydvIGnqk=; h=Received:X-Yahoo-SMTP:X-YMail-OSG:X-Yahoo-Newman-Property:Message-ID:From:To:References:Subject:Date:MIME-Version:Content-Type:Content-Transfer-Encoding:X-Priority:X-MSMail-Priority:X-Mailer:X-MimeOLE; b=Nwwyxobgf4Phn/u9ipM0TxEagmR9PG9DqHXag+bqH1YGmCzUt3Aj6UztyY9/UrfjmEQJUK1Nli6VQ88+cY7iiFbHGCECMWjX//dkxAVnOmyC8nZ2WRfLyjb7YmnEHxOVl5E/Ubg5mi8EJJlZ6lC/yBsjG+7PeXL4mP3OpPZZLko= Received: from GaneshM (emailgane@121.244.159.130 with login) by smtp105.mail.in.yahoo.com with SMTP; 25 Nov 2010 14:25:05 +0530 IST X-Yahoo-SMTP: JObyHkuswBBrNSLZp.Ycd7Boqpr_GQ-- X-YMail-OSG: qQM3gYAVM1mdWTVuWrboTBkIIZ4VpZnkbtBrOBvH_C7q6yv JkxQi9_OHKXGUNiKzZFe_gcC9CcnijblD_vVDZoEm3x9E4xD1uH2XWmvKQOX cjOj4JNM1j.gAaODbmEpuCI5GeVyvgGTaIZQkNS1JRG7.6kmWQIVvFICgcK3 ntD.3Knrlj2EKVnuWnCBqd7E8ftXmy9qrmNoD0..PMS8_OTdoQDAmLpS3G4j _Fk05vdzrl4L_6Kk4heUGUmSkV6ZH4Lr2K8najP8PN1P7NQRUrQ-- X-Yahoo-Newman-Property: ymail-3 Message-ID: <565726F0A1EF4793BDFB7BAC5A44CB81@sv.us.sonicwall.com> From: "Ganesh" To: References: <00e401cb8a60$f390d5c0$dab28140$@thetaphi.de> <100E98B677A547C3BE16871066C3614D@sv.us.sonicwall.com> <001301cb8c76$8af2d230$a0d87690$@thetaphi.de> Subject: Re: best practice: 1.4 billions documents Date: Thu, 25 Nov 2010 14:25:05 +0530 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Priority: 3 X-MSMail-Priority: Normal X-Mailer: Microsoft Outlook Express 6.00.2900.5931 X-MimeOLE: Produced By Microsoft MimeOLE V6.00.2900.5931 X-Virus-Checked: Checked by ClamAV on apache.org Thanks for the input. My results are sorted by date and i am not much bothered about score. = Will i still be in trouble? Regards Ganesh ----- Original Message -----=20 From: "Robert Muir" To: Sent: Thursday, November 25, 2010 1:45 PM Subject: Re: best practice: 1.4 billions documents On Thu, Nov 25, 2010 at 2:58 AM, Uwe Schindler wrote: > ParallelMultiSearcher as subclass of MultiSearcher has the same = problems. These are not crashes, but more that some queries do not = return correct scored results for some queries. This effects especially = all MultiTermQueries (TermRange, Fuzzy, NumericRange, Wildcard, Prefix) = if they are used in a negative fashion (using MUST_NOT resp. "-" in = QueryParser). For all of those queries except Fuzzy, you are safe if you = use CONSTANT_SCORE_REWRITE_METHOD (using setRewriteMethod). The same = problems apply for span queries. For *all* Fuzzy Queries (negative or = not), the scores are simply wrong and so scoring is broken with = (Parallel)MultiSearcher; wrong results are only returned when negative = clauses! > you can use constant score rewrite method with fuzzy, too. then it will work "correctly" (even negative) with multisearcher too. but it will be slow, with unbounded number of results, and the fuzziness will not affect the scoring. (this is what constant score rewrite implies) the reason i say "correctly" is that for all of these queries, constant score rewrite is just a general workaround, and might still be incorrect. This is because many queries often have special cases where they rewrite to simpler things and in general the MultiSearcher combine() logic is broken here, so there might be more problems. > A new class ParallelIndexSearcher could help with that, when it = parallelizes multiple segments, this is still in planning phase. The = difference to ParallelMultiSearcher would be that it takes a "single" = IndexReader (e.g. a MultiReader in your case) and parallelizes per = segment/segment bunches. > Besides the inherited broken-ness from multisearcher, parallelmultisearcher is broken further because it requires you to organize your index structure in a special way to get concurrency. This is all pretty silly though, since ParallelMultiSearcher on a single machine isn't going to increase QPS, so how useful really is it in general??? we should deprecate both the broken Multi & ParallelMulti Searchers and never look back. --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org For additional commands, e-mail: java-user-help@lucene.apache.org Send free SMS to your Friends on Mobile from your Yahoo! Messenger. Download Now! http://messenger.yahoo.com/download.php --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org For additional commands, e-mail: java-user-help@lucene.apache.org