From: Erick Erickson
Date: Mon, 16 Jan 2017 09:45:38 -0800
Subject: Re: question
To: java-user
archived-at: Mon, 16 Jan 2017 17:46:33 -0000

"It depends". I'm assuming that your case 1 is intended to be phrase searches, whereas case 2 is just boolean (specifically, with AND as the operator).

So, within 1 (assuming phrase queries), the results should NOT be the same. That is, "sas institute" (as a phrase) should not return the same results as "institute sas", _unless_ a "slop factor" has been specified, which may be applied internally. "Slop" (or, under the covers, Span queries) allows out-of-order phrases.

I would expect the two queries in 2 to return the same results.

Whether you should get the same results from 1 as from 2 depends on several things:
a> whether the default operator is AND
b> whether the phrase queries specify a slop
c> whether other words are in between, e.g. "institute something something sas"

Perhaps the best way to see what's going on would be to turn on highlighting and see whether the returned documents make sense.

Best,
Erick

On Mon, Jan 16, 2017 at 7:48 AM, Julius Kravjar wrote:
> May I ask one question? One company - we used their software - told us that
> in Lucene it is normal that the search results for
>
> 1.
> "sas institute"
> "institute sas"
> are the same.
>
> 2.
> sas institute
> institute sas
> are the same
>
> 3.
> the number of results for "sas institute" is smaller than for sas institute
> (analogously, "institute sas" is smaller than institute sas)
>
> Should we believe them? Many thanks in advance.
>
> Best regards
>
> J. Kravjar
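
The difference between the phrase case (1) and the boolean AND case (2) discussed in the reply can be illustrated with a toy model. This is only a sketch in plain Java: it does not use Lucene's API, the class and method names (`PhraseVsAnd`, `tokenize`, `phraseMatch`, `andMatch`) are made up for illustration, the whitespace tokenizer merely stands in for a real analyzer, and only exact phrases (no slop) are modeled.

```java
import java.util.Arrays;
import java.util.Collections;
import java.util.List;

// Toy model only: this is NOT Lucene code and does not use Lucene's API.
class PhraseVsAnd {

    // Lowercase and split on whitespace; a stand-in for a real analyzer.
    static List<String> tokenize(String text) {
        return Arrays.asList(text.toLowerCase().trim().split("\\s+"));
    }

    // Exact phrase match (no slop): the terms must be adjacent and in order.
    static boolean phraseMatch(String doc, String phrase) {
        return Collections.indexOfSubList(tokenize(doc), tokenize(phrase)) >= 0;
    }

    // Boolean AND match: every term must occur somewhere; order is irrelevant.
    static boolean andMatch(String doc, String query) {
        return tokenize(doc).containsAll(tokenize(query));
    }

    public static void main(String[] args) {
        String[] docs = {
            "SAS Institute released a report",
            "the institute something something sas",
            "institute sas"
        };
        String[] queries = { "sas institute", "institute sas" };
        for (String doc : docs) {
            System.out.println(doc);
            for (String q : queries) {
                System.out.println("  phrase \"" + q + "\": " + phraseMatch(doc, q)
                        + ", AND " + q + ": " + andMatch(doc, q));
            }
        }
    }
}
```

In this model the two AND queries always agree with each other, while the two exact phrases match different documents. Any document that matches the exact phrase also matches the AND query, but not the other way around, which is consistent with a phrase query returning fewer hits than the unquoted query when the default operator is AND.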