From: Erick Erickson
Date: Fri, 13 Apr 2018 08:28:50 -0700
Subject: Re: 
Solr still gives old data while faceting from the deleted documents
To: solr-user

expungeDeletes won't do the trick for you. It only purges deleted documents from segments with > 10% deleted docs, so deleted documents will remain in the segments below that threshold.

I'd push back on "the requirement is to show facets with 0 count as disabled." Why? What use-case is satisfied here? Effectively this is saying "For my query, show me possible values that have no hits for that query".

Optimize is a very costly operation, and to really get this behavior you'll need to run it _every_ time the index changes. You really can't afford to run it for every update, so there'll be a period of time when you will still get these facets.

Eventually you won't be displaying zero-count facets anyway, assuming that you have room for, say, only 10 facets and sort by frequency.

If your index changes only periodically (say once a day), that may be fine. But if it changes more often than that, you won't be able to satisfy the requirement anyway.

My point is that requirements like this are often created without understanding the consequences, and they cause a lot of effort to be expended without a good purpose.

See: https://lucidworks.com/2017/10/13/segment-merging-deleted-documents-optimize-may-bad/

Best,
Erick

On Thu, Apr 12, 2018 at 10:32 PM, girish.vignesh wrote:
> mincount will fix this issue for sure. I have tried that but the requirement
> is to show facets with 0 count as disabled.
>
> I think I am left with only 2 options. Either go with expungeDeletes with update
> URL or use optimize in a scheduler.
>
> Regards,
> Vignesh
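
As a rough illustration of the 10% threshold mentioned in the reply above, here is a minimal sketch of why expungeDeletes can leave deleted documents behind. It is a simplified model, not Solr's actual merge-policy code: the `Segment` type, the function name, and the sample segment sizes are all hypothetical, and the only figure taken from the thread is the 10% cutoff.

```python
from dataclasses import dataclass


@dataclass
class Segment:
    """Hypothetical stand-in for an index segment (illustration only)."""
    name: str
    max_doc: int   # all documents in the segment, live plus deleted
    deleted: int   # documents marked as deleted but not yet purged

    @property
    def deleted_pct(self) -> float:
        # Share of the segment that is deleted, as a percentage.
        return 100.0 * self.deleted / self.max_doc if self.max_doc else 0.0


def deleted_docs_left_after_expunge(segments, threshold_pct=10.0):
    """Count deleted docs that survive, assuming only segments strictly
    above threshold_pct deleted are rewritten (simplified assumption)."""
    return sum(s.deleted for s in segments if s.deleted_pct <= threshold_pct)


# Made-up segments to show the effect of the cutoff.
segments = [
    Segment("_a", max_doc=1000, deleted=300),  # 30% deleted: rewritten
    Segment("_b", max_doc=1000, deleted=50),   # 5% deleted: left alone
    Segment("_c", max_doc=200, deleted=20),    # exactly 10%: not above the cutoff
]
remaining = deleted_docs_left_after_expunge(segments)
print(remaining)
```

In this model the deleted documents in `_b` and `_c` stay in the index, so terms that occur only in those documents can still show up as zero-count facet values.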