Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id 83791200BD4 for ; Thu, 1 Dec 2016 23:08:12 +0100 (CET) Received: by cust-asf.ponee.io (Postfix) id 821F6160B0B; Thu, 1 Dec 2016 22:08:12 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id C7F48160B05 for ; Thu, 1 Dec 2016 23:08:11 +0100 (CET) Received: (qmail 75675 invoked by uid 500); 1 Dec 2016 22:08:10 -0000 Mailing-List: contact java-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-user@lucene.apache.org Delivered-To: mailing list java-user@lucene.apache.org Received: (qmail 75663 invoked by uid 99); 1 Dec 2016 22:08:09 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd3-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 01 Dec 2016 22:08:09 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd3-us-west.apache.org (ASF Mail Server at spamd3-us-west.apache.org) with ESMTP id 3084C180335 for ; Thu, 1 Dec 2016 22:08:09 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd3-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: -0.2 X-Spam-Level: X-Spam-Status: No, score=-0.2 tagged_above=-999 required=6.31 tests=[DKIM_SIGNED=0.1, DKIM_VALID=-0.1, RCVD_IN_DNSWL_LOW=-0.7, RCVD_IN_MSPIKE_H2=-0.001, RCVD_IN_SORBS_SPAM=0.5, URIBL_BLOCKED=0.001] autolearn=disabled Authentication-Results: spamd3-us-west.apache.org (amavisd-new); dkim=pass (2048-bit key) header.d=mikemccandless-com.20150623.gappssmtp.com Received: from mx1-lw-eu.apache.org ([10.40.0.8]) by localhost (spamd3-us-west.apache.org [10.40.0.10]) (amavisd-new, port 10024) with ESMTP id sVrPmQspb3Sv for ; Thu, 1 Dec 2016 22:08:05 +0000 (UTC) Received: from mail-io0-f174.google.com (mail-io0-f174.google.com [209.85.223.174]) by mx1-lw-eu.apache.org (ASF Mail Server at mx1-lw-eu.apache.org) with ESMTPS id B9E635F47D for ; Thu, 1 Dec 2016 22:08:04 +0000 (UTC) Received: by mail-io0-f174.google.com with SMTP id a124so449914701ioe.2 for ; Thu, 01 Dec 2016 14:08:04 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=mikemccandless-com.20150623.gappssmtp.com; s=20150623; h=mime-version:in-reply-to:references:from:date:message-id:subject:to; bh=SzJ4id8AEmMoQJIIV1Mg1q0Ly3ibtC3UCAE/yUkOmBU=; b=TsB5R0oBRqb2HaIMQdsXX5cziUSqdF6SaSQ1HlVVwlkjl/JzcUZM8M7Dz+jDBpGQSu bkVh/9nRgjYzHXYVvQmlPaEJGXC3DlnP8dklfWpSpAAfwM/1WWDLLjAtvhTLJEb/Fgxl QzEDfMZtoiEzple2At2eF5Al3Rrq8tN8BQCmEq6vH7EDlDQX5xwEvYCtUm+UgyCfh7z/ LfIeOM/mxHmkXJrahN9H8P/+gEUX3Iu1fXlT6niV4g51ErpwP2l/0eXy8/s42AkkGwrs HYOAoONB/a9jAvoev1YUoYOThN0rW2reNiba34v34h4wCBEOJc9w5w4mbTvWGcLZoqEa xijA== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20130820; h=x-gm-message-state:mime-version:in-reply-to:references:from:date :message-id:subject:to; bh=SzJ4id8AEmMoQJIIV1Mg1q0Ly3ibtC3UCAE/yUkOmBU=; b=I2zzk+w1Od4YbXpdyycts5GfXGQdNFbIS74ayw2DZwzEvqBL76uaRtdue5n3YhKMqE VYeHchvVnqZKgfUGdLy1kri6JEi8MmP3nU9mV5eqzQVPu3mj2c4XBJLcEPQT41cjHinV 6UMNoE9CyhG9MHLOPcmfU2SC+TnXJAyM/cG4Zsgkzkj/xeq8LjOEO1toJuf80Hm3EhJu 2zEW5Ve8iTlgSIvSIy+6sq/PJfZfmoeV5ip6Mjrg15PEdZZWM41tJ77HlonTbp6VXPXV NGQAu4Qin51peDncwVdWcvZ40wTxhPDhvVOh8sh8r5Af1u0GREoCToZXmOnVw5iLsuOm LGTw== X-Gm-Message-State: AKaTC01lMGawc2LLp1KBhajfjA+tvDtD1rA6wvmp8CzcbEE2J+3dv88Cc3w9lOgaicljCgtwVIn7t3Oamf4tdg== X-Received: by 10.107.149.13 with SMTP id x13mr36311044iod.57.1480630083386; Thu, 01 Dec 2016 14:08:03 -0800 (PST) MIME-Version: 1.0 Received: by 10.107.162.136 with HTTP; Thu, 1 Dec 2016 14:07:42 -0800 (PST) In-Reply-To: References: From: Michael McCandless Date: Thu, 1 Dec 2016 17:07:42 -0500 Message-ID: Subject: Re: LeafCollector To: Lucene Users , Matt Hicks Content-Type: text/plain; charset=UTF-8 archived-at: Thu, 01 Dec 2016 22:08:12 -0000 Lucene used to have a DuplicateFilter to do this, but we removed it recently ... see https://issues.apache.org/jira/browse/LUCENE-6633 for some discussion as to why. Mike McCandless http://blog.mikemccandless.com On Thu, Dec 1, 2016 at 2:39 PM, Matt Hicks wrote: > I'm trying to write a LeafCollector that filters out duplicates for a > specific field. However, looking at the JavaDoc for `collect` it says not > to call `IndexSearch.doc` or `IndexReader.document`. How am I supposed to > determine the value of a field and then exclude it? --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org For additional commands, e-mail: java-user-help@lucene.apache.org