Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id C654B200CF0 for ; Thu, 7 Sep 2017 10:47:57 +0200 (CEST) Received: by cust-asf.ponee.io (Postfix) id C4DD71609DB; Thu, 7 Sep 2017 08:47:57 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id 18EBD1609BF for ; Thu, 7 Sep 2017 10:47:56 +0200 (CEST) Received: (qmail 35959 invoked by uid 500); 7 Sep 2017 08:47:55 -0000 Mailing-List: contact java-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-user@lucene.apache.org Delivered-To: mailing list java-user@lucene.apache.org Received: (qmail 35947 invoked by uid 99); 7 Sep 2017 08:47:55 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd1-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 07 Sep 2017 08:47:55 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd1-us-west.apache.org (ASF Mail Server at spamd1-us-west.apache.org) with ESMTP id BB28BD1C46 for ; Thu, 7 Sep 2017 08:47:54 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd1-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 0.38 X-Spam-Level: X-Spam-Status: No, score=0.38 tagged_above=-999 required=6.31 tests=[DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, RCVD_IN_DNSWL_NONE=-0.0001, RCVD_IN_MSPIKE_H3=-0.01, RCVD_IN_MSPIKE_WL=-0.01, RCVD_IN_SORBS_SPAM=0.5, SPF_PASS=-0.001, URIBL_BLOCKED=0.001] autolearn=disabled Authentication-Results: spamd1-us-west.apache.org (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com Received: from mx1-lw-us.apache.org ([10.40.0.8]) by localhost (spamd1-us-west.apache.org [10.40.0.7]) (amavisd-new, port 10024) with ESMTP id jeuv9XHiTe2S for ; Thu, 7 Sep 2017 08:47:50 +0000 (UTC) Received: from mail-wm0-f65.google.com (mail-wm0-f65.google.com [74.125.82.65]) by mx1-lw-us.apache.org (ASF Mail Server at mx1-lw-us.apache.org) with ESMTPS id D00335FB9F for ; Thu, 7 Sep 2017 08:47:49 +0000 (UTC) Received: by mail-wm0-f65.google.com with SMTP id 187so906915wmn.0 for ; Thu, 07 Sep 2017 01:47:49 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=from:content-transfer-encoding:mime-version:subject:date:references :to:in-reply-to:message-id; bh=Zv9by67FF+P8oDNvf3tXH/FE2ZRk/chxNFXUF6KJH7c=; b=eSO/IF1QOENm66lB6tqf2l/tHDqI/GkIQFkLYd5g267UAJ7DT/bLij53mvXzRoXTIx +IM79Oxp4ETTZ3WwsY0U3V75eMjb2w4WVV1MQ+lsJdV3J9a5Oie6e2n7mgvY/nmR5s9O sHDSauArEsxzNdxkl2vWVB6xSng5cDPYtbQH14DlFcgrOJEkr9EXXfN483WV0nXMqVvR qT6hj/bHUzw0o3DDNfZ+H/Z4O7Ib6eX9Mcnb3aDtGGoCyM56NyOMAUxppW7WiDL2MmIz vhX0P8oC9SlDgzRm04Pwig6Q869NfxdPNVif/hih4RZUtZvzon1/fZi6LVhHxguFj5tB q03Q== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:from:content-transfer-encoding:mime-version :subject:date:references:to:in-reply-to:message-id; bh=Zv9by67FF+P8oDNvf3tXH/FE2ZRk/chxNFXUF6KJH7c=; b=bCiiAjr7IImba+ba1KwjR+37rzOsfFDypjruJPkitOsY2iueWjPJM3+pSWY8lMeUxJ B7SPtKY8mOJ6vtvytdYNxQV3wPmU2E0bad3gkp6duiwdfErB+6FBGb/jgdNedHPf6VMa tH7UvcvlTjc3zK1bIhhOA07x/ILi6fiJH3ZHwcROnw3+vcZa6uAnhTuLzebWT4wyIvxG tlOQDSSXo+FXXOIvTUjEAeHngFqrdUVMYk4VqvWAQRuJg6bM6v8EeMVwiPHUxotKSA7D QN4qaDyM0HOkwsng0uM510jp1n/9Zius5mBrOg4lcslR6FqofuNLx2hZYXstpMOKHnaV gQ4Q== X-Gm-Message-State: AHPjjUg2225vjryGwTGc6jn/B4YXph/BEphRknqwN2WT/eg9mgg6HsoD m2PqI341btmJ9KAYTFk= X-Google-Smtp-Source: ADKCNb50fP6Ek5JVw0Imd24PHCqVjg3mRjUL/8fy8Y8GY4p1zH45h0OpTIIngpjOki5TJO5MKd0lDQ== X-Received: by 10.28.6.149 with SMTP id 143mr1737256wmg.114.1504774062462; Thu, 07 Sep 2017 01:47:42 -0700 (PDT) Received: from [192.168.0.101] (cpc2-popl3-2-0-cust186.13-2.cable.virginm.net. [86.21.240.187]) by smtp.gmail.com with ESMTPSA id m86sm53124wmi.27.2017.09.07.01.47.41 for (version=TLS1_2 cipher=ECDHE-RSA-AES128-GCM-SHA256 bits=128/128); Thu, 07 Sep 2017 01:47:41 -0700 (PDT) From: Jean Claude van Johnson Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable Mime-Version: 1.0 (Mac OS X Mail 10.3 \(3273\)) Subject: Re: What is the fastest way to loop over all documents in an index? Date: Thu, 7 Sep 2017 09:47:40 +0100 References: To: java-user@lucene.apache.org In-Reply-To: Message-Id: X-Mailer: Apple Mail (2.3273) archived-at: Thu, 07 Sep 2017 08:47:58 -0000 Many thanks for your answers! Cheers Claude > On 5 Sep 2017, at 21:54, Michael McCandless = wrote: >=20 > You can call MultiFields.getLiveDocs(IndexReader) to get the bitset > identifying which documents are not deleted. >=20 > Mike McCandless >=20 > http://blog.mikemccandless.com >=20 > On Tue, Sep 5, 2017 at 2:54 PM, Mikhail Khludnev = wrote: >=20 >> You can call searcher.search() with MatchAlldocsQuery passing own = collector >> impl which will be notified about every non-deleted doc via = collect(docId). >>=20 >> On Tue, Sep 5, 2017 at 3:09 AM, Jean Claude van Johnson < >> vanjohnsonjeanclaude@gmail.com> wrote: >>=20 >>> Hi there, >>>=20 >>> I have an use case, were I need to iterate over all documents in an = index >>> from time to time. >>> It seems that the MatchAllDocsQuery is what I should use for this, >> however >>> it creates a bunch of Objects (Score etc) that I don=E2=80=99t = really need. >>>=20 >>> My question to you is: >>>=20 >>> What is the fastest way to loop over all documents in an index? >>> Is it looping over all possible doc id=E2=80=99s (+filtering out = deleted >>> documents)? >>>=20 >>> Thank you very much. >>>=20 >>> Best regards >>> Claude >>>=20 >>>=20 >>=20 >>=20 >> -- >> Sincerely yours >> Mikhail Khludnev >>=20 --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org For additional commands, e-mail: java-user-help@lucene.apache.org