From: Christian Reuschling
To: java-user@lucene.apache.org
Date: Wed, 12 Mar 2014 18:41:13 +0100
Subject: create a Filter/DocIdSet from a number of documents

I have a small set of document numbers, the result of a query collected with a non-scoring collector. I now want to run fast successive queries restricted to this set of documents, as part of a customized Similarity implementation (a modified, dynamic idf computed over the scope of the query result, known as TF-ICF).

After reading the user list and looking around further, I decided to try the approach with a filter, i.e. IndexSearcher.search(Query, Filter, HitCountCollector), where the filter limits the scope to the initially determined list of document numbers. I hope this will perform very well, because it avoids reading information that does not reside in memory, as e.g. a TermFrequenciesVector approach would require.

So far so good, but I can't find a clean, easy way to create this Filter/DocIdSet from my simple list of document numbers. I know it sounds trivial. What am I not seeing? :)

Thanks so much!

Christian
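
To make the question concrete, here is a minimal JDK-only sketch of the step that seems to be missing. It assumes Lucene 4.x behaviour, where Filter.getDocIdSet(AtomicReaderContext, Bits) is called once per index segment and expects segment-relative document numbers, so a list of top-level document numbers has to be rebased by each segment's docBase before it becomes a bit set. The names SegmentDocIdSets, perSegment and segmentOf are hypothetical, and java.util.BitSet only stands in for a Lucene bit set such as FixedBitSet; this illustrates the rebasing idea and is not the Lucene API itself.

```java
import java.util.BitSet;

/**
 * Hypothetical helper (not part of Lucene): splits a list of top-level
 * document numbers into one bit set per segment, using segment-relative
 * numbering. java.util.BitSet is a stand-in for a Lucene bit set.
 */
final class SegmentDocIdSets {

    private SegmentDocIdSets() {}

    /**
     * @param globalDocIds top-level document numbers (any order, duplicates allowed)
     * @param docBases     first top-level document number of each segment, ascending
     * @param maxDocs      number of documents in each segment
     * @return one BitSet per segment; bit i is set if segment-local doc i is in the list
     */
    static BitSet[] perSegment(int[] globalDocIds, int[] docBases, int[] maxDocs) {
        if (docBases.length == 0 || docBases.length != maxDocs.length) {
            throw new IllegalArgumentException("need one docBase and one maxDoc per segment");
        }
        BitSet[] sets = new BitSet[docBases.length];
        for (int s = 0; s < sets.length; s++) {
            sets[s] = new BitSet(maxDocs[s]);
        }
        for (int docId : globalDocIds) {
            int s = segmentOf(docId, docBases);
            int local = docId - docBases[s]; // rebase to segment-relative numbering
            if (local < 0 || local >= maxDocs[s]) {
                throw new IllegalArgumentException("document number out of range: " + docId);
            }
            sets[s].set(local);
        }
        return sets;
    }

    /** Index of the last segment whose docBase is <= docId (binary search). */
    static int segmentOf(int docId, int[] docBases) {
        int lo = 0;
        int hi = docBases.length - 1;
        while (lo < hi) {
            int mid = (lo + hi + 1) >>> 1;
            if (docBases[mid] <= docId) {
                lo = mid;
            } else {
                hi = mid - 1;
            }
        }
        return lo;
    }
}
```

In an actual Filter, the bit set belonging to the segment passed to getDocIdSet would presumably be looked up (for example via the context's ord field) and returned as whatever DocIdSet type the Lucene version in use provides.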