lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Christian Reuschling <christian.reuschl...@gmail.com>
Subject create a Filter/DocIdSet from a number of documents
Date Wed, 12 Mar 2014 17:41:13 GMT
-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

I have a small set of document numbers as a query result collected with some non-scoring collector.

Now, I want to send high-performant successive queries only in this document number scope,
as part
of a customized Similarity implementation (modified, dynamic idf on result query scope, known
as
TF-ICF)

For this, after reading the user list and further looking around, I decided to try the approach
with a filter, i.e. IndexSearcher.search(Query, Filter, HitCountCollector), whereby the filter
should limit the scope to the initially determined document number list.

I have the hope that this will perform very well, because there is no need for reading information
that doesn't last in memory, as e.g. with a TermFrequenciesVector approach.

So far so good - but I can't find a nice way to create this Filter/DocIdSet easily out of
my
simple document number list. I know it sounds trivial - what is it I can't see? :)

Thanks so much!

Christian
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v2.0.19 (GNU/Linux)
Comment: Using GnuPG with Thunderbird - http://www.enigmail.net/

iEYEARECAAYFAlMgnDkACgkQ6EqMXq+WZg9h5wCeOHo3bfXW6G0wYTy0x1Vnvruc
AtQAn2e6oSniN/ZQ7g6nhUeyvM/QErJ9
=Wyja
-----END PGP SIGNATURE-----

---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org


Mime
View raw message