lucene-general mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Uwe Schindler" <...@thetaphi.de>
Subject RE: Document Frequency for a set of documents
Date Fri, 05 Feb 2010 09:40:20 GMT
How about having more than one index, so one for each user? If you want to do a search on all
use a MultiReader on all separate indexes? If you want only serach on a subset use the corresponding
index' IndexReader instead.

-----
Uwe Schindler
H.-H.-Meier-Allee 63, D-28213 Bremen
http://www.thetaphi.de
eMail: uwe@thetaphi.de

> -----Original Message-----
> From: lionel duboeuf [mailto:lionel.duboeuf@boozter.com]
> Sent: Friday, February 05, 2010 10:27 AM
> To: general@lucene.apache.org
> Subject: Document Frequency for a set of documents
> 
> Hi,
> 
> Sorry for asking again, **I still have not found a scalable solution to
> get the document frequency of a term t according a set of documents.
> Lucene only store the document frequency for the global corpus, but i
> would like to be able to get the document frequency of a term according
> only to a subset of documents (i.e. a user's collection of documents).
> 
> I guess that querying the index to get the number of hits for each term
> and for each field,  filtered by a user will be to slow.
> Any idea ?
> 
> 
> regards,
> 
> Lionel
> 
> *
> *
> 



Mime
View raw message