Reply-To: java-user@lucene.apache.org
Date: Sat, 11 Oct 2008 12:36:04 +0200
To: "java-user@lucene.apache.org" <java-user@lucene.apache.org>
Subject: Retrieving Top Terms for a subset of the index (or for all results of
 a query)
From: "Aleksander M. Stensby" <aleksander.stensby@integrasco.no>
Organization: Integrasco A/S
Content-Type: text/plain; format=flowed; delsp=yes; charset=utf-8
MIME-Version: 1.0
Content-Transfer-Encoding: 7bit
Message-ID: <op.uiuvqekvoftp72@melkor>
User-Agent: Opera Mail/9.60 (Linux)

Hello everyone. I've been fiddling with the idea of retrieving the top
terms from a subset of the index (i.e. the top terms from the documents
retrieved by a given search). This could, for instance, be useful for
identifying the top-ranking terms within a given date span.

It would be something like getting the top 50 terms (as you can do with
Luke), but instead of doing it for the full index, I would like to do the
same after applying a filter or a query. I don't know if this is a
bad explanation or whether it makes any sense at all...
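
One way to picture the operation described above is the rough sketch
below. It is plain Java over a toy in-memory inverted index, not the
Lucene API: TopTermsSketch, TermCount, topTerms, postings and matches are
all hypothetical names made up for illustration. It also assumes that
"top" means the terms contained in the largest number of matching
documents.

```java
import java.util.ArrayList;
import java.util.BitSet;
import java.util.List;
import java.util.Map;

// Toy illustration only: plain JDK collections, not the Lucene API.
final class TopTermsSketch {

    /** A term together with the number of matching documents that contain it. */
    static final class TermCount {
        final String term;
        final int count;

        TermCount(String term, int count) {
            this.term = term;
            this.count = count;
        }
    }

    /**
     * postings: term -> ids of the documents containing that term
     *           (a hypothetical in-memory stand-in for an index).
     * matches:  ids of the documents selected by the query or filter.
     * Returns at most n terms, ordered by how many matching documents contain them.
     */
    static List<TermCount> topTerms(Map<String, BitSet> postings, BitSet matches, int n) {
        List<TermCount> counts = new ArrayList<>();
        for (Map.Entry<String, BitSet> e : postings.entrySet()) {
            BitSet hit = (BitSet) e.getValue().clone(); // copy so the index is not modified
            hit.and(matches);                           // keep only documents in the subset
            int count = hit.cardinality();
            if (count > 0) {
                counts.add(new TermCount(e.getKey(), count));
            }
        }
        // Highest count first; ties broken alphabetically so the order is deterministic.
        counts.sort((a, b) -> a.count != b.count
                ? Integer.compare(b.count, a.count)
                : a.term.compareTo(b.term));
        return new ArrayList<>(counts.subList(0, Math.min(n, counts.size())));
    }
}
```

Note that this walks over every term and intersects its document set with
the matching set, so it never touches the result documents themselves, but
its cost still grows with the number of distinct terms. Whether something
along these lines is practical on a real index is exactly the open
question.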

I really want to avoid iterating over all results (obviously), so my
question is whether there is a preferred approach for this kind of
analysis, or whether it has been done in a good way before.

Thanks for any help!

Best regards,
  Aleksander

-- 
Aleksander M. Stensby
Senior Software Developer
Integrasco A/S
+47 41 22 82 72
aleksander.stensby@integrasco.no
