lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Otis Gospodnetic <otis_gospodne...@yahoo.com>
Subject Re: Top N terms of an indexed field
Date Fri, 29 Feb 2008 04:18:26 GMT
Alex,

You can also use HighFrequencyTerms class (or something with a very similar name) from Lucene
contrib/misc (I believe).  It's a command line app that will get you exactly what you want.
 Good for figuring out if you should add more terms to your stopword list, for example.

Otis
--
Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch

----- Original Message ----
> From: Alex Benjamen <abenjamen@friendster.com>
> To: solr-user@lucene.apache.org
> Sent: Thursday, February 28, 2008 10:22:38 PM
> Subject: Top N terms of an indexed field
> 
> I was wondering if it is possible to retrieve the top 20 terms for a given 
> fields in an index. 
>  
> For example, if we're indexing user profile data and one of the fields 
> is "interests" - it would be great to get the top 20 terms for interests
> found in the index. 
>  
> -Alex
> 



Mime
View raw message