lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Walter Underwood <wunderw...@netflix.com>
Subject Re: Word frequency count in the index
Date Thu, 16 Jul 2009 14:03:31 GMT
Lucene uses a tf.idf relevance formula, so it automatically finds common
words (stop words) in your documents and gives them lower weight. I
recommend not removing stop words at all and letting Lucene handle
the weighting.

wunder

On 7/16/09 3:29 AM, "Pooja Verlani" <pooja.verlani@gmail.com> wrote:

> Hi,
> 
> Is there any way in SOLR to know the count of each word indexed in the solr
> ?
> I want to find out the different word frequencies to figure out '
> application specific stop words'.
> 
> Please let me know if its possible.
> 
> Thank you,
> Regards,
> Pooja


Mime
View raw message