Lucene uses a tf.idf relevance formula, so words that are common across
your documents (stop words) automatically get a lower weight. I
recommend not removing stop words at all and letting Lucene handle
the weighting.
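
To see why common words end up with low weight, here is a rough sketch of the idf part of the idea. It uses the textbook formula log(N / df), not Lucene's exact scoring code, and the function name and sample documents are made up for illustration.

```python
import math
from collections import Counter

def idf_weights(docs):
    """Textbook inverse document frequency: log(N / df) per term.

    Illustrative only; this is not Lucene's exact similarity formula.
    """
    n_docs = len(docs)
    doc_freq = Counter()
    for doc in docs:
        # Count each term once per document, however often it repeats.
        doc_freq.update(set(doc.lower().split()))
    return {term: math.log(n_docs / df) for term, df in doc_freq.items()}

# Hypothetical sample documents.
docs = [
    "the solr index stores the terms",
    "the query parser reads the query",
    "the lucene scorer ranks the hits",
]
weights = idf_weights(docs)

# List terms from lowest to highest weight.
for term, weight in sorted(weights.items(), key=lambda kv: kv[1]):
    print(f"{term:8s} {weight:.3f}")
```

A word that appears in every document ("the" here) gets a weight of zero, while a word that appears in only one document gets the highest weight, so very common words contribute little to the score without being removed.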
wunder
On 7/16/09 3:29 AM, "Pooja Verlani" <pooja.verlani@gmail.com> wrote:
> Hi,
>
> Is there any way in Solr to find the count of each word in the index?
> I want to find the different word frequencies to figure out
> 'application-specific stop words'.
>
> Please let me know if it's possible.
>
> Thank you,
> Regards,
> Pooja