lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Seth Rosen <s...@architexa.com>
Subject Re: High frequency term for the searched query
Date Thu, 04 Nov 2010 13:49:52 GMT
You might want to take a look at this tutorial on how Lucene calculates
Scoring [1]. If all you are interested in is the term frequency and you want
to ignore other calculations you can override the others and have them
return 1.

Hope this helps!
Seth Rosen
seth@architexa.com
www.architexa.com



[1] http://www.lucenetutorial.com/advanced-topics/scoring.html

On Thu, Nov 4, 2010 at 3:53 AM, starz10de <farag_ahmed@yahoo.com> wrote:

>
> I need to find the most frequent terms that are appeared with a query.
>
> HighFreqTerms.java can be used only to obtain the high frequency terms in
> the whole index.
>
> I need just to find the high frequency terms to the submitted query.
>
> What I do now is:
>
> I search the index with the query and retrieve the relevant documents then
> save those documents in a new folder then index them. At the end I use
> HighFreqTerms.java in the new index so I can find the most frequent terms
> to
> the query. However, this is very slow and need long time to run.
>
> Any idea how I can do this task efficiently
>
>
> Thanks in advance
>
> --
> View this message in context:
> http://lucene.472066.n3.nabble.com/High-frequency-term-for-the-searched-query-tp1839942p1839942.html
> Sent from the Lucene - Java Users mailing list archive at Nabble.com.
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
> For additional commands, e-mail: java-user-help@lucene.apache.org
>
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message