lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Jack Krupansky <jack.krupan...@gmail.com>
Subject Re: Jensen–Shannon divergence
Date Mon, 14 Dec 2015 22:21:08 GMT
Is there any particular reason that you find Lucene's builtin TF/IDF and
BM25 similarity models insufficient for your needs? In any case,
examination of their source code should get you started if you with to do
your own:

https://lucene.apache.org/core/5_3_0/core/org/apache/lucene/search/similarities/TFIDFSimilarity.html
https://lucene.apache.org/core/5_3_0/core/org/apache/lucene/search/similarities/BM25Similarity.html

-- Jack Krupansky

On Sun, Dec 13, 2015 at 8:30 AM, Shay Hummel <shay.hummel@gmail.com> wrote:

> Hi
>
> I need help to implement similarity between query model and document model.
> I would like to use the JS-Divergence
> <https://en.wikipedia.org/wiki/Jensen%E2%80%93Shannon_divergence> for
> ranking documents. The documents and the query will be represented
> according to the language models approach - specifically the LMDiriclet.
> The similarity will be calculated using the JS-Div between the document
> model and the query model.
> Is it possible?
> if so how?
>
> Thank you,
> Shay Hummel
> --
> Regards,
> Shay Hummel
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message