lucene-java-user mailing list archives

From Dwaipayan Roy <dwaipayan....@gmail.com>
Subject Doc length normalization in Lucene LM
Date Thu, 21 Jul 2016 15:06:33 GMT
Hello,

In *SimilarityBase.java*, I can see that the length of the document is
normalized by the function *decodeNormValue()*, but I can't understand
how the normalization is done. Can you please help? Also, is there any
way to avoid this doc-length normalization and use the raw doc-length
instead (as used in LM-JM, Zhai et al., SIGIR 2001)?

Thanks.

P.S. I am using Lucene 4.10.4
