lucene-dev mailing list archives

From Maurice Coyle
Subject similarity of page against page rather than page against query
Date Fri, 06 Jun 2003 09:37:07 GMT

I was wondering whether anyone has tried implementing anything in Lucene
that uses the index to calculate similarity between the content of two
pages (i.e. a score for how similar page B is to page A, rather than a
score for page B against a query term)?

I suppose the IndexReader class would be useful (docFreq(), termDocs(),
terms(), etc.). I just wanted to get an impression of whether anyone has
actually tried this and how doable it is. As a new developer on the Lucene
project, I find some of it a bit of a mystery, so any pointers would be
great.
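
To make the question concrete, here is a rough sketch of one way a
page-against-page score could be computed: cosine similarity over tf-idf
weighted term vectors. It is plain Java and does not call Lucene at all.
The class name PageSimilarity, its methods, and the idf formula are
assumptions made for illustration; in a real index the per-page term counts
and the document frequencies would presumably come from IndexReader
(termDocs(), docFreq()) rather than from the in-memory maps used here.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical helper, not part of Lucene: scores page A against page B.
final class PageSimilarity {

    // Count term occurrences in one page (naive lowercase, non-word split).
    static Map<String, Integer> termFreqs(String text) {
        Map<String, Integer> tf = new HashMap<>();
        for (String tok : text.toLowerCase().split("\\W+")) {
            if (!tok.isEmpty()) {
                tf.merge(tok, 1, Integer::sum);
            }
        }
        return tf;
    }

    // One common smoothed idf form (an assumption, chosen so the weight
    // stays positive whenever docFreq <= numDocs).
    static double idf(int docFreq, int numDocs) {
        return 1.0 + Math.log(numDocs / (docFreq + 1.0));
    }

    // Cosine similarity between the tf-idf vectors of two pages.
    // docFreq maps term -> number of pages containing it; missing terms
    // are treated as having a document frequency of 0.
    static double cosine(Map<String, Integer> a, Map<String, Integer> b,
                         Map<String, Integer> docFreq, int numDocs) {
        double dot = 0.0;
        double normA = 0.0;
        double normB = 0.0;
        for (Map.Entry<String, Integer> e : a.entrySet()) {
            double w = idf(docFreq.getOrDefault(e.getKey(), 0), numDocs);
            double wa = e.getValue() * w;
            normA += wa * wa;
            Integer tfB = b.get(e.getKey());
            if (tfB != null) {
                dot += wa * (tfB * w);   // only shared terms contribute
            }
        }
        for (Map.Entry<String, Integer> e : b.entrySet()) {
            double w = idf(docFreq.getOrDefault(e.getKey(), 0), numDocs);
            double wb = e.getValue() * w;
            normB += wb * wb;
        }
        if (normA == 0.0 || normB == 0.0) {
            return 0.0;                  // an empty page matches nothing
        }
        return dot / (Math.sqrt(normA) * Math.sqrt(normB));
    }
}
```

With this weighting, two pages with no terms in common score 0 and a page
compared with itself scores 1 (up to rounding), so the result can be read
much like a normalised query score. The cost is that each comparison walks
every term of both pages, which may be slow for large pages unless the
vectors are pruned to the highest-weighted terms first.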

