lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Venkatraju <>
Subject Proximity in ranking, summary generation
Date Wed, 01 Dec 2004 16:54:17 GMT

This is actually 2 somewhat related questions:
- In regular multi term queries, does the default ranking function of
Lucene take into account proximity of the search terms? As far as I
know, proximity data is used only in phrase searches. Is this correct?
If so, does someone have pointers/sample implementation of how
proximity data can be used to supplement tfidf in ranking documents?

- Is there a way to get the term offset or byte offset of the best
match(es) in the document? I am looking to use this information for
summary generation/highlighting.


To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message