lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Ype Kingma <ykin...@xs4all.nl>
Subject Re: Highlighting Redux
Date Fri, 21 Mar 2003 03:20:27 GMT
On Thursday 20 March 2003 10:12, Leander Harding wrote:
> Hi,
>
>     Yes, it's another question about Term highlighting. Essentially, what
> I'm looking to is obtain a set of term positions in a given document that
> are hits for a given Query. I've read the archives and looked at the
> contributed code, but it all fails in one important (to my employer)
> respect: it doesn't understand the semantics of Lucene queries, rather it
> looks at the terms they contain and highlights them all. Consider the
> following query:
> ("foo" AND "bar") OR "baz"
> Suppose that we search using this query and the following document is a
> hit: <doc>Foo.....quux......baz.</doc>
> Which Terms do we highlight?
> All of the existing highlighting code I've seen would highlight both "foo"
> and "baz", but this isn't correct - the document contains "foo", but no
> "bar", thus, since "foo" in the query is part of an AND expression that
> wasn't satisfied by this document, only "baz" should be highlighted.

According to the scoring documentation, although Foo is not a boolean match, 
it is still used to determine the score, so one might as well highlight it.

Kind regards,
Ype


---------------------------------------------------------------------
To unsubscribe, e-mail: lucene-dev-unsubscribe@jakarta.apache.org
For additional commands, e-mail: lucene-dev-help@jakarta.apache.org


Mime
View raw message