lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Guido Wegener" <...@startext.de>
Subject highlighting phrases
Date Mon, 30 Aug 2004 07:00:37 GMT
I am working on a modification to Lucene's highlighter. Currently all terms of
a phrase query are highlighted, even if they appear out of phrase context:
Searching for "Foo Bar" in "Foo Bar some stuff Foo" will result in
"_Foo_ _Bar_ some stuff _Foo_". It would be nicer to have
"_Foo_ _Bar_ some stuff Foo" as the result.

I already implemented this behaviour in an older version of the highlighter,
where things were still simple. But now I see that there was a modification to
deal with overlapping tokens. These make the whole matter much more complicated.
But I guess that I will try to merge my old phrase highlighter code with the
current version of Lucene.

Is anybody working on this kind of phrase highlighting?
Would my modifications be of interest to you?

Best regards,
  Guido Wegener

-- 
Guido Wegener
startext Unternehmensberatung GmbH
Kennedyallee 2, D-53175 Bonn
Tel: +49 (0)228 959 96-26, Fax: +49 (0)228 959 96-66
Internet: http://www.startext.de, E-Mail: gwe@startext.de
 


---------------------------------------------------------------------
To unsubscribe, e-mail: lucene-dev-unsubscribe@jakarta.apache.org
For additional commands, e-mail: lucene-dev-help@jakarta.apache.org


Mime
View raw message