lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Mark Miller (JIRA)" <>
Subject [jira] Commented: (LUCENE-794) Beginnings of a span based highlighter
Date Sun, 18 Feb 2007 21:46:05 GMT


Mark Miller commented on LUCENE-794:

Howdy Mark H, I have not got into making new SpanQuery tests yet, but at this point I could
use some help/guidance. All of the original highlighter tests are passing with the new SpanScorer
except for two: 

1. testFieldSpecificHighlighting

This will not pass the second assertion (ignore fields) because when i add the TokenStream
to a MemoryIndex I have to add it to a field. I am stumped on getting around this one.

2. testOverlapAnalyzer2

Passes the first bunch but then fails on one. This is because I am looking up terms based
on position since the Spans do not return the term text. The first assertion failing is looking
for 'hi-<b>speed</b>' but finds '<b>hi-speed</b>' because both 'speed'
and 'hi-speed' are at position 0...consequently both score a 1. Any thoughts? I was thinking
about gathering all possible terms in the SpanQueryExtractor and someone using them...

Beyond that, I am sure you can find plenty of other things to point out . Have at me <g>

Any ideas on scoring would be appreciated as well.

Feel free to run with this on your own if you have time as well...or run with it a bit and
pass it back, or just provide some guidance as I go...whatever works out best for you.

- Mark M

> Beginnings of a span based highlighter
> --------------------------------------
>                 Key: LUCENE-794
>                 URL:
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: Other
>            Reporter: Mark Miller
>            Priority: Minor
>         Attachments:,,,,,,,,,,,,,,,,,,,
> This is some test code to start the work of adding a span based highlighting approach
to the existing highlighter in contrib. See
for some background.
> There is a dependency on MemoryIndex.

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.

To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message