lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Mark Miller (JIRA)" <>
Subject [jira] Updated: (LUCENE-794) Extend contrib Highlighter to properly support phrase queries and span queries
Date Fri, 19 Oct 2007 14:48:51 GMT


Mark Miller updated LUCENE-794:

    Attachment: spanhighlighter12.patch

Nice little addition courtesy of Michael Goddard:

"...In our Lucene work, we took the approach of indexing all fields into a single field, "FULLTEXT",
which is the default field for queries.  Our query syntax is such that a user can combine
clauses against named fields with clauses with no field specified.  When we go to highlight
such queries, if a given clause is against this FULLTEXT field but we're highlighting text
in the TITLE field, we'd still like for matching terms to be highlighted..."

Thanks for the patch Micahael.

There is a new constructor that allows you to specify a default field. Terms from this field
will be highlighted regardless of the specific field you are highlighting.

Only file to worry about in that huge mess of files listed above is spanhighlighter12.patch.

> Extend contrib Highlighter to properly support phrase queries and span queries
> ------------------------------------------------------------------------------
>                 Key: LUCENE-794
>                 URL:
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: Other
>            Reporter: Mark Miller
>            Priority: Minor
>         Attachments:,,,,,,,,,,,,,,,,, spanhighlighter.patch,
spanhighlighter10.patch, spanhighlighter11.patch, spanhighlighter12.patch, spanhighlighter2.patch,
spanhighlighter3.patch, spanhighlighter5.patch, spanhighlighter6.patch, spanhighlighter7.patch,
spanhighlighter8.patch, spanhighlighter9.patch,,,,,,
> This patch adds a new Scorer class (SpanQueryScorer) to the Highlighter package that
scores just like QueryScorer, but scores a 0 for Terms that did not cause the Query hit. This
gives 'actual' hit highlighting for the range of SpanQuerys and PhraseQuery. There is also
a new Fragmenter that attempts to fragment without breaking up Spans.
> See for some background.
> There is a dependency on MemoryIndex.

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.

To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message