lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "David Smiley (JIRA)" <j...@apache.org>
Subject [jira] [Created] (LUCENE-7578) UnifiedHighlighter: Convert PhraseHelper to use SpanCollector API
Date Wed, 30 Nov 2016 19:59:58 GMT
David Smiley created LUCENE-7578:
------------------------------------

             Summary: UnifiedHighlighter: Convert PhraseHelper to use SpanCollector API
                 Key: LUCENE-7578
                 URL: https://issues.apache.org/jira/browse/LUCENE-7578
             Project: Lucene - Core
          Issue Type: Improvement
          Components: modules/highlighter
            Reporter: David Smiley


The PhraseHelper of the UnifiedHighlighter currently collects position-spans per SpanQuery
(and it knows which terms are in which SpanQuery), and then it filters PostingsEnum based
on that.  It's similar to how the original Highlighter WSTE works.  The main problem with
this approach is that it can be inaccurate for some nested span queries -- LUCENE-2287, LUCENE-5455
(has the clearest example), LUCENE-6796.  Non-nested SpanQueries (e.g. that which is converted
from a PhraseQuery or MultiPhraseQuery) are _not_ a problem.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


Mime
View raw message