lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Michael McCandless (JIRA)" <j...@apache.org>
Subject [jira] [Created] (LUCENE-3091) Highlighter's TokenSources should use TermVector positions if indexed
Date Thu, 12 May 2011 17:07:47 GMT
Highlighter's TokenSources should use TermVector positions if indexed
---------------------------------------------------------------------

                 Key: LUCENE-3091
                 URL: https://issues.apache.org/jira/browse/LUCENE-3091
             Project: Lucene - Java
          Issue Type: Improvement
          Components: contrib/highlighter
            Reporter: Michael McCandless
            Priority: Minor


Spinoff from LUCENE-3087

TokenSources is able to use positions indexed into TermVectors, if you pass "false" for the
tokenPositionsGuaranteedContiguous argument to getTokenStream.

But if you pass true, it doesn't use the positions.  Maybe, we are not allowed (supposed)
to pass true if the posIncr could be 0?  But, if that is OK/allowed, then we should fix the
code to handle the posIncr=0 case correctly.

LUCENE-3087 made one simple fix, to guess at the posInc (1 or 0) based on the offsets of adjacent
tokens.


--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


Mime
View raw message