lucene-dev mailing list archives

From "Shinya Kasatani (JIRA)" <j...@apache.org>
Subject [jira] Created: (LUCENE-2910) Highlighter does not correctly highlight the phrase around 50th term
Date Mon, 07 Feb 2011 09:29:31 GMT
Highlighter does not correctly highlight the phrase around 50th term
--------------------------------------------------------------------

                 Key: LUCENE-2910
                 URL: https://issues.apache.org/jira/browse/LUCENE-2910
             Project: Lucene - Java
          Issue Type: Bug
          Components: contrib/highlighter
    Affects Versions: 2.9.4
            Reporter: Shinya Kasatani
            Priority: Trivial
         Attachments: HighlighterFix.patch

When the Highlighter is used with an N-gram tokenizer such as CJKTokenizer and the phrase
to highlight appears around the 50th term in the field, the highlighted phrase is shorter
than expected: the beginning of the match is left outside the highlight tags.

For example, highlighting "fooo" in the following text with a bigram tokenizer:
"0---------1---------2---------3---------4---------fooo---"

Expected: "0---------1---------2---------3---------4---------<B>fooo</B>---"
Actual: "0---------1---------2---------3---------4---------f<B>ooo</B>---"
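
To make the term positions concrete, here is a minimal Java sketch that splits the sample text into overlapping character bigrams and lists the terms that fall inside "fooo". It does not use Lucene: the class BigramOffsets, the Term type, and the bigrams and highlight helpers are hypothetical names made up for illustration, and a plain sliding-window bigram is an assumption that may differ from what CJKTokenizer actually emits. The "Actual" line is only reproduced from the report by moving the start offset one character to the right; it is not computed by the Highlighter.

```java
import java.util.ArrayList;
import java.util.List;

// Illustration only: a stand-in for a bigram tokenizer, not Lucene code.
class BigramOffsets {

    /** A bigram term with its character offsets (end is exclusive). */
    static final class Term {
        final String text;
        final int start;
        final int end;

        Term(String text, int start, int end) {
            this.text = text;
            this.start = start;
            this.end = end;
        }
    }

    /** Splits s into overlapping two-character terms: [0,2), [1,3), ... */
    static List<Term> bigrams(String s) {
        List<Term> terms = new ArrayList<>();
        for (int i = 0; i + 2 <= s.length(); i++) {
            terms.add(new Term(s.substring(i, i + 2), i, i + 2));
        }
        return terms;
    }

    /** Wraps the characters in [start, end) in <B>...</B>. */
    static String highlight(String s, int start, int end) {
        return s.substring(0, start) + "<B>" + s.substring(start, end) + "</B>" + s.substring(end);
    }

    public static void main(String[] args) {
        String text = "0---------1---------2---------3---------4---------fooo---";
        int matchStart = text.indexOf("fooo");
        int matchEnd = matchStart + "fooo".length();

        // List the bigrams that lie entirely inside the match.
        List<Term> terms = bigrams(text);
        for (int i = 0; i < terms.size(); i++) {
            Term t = terms.get(i);
            if (t.start >= matchStart && t.end <= matchEnd) {
                System.out.println("term index " + i + ": \"" + t.text
                        + "\" [" + t.start + "," + t.end + ")");
            }
        }

        // Expected highlight covers the whole match.
        System.out.println("Expected: " + highlight(text, matchStart, matchEnd));
        // Reported behaviour, reproduced by hand: the first character is missed.
        System.out.println("Actual:   " + highlight(text, matchStart + 1, matchEnd));
    }
}
```

Under these assumptions the match starts at character offset 50, so the bigrams "fo", "oo", "oo" sit at zero-based term indices 50 to 52, right at the position the report describes. The reported output corresponds to the highlight starting at offset 51 instead of 50.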


-- 
This message is automatically generated by JIRA.
-
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org

