lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Steve Rowe (JIRA)" <>
Subject [jira] [Commented] (LUCENE-4810) Positions are incremented for each ngram in EdgeNGramTokenFilter
Date Mon, 22 Apr 2013 13:47:15 GMT


Steve Rowe commented on LUCENE-4810:

FWIW, I fixed the svn:log property on the 1470496, 1470497, and 1470502 revisions to be "LUCENE-4810:
*position increment for* first output token from EdgeNGramTokenFilter must be > 0". 
> Positions are incremented for each ngram in EdgeNGramTokenFilter
> ----------------------------------------------------------------
>                 Key: LUCENE-4810
>                 URL:
>             Project: Lucene - Core
>          Issue Type: Bug
>          Components: modules/analysis
>            Reporter: Walter Underwood
>            Assignee: Michael McCandless
>             Fix For: 5.0, 4.3
>         Attachments: LUCENE-4810.diff, LUCENE-4810-first-token-position-increment.patch,
LUCENE-4810.patch, LUCENE-4810.patch
> Edge ngrams should be like synonyms, with all the ngrams generated from a token having
the same position as that original token. The current code increments position.
> For the text "molecular biology", the query "mol bio" should match as a phrase in neighboring
positions. It does not.
> You can see this in the Analysis page in the admin UI.

This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see:

To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message