lucene-dev mailing list archives

From Walter Underwood <wun...@wunderwood.org>
Subject Re: Positions in EdgeNgramTokenFilter
Date Fri, 01 Mar 2013 19:15:16 GMT
That is a pretty broad issue, but this fix does belong somewhere under "improve ngrams".
Maybe I should open a specific bug linked to that one?

Incrementing the position for each output token might still be the right thing for pure ngrams.

wunder

On Mar 1, 2013, at 11:02 AM, Robert Muir wrote:

> Walter, sounds very interesting. Maybe just use this issue:
> https://issues.apache.org/jira/browse/LUCENE-3907 ?
> 
> On Fri, Mar 1, 2013 at 10:41 AM, Walter Underwood <wunder@wunderwood.org> wrote:
>> I'm fixing position increment in EdgeNgramTokenFilter to act like synonyms,
>> with each ngram at the same position as the source token. Currently, the
>> position is incremented for each output token, which breaks phrase searching
>> with edge ngrams.
>> 
>> I could not find a current Jira issue for this. Is there one?
>> 
>> We are still on 3.3, but I'll submit a patch for 4.x.
>> 
>> Thanks to whoever converted EdgeNgramTokenFilter to use TokenStream.
>> 
>> wunder
>> --
>> Walter Underwood
>> wunder@wunderwood.org
>> 
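
For readers following the thread, here is a minimal sketch of the two position behaviours being discussed. It is not Lucene code and not the patch itself: it is a plain-Python model, and the names edge_ngrams, phrase_matches and same_position are hypothetical, invented for illustration. It assumes front-edge grams only and a simplified phrase check that requires the terms to sit at consecutive positions.

```python
def edge_ngrams(tokens, min_gram, max_gram, same_position):
    """Return (term, position) pairs for the front edge n-grams of each token.

    same_position=True:  all grams of a token share that token's position,
                         the way stacked synonyms do.
    same_position=False: every emitted gram advances the position by one.
    Tokens shorter than min_gram emit nothing in this simplified model.
    """
    out = []
    position = -1
    for token in tokens:
        first = True
        for size in range(min_gram, min(max_gram, len(token)) + 1):
            # Only the first gram of a token advances the position when stacking.
            increment = 1 if (first or not same_position) else 0
            position += increment
            out.append((token[:size], position))
            first = False
    return out


def phrase_matches(indexed, phrase):
    """True if the phrase terms occur at consecutive positions in `indexed`."""
    if not phrase:
        return False
    positions = {}
    for term, pos in indexed:
        positions.setdefault(term, set()).add(pos)
    return any(
        all(start + i in positions.get(term, set()) for i, term in enumerate(phrase))
        for start in positions.get(phrase[0], set())
    )


stacked_tokens = edge_ngrams(["new", "york"], 1, 4, same_position=True)
incremented_tokens = edge_ngrams(["new", "york"], 1, 4, same_position=False)

print(stacked_tokens)      # n, ne, new at position 0; y, yo, yor, york at position 1
print(incremented_tokens)  # positions run 0 through 6, one per gram

print(phrase_matches(stacked_tokens, ["ne", "yo"]))      # True
print(phrase_matches(incremented_tokens, ["ne", "yo"]))  # False
```

With one position per gram, "ne" and "yo" end up three positions apart, so the prefix phrase no longer matches. With the grams stacked on the source token's position, it does.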





