lucene-dev mailing list archives

From "Christoph Lingg (JIRA)" <>
Subject [jira] [Created] (SOLR-5152) EdgeNGramFilterFactory deletes token
Date Wed, 14 Aug 2013 15:15:47 GMT
Christoph Lingg created SOLR-5152:

             Summary: EdgeNGramFilterFactory deletes token
                 Key: SOLR-5152
             Project: Solr
          Issue Type: Improvement
    Affects Versions: 4.4
            Reporter: Christoph Lingg

I am using EdgeNGramFilterFactory in my schema.xml:

{code:xml}<fieldType name="text" class="solr.TextField" positionIncrementGap="100">
  <analyzer type="index">
    <!-- ... -->
    <filter class="solr.EdgeNGramFilterFactory" minGramSize="2" maxGramSize="10" side="front"/>
  </analyzer>
</fieldType>{code}

Some tokens in my index consist of only one character, for example {{R}}. minGramSize is set
to 2, which is greater than the length of such a token. I expected the EdgeNGramFilter to leave
{{R}} unchanged, but in fact it deletes the token.

This behavior is undesirable for my use case, and probably for most other use cases as well.
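
The behavior described above can be sketched with a simplified model in Python. This is not Lucene's implementation, only an illustration of front-edge n-gram generation with the minGramSize and maxGramSize values from the schema. The function name {{edge_ngrams}} and the {{keep_short}} flag are hypothetical; the flag stands for the behavior requested in this issue, not an existing option.

```python
def edge_ngrams(token, min_gram=2, max_gram=10, keep_short=False):
    """Return the front-edge n-grams of a token (simplified model).

    keep_short is a hypothetical flag: when True, a token shorter than
    min_gram is passed through unchanged instead of being dropped.
    """
    if len(token) < min_gram:
        # Reported behavior: no gram of the minimum size fits, so the
        # token produces no output at all.
        return [token] if keep_short else []
    upper = min(max_gram, len(token))
    return [token[:n] for n in range(min_gram, upper + 1)]


print(edge_ngrams("Solr"))                # ['So', 'Sol', 'Solr']
print(edge_ngrams("R"))                   # [] (the token is lost)
print(edge_ngrams("R", keep_short=True))  # ['R'] (the expected result)
```

With the default arguments, the one-character token {{R}} yields no grams, so nothing is indexed for it. With the hypothetical flag enabled, it is kept as is.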
