lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Bernd Fehling (JIRA)" <j...@apache.org>
Subject [jira] [Created] (SOLR-2628) use of FST for SynonymsFilterFactory and synonyms.txt
Date Fri, 01 Jul 2011 07:40:28 GMT
use of FST for SynonymsFilterFactory and synonyms.txt
-----------------------------------------------------

                 Key: SOLR-2628
                 URL: https://issues.apache.org/jira/browse/SOLR-2628
             Project: Solr
          Issue Type: New Feature
          Components: Schema and Analysis
    Affects Versions: 3.4, 4.0
         Environment: Linux
            Reporter: Bernd Fehling
            Priority: Minor


Currently the SynonymsFilterFactory builds up a memory based SynonymsMap. 
This can generate huge maps because of the permutations for synonyms.

Now where FST (finite state transducer) is introduced to lucene this could also be used for
synonyms.
A tool can compile the synoynms.txt file to a binary automaton file which can then be used
with SynoynmsFilterFactory.

Advantage:
- faster start of solr, no need to generate SynonymsMap
- faster lookup
- memory saving


--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


Mime
View raw message