lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Namgyu Kim (JIRA)" <j...@apache.org>
Subject [jira] [Created] (LUCENE-8817) Combine Nori and Kuromoji DictionaryBuilder
Date Sat, 01 Jun 2019 16:40:00 GMT
Namgyu Kim created LUCENE-8817:
----------------------------------

             Summary: Combine Nori and Kuromoji DictionaryBuilder
                 Key: LUCENE-8817
                 URL: https://issues.apache.org/jira/browse/LUCENE-8817
             Project: Lucene - Core
          Issue Type: New Feature
            Reporter: Namgyu Kim


This issue is related to LUCENE-8816.

Currently Nori and Kuromoji Analyzer use the same dictionary structure. (MeCab)
If we make combine DictionaryBuilder, we can reduce the code size.
But this task may have a dependency on the language.
(like HEADER string in BinaryDictionary and CharacterDefinition, methods in BinaryDictionaryWriter,
...)
On the other hand, there are many overlapped classes.

The purpose of this patch is to provide users of Nori and Kuromoji with the same system dictionary
generator.

It may take some time because there is a little workload.
The work will be based on the latest master, and if the LUCENE-8816 is finished first, it
will pull the latest code and proceed.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


Mime
View raw message