lucene-dev mailing list archives

From "Tomoko Uchida (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (LUCENE-8869) Build kuromoji system dictionary as a separated jar and load it from JapaneseTokenizer at runtime
Date Wed, 19 Jun 2019 16:19:00 GMT

     [ https://issues.apache.org/jira/browse/LUCENE-8869?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Tomoko Uchida updated LUCENE-8869:
----------------------------------
    Description: 
This is a sub-task for LUCENE-8816.
 In this issue, I will try to make small but self-contained changes to the kuromoji system dictionary.
 - Make it possible to build a jar that contains (maybe) only the dictionary data resources generated by the {{build-dict}} task.
 -- Maybe a new ant target will be added.
 - Make it possible to load an external dictionary when initializing JapaneseTokenizer.
 -- Some work has already been done in LUCENE-8863.
 - Decouple the current system dictionary data (mecab ipadic) from kuromoji itself and use it as the default (possibly this can be done in another issue).

Also, some refactoring of the directory/source tree structure may be needed.

  was:
This is a sub-task for LUCENE-8816.
 In this issue, I will try to make small but self-contained changes to the kuromoji system dictionary.
 - Make it possible to build a jar that contains (maybe) only the dictionary data resources generated by the {{build-dict}} task.
 - Make it possible to load an external dictionary when initializing JapaneseTokenizer.
 -- Some work has already been done in LUCENE-8863.
 - Decouple the current system dictionary data (mecab ipadic) from kuromoji itself and use it as the default (possibly this can be done in another issue).

Also, some refactoring of the directory/source tree structure may be needed.


> Build kuromoji system dictionary as a separated jar and load it from JapaneseTokenizer at runtime
> -------------------------------------------------------------------------------------------------
>
>                 Key: LUCENE-8869
>                 URL: https://issues.apache.org/jira/browse/LUCENE-8869
>             Project: Lucene - Core
>          Issue Type: Improvement
>          Components: modules/analysis
>            Reporter: Tomoko Uchida
>            Priority: Major
>
> This is a sub-task for LUCENE-8816.
>  In this issue, I will try to make small but self-contained changes to the kuromoji system dictionary.
>  - Make it possible to build a jar that contains (maybe) only the dictionary data resources generated by the {{build-dict}} task.
>  -- Maybe a new ant target will be added.
>  - Make it possible to load an external dictionary when initializing JapaneseTokenizer.
>  -- Some work has already been done in LUCENE-8863.
>  - Decouple the current system dictionary data (mecab ipadic) from kuromoji itself and use it as the default (possibly this can be done in another issue).
> Also, some refactoring of the directory/source tree structure may be needed.
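
The second item above (loading a dictionary from a separate jar at runtime) could be sketched roughly as follows. This is only an illustration under stated assumptions, not the change proposed in the issue and not an existing Lucene API: the class name DictionaryResourceLocator, the load method, and any resource name passed to it are hypothetical. It uses only the JDK (Java 9 or later for readAllBytes) and reads a named resource from an external jar when one is given, otherwise from the default classpath.

```java
import java.io.IOException;
import java.io.InputStream;
import java.net.URL;
import java.net.URLClassLoader;
import java.nio.file.Path;

/** Hypothetical helper (not part of Lucene): reads dictionary data from a jar chosen at runtime. */
final class DictionaryResourceLocator {
  private DictionaryResourceLocator() {}

  /**
   * Returns the bytes of resourceName. If externalJar is non-null the resource is
   * looked up in that jar; otherwise it is looked up on the classpath that loaded
   * this class, which stands in for the bundled default dictionary.
   */
  static byte[] load(Path externalJar, String resourceName) throws IOException {
    if (externalJar == null) {
      ClassLoader defaultLoader = DictionaryResourceLocator.class.getClassLoader();
      try (InputStream in = defaultLoader.getResourceAsStream(resourceName)) {
        return readOrFail(in, resourceName);
      }
    }
    URL[] urls = {externalJar.toUri().toURL()};
    // A null parent means only the bootstrap loader is consulted before the external jar,
    // so the application classpath cannot shadow the external dictionary.
    try (URLClassLoader jarLoader = new URLClassLoader(urls, null);
        InputStream in = jarLoader.getResourceAsStream(resourceName)) {
      return readOrFail(in, resourceName);
    }
  }

  private static byte[] readOrFail(InputStream in, String resourceName) throws IOException {
    if (in == null) {
      throw new IOException("Dictionary resource not found: " + resourceName);
    }
    return in.readAllBytes();
  }
}
```

The bytes are read eagerly so that the temporary class loader can be closed before returning; a real implementation would more likely hand streams to the existing dictionary classes.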



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org

