hivemall-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Takuya Kitazawa (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (HIVEMALL-130) Support user-defined dictionary for `tokenize_ja`
Date Fri, 07 Jul 2017 03:47:00 GMT

     [ https://issues.apache.org/jira/browse/HIVEMALL-130?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Takuya Kitazawa updated HIVEMALL-130:
-------------------------------------
    Description: 
Support another argument "userDict". Type would be List<String>, and each element defines
a new word in the following format: <word>,<result>,<read>,<class>
https://github.com/atilika/kuromoji/blob/d0700ab6dd489aaf0fcb1e4e78ce2f682be9f255/kuromoji-core/src/test/resources/userdict.txt

Reference for adding user dictionary in the Lucene API (Japanese): http://d.hatena.ne.jp/Kazuhira/20130616/1371390716

  was:
Support another argument "userDict". Type would be List<String>, and each element defines
a new word in the following format: <word>,<result>,<read>,<class>
https://github.com/atilika/kuromoji/blob/d0700ab6dd489aaf0fcb1e4e78ce2f682be9f255/kuromoji-core/src/test/resources/userdict.txt

Ref (Japanese): http://d.hatena.ne.jp/Kazuhira/20130616/1371390716


> Support user-defined dictionary for `tokenize_ja`
> -------------------------------------------------
>
>                 Key: HIVEMALL-130
>                 URL: https://issues.apache.org/jira/browse/HIVEMALL-130
>             Project: Hivemall
>          Issue Type: Improvement
>            Reporter: Takuya Kitazawa
>            Assignee: Takuya Kitazawa
>
> Support another argument "userDict". Type would be List<String>, and each element
defines a new word in the following format: <word>,<result>,<read>,<class>
https://github.com/atilika/kuromoji/blob/d0700ab6dd489aaf0fcb1e4e78ce2f682be9f255/kuromoji-core/src/test/resources/userdict.txt
> Reference for adding user dictionary in the Lucene API (Japanese): http://d.hatena.ne.jp/Kazuhira/20130616/1371390716



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Mime
View raw message