lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Benson Margulies (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (LUCENE-4956) the korean analyzer that has a korean morphological analyzer and dictionaries
Date Thu, 17 Oct 2013 20:50:49 GMT

    [ https://issues.apache.org/jira/browse/LUCENE-4956?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13798393#comment-13798393
] 

Benson Margulies commented on LUCENE-4956:
------------------------------------------

I am told (I don't read Korean myself) that people often leave out the white space between
eojeol that are made up entirely of Hangul letters (Korean letters). Are you just defining
these very long things to be single eojeol? Prof Kang in his own work has a module that splits
these using some rules.

> the korean analyzer that has a korean morphological analyzer and dictionaries
> -----------------------------------------------------------------------------
>
>                 Key: LUCENE-4956
>                 URL: https://issues.apache.org/jira/browse/LUCENE-4956
>             Project: Lucene - Core
>          Issue Type: New Feature
>          Components: modules/analysis
>    Affects Versions: 4.2
>            Reporter: SooMyung Lee
>            Assignee: Christian Moen
>              Labels: newbie
>         Attachments: eval.patch, kr.analyzer.4x.tar, lucene-4956.patch, lucene4956.patch,
LUCENE-4956.patch
>
>
> Korean language has specific characteristic. When developing search service with lucene
& solr in korean, there are some problems in searching and indexing. The korean analyer
solved the problems with a korean morphological anlyzer. It consists of a korean morphological
analyzer, dictionaries, a korean tokenizer and a korean filter. The korean anlyzer is made
for lucene and solr. If you develop a search service with lucene in korean, It is the best
idea to choose the korean analyzer.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


Mime
View raw message