lucene-dev mailing list archives

From "Edward J. Yoon (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (LUCENE-4956) the korean analyzer that has a korean morphological analyzer and dictionaries
Date Sun, 28 Apr 2013 22:54:18 GMT

    [ https://issues.apache.org/jira/browse/LUCENE-4956?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13644162#comment-13644162 ]

Edward J. Yoon commented on LUCENE-4956:
----------------------------------------

I think this would be a valuable addition to Apache Lucene (P.S. I'm Korean, as you may
know).

It would be nice if you could remove all the Korean comments, Korean string literals, and author
tags from the source code to avoid compile and install problems. Otherwise, the SVN server/client
settings, the build script's encoding options, etc. will be somewhat tricky. For example,

{code}
if(entry!=null&&!("을".equals(end)&&entry.getFeature(WordEntry.IDX_REGURA)==IrregularUtil.IRR_TYPE_LIUL))
{
{code}

and,

{code}
/**
 * 복합명사의 개별단어에 대한 정보를 담고있는 클래스 
 * @author S.M.Lee
 *
 */
{code}
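
One possible way to keep such literals without depending on the file encoding is to write them as Unicode escapes, so the source file stays pure ASCII. The following is only a rough sketch of that idea, not code from the attached patch; the class name AsciiSafeLiterals, the constant EUL, and the helper toAsciiEscapes are hypothetical names made up for illustration.

```java
public class AsciiSafeLiterals {

    // Hypothetical constant: the particle compared against in the first
    // snippet above, written as a Unicode escape (code point U+C744)
    // instead of a raw Korean character.
    static final String EUL = "\uC744";

    // Hypothetical helper: rewrites every non-ASCII char of a string as a
    // Java-style escape, so the result can be pasted into an ASCII-only
    // source file.
    static String toAsciiEscapes(String s) {
        StringBuilder sb = new StringBuilder();
        for (int i = 0; i < s.length(); i++) {
            char c = s.charAt(i);
            if (c < 128) {
                sb.append(c);
            } else {
                sb.append(String.format("\\u%04X", (int) c));
            }
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        // The escaped literal is a single UTF-16 code unit at runtime.
        System.out.println(EUL.length());
        // Round trip: prints the six-character ASCII escape for U+C744.
        System.out.println(toAsciiEscapes(EUL));
    }
}
```

With literals written this way, the compiled behavior should not depend on the compiler's source-encoding setting, at the cost of less readable source. Korean comments would still need to be translated or removed separately.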
                
> the korean analyzer that has a korean morphological analyzer and dictionaries
> -----------------------------------------------------------------------------
>
>                 Key: LUCENE-4956
>                 URL: https://issues.apache.org/jira/browse/LUCENE-4956
>             Project: Lucene - Core
>          Issue Type: New Feature
>          Components: modules/analysis
>    Affects Versions: 4.2
>            Reporter: SooMyung Lee
>              Labels: newbie
>         Attachments: kr.analyzer.4x.tar
>
>
> The Korean language has specific characteristics. When developing a search service with Lucene
> and Solr in Korean, there are some problems with searching and indexing. The Korean analyzer
> solves these problems with a Korean morphological analyzer. It consists of a Korean morphological
> analyzer, dictionaries, a Korean tokenizer, and a Korean filter. The Korean analyzer is made
> for Lucene and Solr. If you develop a search service with Lucene in Korean, the Korean analyzer
> is the best choice.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org

