Date: Wed, 24 Apr 2013 14:05:52 +0900
Subject: Contributing the Korean Analyzer
From: =?EUC-KR?B?wMy89rjt?=
To: dev@lucene.apache.org

Hello,

I developed the Korean Analyzer and have distributed it since 2008.
Many people who use Lucene with Korean use it.

I posted it on SourceForge (http://sourceforge.net/projects/lucenekorean).
Here is the CVS address:
d:pserver:anonymous@lucenekorean.cvs.sourceforge.net:/cvsroot/lucenekorean

KoreanAnalyzer consists of the Korean Morphological Analyzer, the Korean Dictionary, and the Korean Filter.
When using Lucene with Korean, one thinks of the CJK Analyzer.
But the CJK Analyzer is not suitable for Korean.

Korean has specific characteristics, and morphemes need to be analyzed when extracting index keywords.
The Korean Analyzer solves this problem with the Korean Morphological Analyzer.
The Korean Analyzer also has a feature for splitting compound nouns.

Now I want to contribute the Korean Analyzer to the Lucene project.
Please let me know how to contribute it.

If you want to check the source code, please visit the SourceForge CVS repository.

Best regards.

-- 
SooMyung Lee
Director of Research Center
Argonet co. ltd,
Manager of Lucene Korean Analyzer
http://korlucene.naver.com
Contact: +82-10-6480-5710