Message-ID: <20060222072754.68043.qmail@web50309.mail.yahoo.com>
Date: Tue, 21 Feb 2006 23:27:54 -0800 (PST)
From: Otis Gospodnetic
Subject: Re: ThaiAnalyzer for Lucene
To: java-dev@lucene.apache.org
In-Reply-To: <43FBDFC5.40509@osdev.co.th>

Hi Samphan,

Please create an "issue" in JIRA and attach your code to it. We can put the analyzers in the contrib section of Lucene. I hope DictionaryBasedBreakIterator is not a compile-time dependency, because we probably can't distribute ICU4J due to its license.

Otis

----- Original Message ----
From: Samphan Raruenrom
To: java-dev@lucene.apache.org
Sent: Tuesday, February 21, 2006 10:51:33 PM
Subject: ThaiAnalyzer for Lucene

Hi,

I've written an alpha version of ThaiAnalyzer to enable Thai in Lucene full-text search. Thai has no spaces between words (the same is true of Lao and Khmer), so you need a dictionary-based word breaker to split text into words. I use ICU4J's DictionaryBasedBreakIterator for this job.

I want to contribute the code under the Apache license so that it will be useful to other people. How can I do this? I see analyzers for various languages in the Sandbox. How can I put the code there?

Thanks.

--
_/|\_ Samphan Raruenrom.
Open Source Development Co., Ltd.
Tel: +66 38 311816, Fax: +66 38 773128, http://www.osdev.co.th/
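
For readers of the archive, the word-breaking step described in the original message can be sketched roughly as follows. This is not Samphan's ThaiAnalyzer code. It assumes the JDK's own java.text.BreakIterator as a stand-in for ICU4J's DictionaryBasedBreakIterator, which sidesteps the compile-time dependency raised in the reply. The class name WordBreakSketch and the method segment are hypothetical. Whether Thai text is actually split at word boundaries depends on the locale data shipped with the JDK in use, so no Thai output is shown.

```java
import java.text.BreakIterator;
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;

// Hypothetical sketch: locale-aware word segmentation with the JDK BreakIterator.
public class WordBreakSketch {

    /** Splits text into word tokens using the word-break rules for the given locale. */
    static List<String> segment(String text, Locale locale) {
        BreakIterator it = BreakIterator.getWordInstance(locale);
        it.setText(text);
        List<String> words = new ArrayList<>();
        int start = it.first();
        for (int end = it.next(); end != BreakIterator.DONE; start = end, end = it.next()) {
            String piece = text.substring(start, end);
            // Keep only segments containing a letter or digit; drop spaces and punctuation.
            if (piece.codePoints().anyMatch(Character::isLetterOrDigit)) {
                words.add(piece);
            }
        }
        return words;
    }

    public static void main(String[] args) {
        // Space-delimited text: boundaries fall at the spaces.
        System.out.println(segment("hello world", Locale.ENGLISH));
        // Thai sample (the word for "Thai language", written with Unicode escapes).
        // How it is split depends on the JDK's Thai break data.
        String thai = "\u0E20\u0E32\u0E29\u0E32\u0E44\u0E17\u0E22";
        System.out.println(segment(thai, Locale.forLanguageTag("th")));
    }
}
```

An analyzer would wrap a loop like this in a tokenizer and emit each segment as a token. ICU4J's com.ibm.icu.text.BreakIterator exposes a similarly shaped API, so swapping it in should be a small change if the license question is resolved.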