lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Hoss Man (JIRA)" <>
Subject [jira] Resolved: (LUCENE-503) Contrib: ThaiAnalyzer to enable Thai full-text search in Lucene
Date Mon, 05 Jun 2006 17:30:30 GMT
     [ ]
Hoss Man resolved LUCENE-503:

    Resolution: Fixed


> Contrib: ThaiAnalyzer to enable Thai full-text search in Lucene
> ---------------------------------------------------------------
>          Key: LUCENE-503
>          URL:
>      Project: Lucene - Java
>         Type: New Feature

>   Components: Analysis
>     Versions: 1.4
>     Reporter: Samphan Raruenrom
>     Assignee: Hoss Man
>  Attachments:,,
> Thai text don't have space between words. Usually, a dictionary-based algorithm is used
to break string into words. For Lucene to be usable for Thai, an Analyzer that know how to
break Thai words is needed.
> I've implemented such Analyzer, ThaiAnalyzer, using ICU4j DictionaryBasedBreakIterator
for word breaking. I'll upload the code later.
> I'm normally a C++ programmer and very new to Java. Please review the code for any problem.
One possible problem is that it requires ICU4j. I don't know whether this is OK.

This message is automatically generated by JIRA.
If you think it was sent incorrectly contact one of the administrators:
For more information on JIRA, see:

To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message