lucene-dev mailing list archives

From "Robert Muir (Commented) (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (LUCENE-3915) Add Japanese filter to replace term attribute with readings
Date Sat, 24 Mar 2012 21:16:27 GMT

    [ https://issues.apache.org/jira/browse/LUCENE-3915?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13237683#comment-13237683 ]

Robert Muir commented on LUCENE-3915:
-------------------------------------

Looks good!

We should also do a small optimization
to the romanization method.

Instead of String f(String kana), which builds its own StringBuilder
and then calls toString() on it, it can be
f(Appendable sb, CharSequence kana),
and we just pass termAtt.setEmpty() as the 'sb'.
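
A rough sketch of the change suggested above, in plain Java without a Lucene dependency. The class name, method names, and the four-entry kana table are hypothetical and only there to make the example run; the real romanization table is much larger. A StringBuilder stands in for the term attribute here. In Lucene, CharTermAttribute implements Appendable and setEmpty() returns the attribute itself, which is what makes passing termAtt.setEmpty() as the 'sb' possible.

```java
import java.io.IOException;
import java.util.HashMap;
import java.util.Map;

final class RomanizeSketch {
    // Toy mapping for a few katakana only (hypothetical, not Lucene's table).
    private static final Map<Character, String> TABLE = new HashMap<>();
    static {
        TABLE.put('\u30AB', "ka");  // カ
        TABLE.put('\u30CA', "na");  // ナ
        TABLE.put('\u30B9', "su");  // ス
        TABLE.put('\u30B7', "shi"); // シ
    }

    // Before: allocates a StringBuilder and a String on every call.
    static String romanizeToString(String kana) {
        StringBuilder sb = new StringBuilder();
        for (int i = 0; i < kana.length(); i++) {
            char c = kana.charAt(i);
            String r = TABLE.get(c);
            if (r != null) {
                sb.append(r);
            } else {
                sb.append(c); // pass unmapped characters through unchanged
            }
        }
        return sb.toString();
    }

    // After: writes straight into a caller-supplied Appendable,
    // so no intermediate builder or String is created.
    static void romanizeInto(Appendable out, CharSequence kana) throws IOException {
        for (int i = 0; i < kana.length(); i++) {
            char c = kana.charAt(i);
            String r = TABLE.get(c);
            if (r != null) {
                out.append(r);
            } else {
                out.append(c);
            }
        }
    }
}
```

In a token filter the call would then look roughly like romanizeInto(termAtt.setEmpty(), reading). One thing to watch: this only works if the reading comes from somewhere other than the term attribute (for example a separate reading attribute), since setEmpty() clears the term before the method reads its input.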

                
> Add Japanese filter to replace term attribute with readings
> -----------------------------------------------------------
>
>                 Key: LUCENE-3915
>                 URL: https://issues.apache.org/jira/browse/LUCENE-3915
>             Project: Lucene - Java
>          Issue Type: New Feature
>            Reporter: Christian Moen
>            Priority: Minor
>         Attachments: LUCENE-3915.patch
>
>
> Koji and Robert are working on LUCENE-3888 that allows spell-checkers to do their similarity
> matching using a different word than its surface form.
> This approach is very useful for languages such as Japanese where the surface form and
> the form we'd like to use for similarity matching are very different.  For Japanese, it's useful
> to use readings for this -- probably with some normalization.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira


