lucene-dev mailing list archives

From "Robert Muir (JIRA)" <>
Subject [jira] Updated: (SOLR-2210) Provide solr FilterFactory for Lucene ICUTokenizer
Date Tue, 02 Nov 2010 05:27:23 GMT


Robert Muir updated SOLR-2210:

    Attachment: SOLR-2210.patch

Here's a start: this adds an analysis-extras contrib with all the build logic, plus factories for
the ICU filters.

Still to do: add support for custom normalization and custom tokenizer configuration, and add filters for
Smart Chinese and Stempel.

But I think it's OK to commit this as-is and improve it in svn.

> Provide solr FilterFactory for Lucene ICUTokenizer
> --------------------------------------------------
>                 Key: SOLR-2210
>                 URL:
>             Project: Solr
>          Issue Type: New Feature
>    Affects Versions: 3.1
>            Reporter: Tom Burton-West
>            Priority: Minor
>         Attachments: SOLR-2210.patch
> The Lucene ICUTokenizer provides many benefits for multilingual tokenizing. There should
> be an ICUFilterFactory so that it can be used from Solr. There are probably some issues in
> terms of passing configuration parameters.

