lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Robert Muir (JIRA)" <>
Subject [jira] Commented: (SOLR-2210) Provide solr FilterFactory for Lucene ICUTokenizer
Date Mon, 01 Nov 2010 17:26:23 GMT


Robert Muir commented on SOLR-2210:

Thanks for opening this, Tom.

I've got some barebones filters for some of this stuff on my computer.
Because the ICU jar file is large, i was trying to see if i could solve LUCENE-2510 first,
but this would only fix the problem for 4.0 anyway.
I think we should just make an icu contrib for now, and put the factories (Tokenizer, Normalizer,
Folding, Transliterator, Collation) and the jar file in there.

> Provide solr FilterFactory for Lucene ICUTokenizer
> --------------------------------------------------
>                 Key: SOLR-2210
>                 URL:
>             Project: Solr
>          Issue Type: New Feature
>    Affects Versions: 3.1
>            Reporter: Tom Burton-West
>            Priority: Minor
> The Lucene ICUTokenizer provides many benefits for multilingual tokenizing.   There should
be a ICUFilterFactory so that it can be used from Solr.   There are probably some issues in
terms of passing configuration parameters.

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.

To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message