lucene-dev mailing list archives

From "Tom Burton-West (JIRA)" <j...@apache.org>
Subject [jira] Commented: (SOLR-2211) Create Solr FilterFactory for Lucene StandardTokenizer with UAX#29 support
Date Mon, 01 Nov 2010 18:10:27 GMT

    [ https://issues.apache.org/jira/browse/SOLR-2211?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12927067#action_12927067 ]

Tom Burton-West commented on SOLR-2211:
---------------------------------------

Sure, I'll give it a try. I've got a large Monday morning backlog on my to-do list today, so
it will probably be toward the middle of the week.

> Create Solr FilterFactory for Lucene StandardTokenizer with UAX#29 support
> ---------------------------------------------------------------------------
>
>                 Key: SOLR-2211
>                 URL: https://issues.apache.org/jira/browse/SOLR-2211
>             Project: Solr
>          Issue Type: New Feature
>    Affects Versions: 3.1
>            Reporter: Tom Burton-West
>            Priority: Minor
>
> The Lucene 3.x StandardTokenizer with UAX#29 support provides benefits for non-English
> tokenizing. Presently it can be invoked by using the StandardTokenizerFactory and setting
> the Version to 3.1. However, it would be useful to be able to use the improved Unicode processing
> without necessarily including the IP address and email address processing of StandardAnalyzer.
> A FilterFactory that allowed the use of the StandardTokenizer with UAX#29 support on its
> own would be useful.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



