lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "David Croley (Created) (JIRA)" <j...@apache.org>
Subject [jira] [Created] (LUCENE-3748) EnglishPossessiveFilter should work with Unicode right single quotation mark
Date Thu, 02 Feb 2012 20:20:53 GMT
EnglishPossessiveFilter should work with Unicode right single quotation mark
----------------------------------------------------------------------------

                 Key: LUCENE-3748
                 URL: https://issues.apache.org/jira/browse/LUCENE-3748
             Project: Lucene - Java
          Issue Type: Improvement
          Components: modules/analysis
    Affects Versions: 3.5, 3.4, 3.2, 3.1
            Reporter: David Croley
            Priority: Minor
         Attachments: LucenePatch

The current EnglishPossessiveFilter (used in EnglishAnalyzer) removes possessives using only
the '\'' character (plus 's' or 'S'), but some common systems (German?) insert the Unicode
"\u2019" (RIGHT SINGLE QUOTATION MARK) instead and this is not removed when processing UTF-8
text. I propose to change EnglishPossesiveFilter to support '\u2019' as an alternative to
'\''.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


Mime
View raw message