lucene-dev mailing list archives

From "Karl Wettin (JIRA)" <j...@apache.org>
Subject [jira] Created: (LUCENE-889) Standard tokenizer with punctuation output
Date Fri, 25 May 2007 10:37:16 GMT
Standard tokenizer with punctuation output
------------------------------------------

                 Key: LUCENE-889
                 URL: https://issues.apache.org/jira/browse/LUCENE-889
             Project: Lucene - Java
          Issue Type: Improvement
    Affects Versions: 2.1
            Reporter: Karl Wettin
            Priority: Trivial


This patch adds punctuation tokens (comma, period, question mark, and exclamation point) to the
output of the StandardTokenizer and filters them out again in the StandardFilter.

(I needed them for text classification reasons.)
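
To illustrate the idea, here is a minimal sketch in plain Java. It is not the attached patch and does not use the Lucene API: the class PunctuationTokenSketch, the Token type, the WORD and PUNCTUATION type labels, and the regular expression are all assumptions made for the example. It only shows the two-stage shape described above: a tokenizer that emits the four punctuation marks as tokens of their own, and a later filter that drops them for consumers that do not want them.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical stand-in for the tokenizer/filter pair; not Lucene code.
class PunctuationTokenSketch {
    // Assumed type labels, chosen for this example only.
    static final String WORD = "WORD";
    static final String PUNCTUATION = "PUNCTUATION";

    static final class Token {
        final String text;
        final String type;

        Token(String text, String type) {
            this.text = text;
            this.type = type;
        }
    }

    // Matches a run of letters/digits, or one of: comma, period,
    // question mark, exclamation point.
    private static final Pattern TOKEN =
            Pattern.compile("[\\p{L}\\p{N}]+|[,.?!]");

    // Tokenizer stage: emits words and punctuation as separate tokens.
    static List<Token> tokenize(String input) {
        List<Token> tokens = new ArrayList<>();
        Matcher m = TOKEN.matcher(input);
        while (m.find()) {
            String text = m.group();
            boolean isPunct =
                    text.length() == 1 && ",.?!".indexOf(text.charAt(0)) >= 0;
            tokens.add(new Token(text, isPunct ? PUNCTUATION : WORD));
        }
        return tokens;
    }

    // Filter stage: removes the punctuation tokens again.
    static List<Token> filterPunctuation(List<Token> tokens) {
        List<Token> kept = new ArrayList<>();
        for (Token t : tokens) {
            if (!PUNCTUATION.equals(t.type)) {
                kept.add(t);
            }
        }
        return kept;
    }

    // Helper that extracts the token texts for printing or comparison.
    static List<String> texts(List<Token> tokens) {
        List<String> out = new ArrayList<>();
        for (Token t : tokens) {
            out.add(t.text);
        }
        return out;
    }

    public static void main(String[] args) {
        List<Token> raw = tokenize("Hello, world! Ok?");
        System.out.println(texts(raw));
        System.out.println(texts(filterPunctuation(raw)));
    }
}
```

A text classifier could read the unfiltered token list, while ordinary indexing would apply the filter stage and see only the word tokens.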

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-dev-help@lucene.apache.org

