lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Cullin Wible (JIRA)" <>
Subject [jira] Created: (LUCENE-1403) StandardTokenizer - Improper Hostname Recognition
Date Tue, 23 Sep 2008 17:44:44 GMT
StandardTokenizer - Improper Hostname Recognition

                 Key: LUCENE-1403
             Project: Lucene - Java
          Issue Type: Bug
    Affects Versions: 2.3.2, 2.3.1
         Environment: Java 5
            Reporter: Cullin Wible

As of 2.3.1 the documentation for the StandardTokenizer states that it "Recognizes email addresses
and internet hostnames as one token."

However hostnames such as "" are recognized as two tokens "my" and "".

Any host with a dash in the name is not recognized properly.

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.

To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message