commons-issues mailing list archives

From "Sebb (JIRA)" <>
Subject [jira] [Commented] (VALIDATOR-235) UrlValidator rejects url with german umlaut
Date Sun, 04 Jan 2015 22:18:35 GMT


Sebb commented on VALIDATOR-235:

There are two parts to this.

1) Syntax validation.
At present the Regex does not allow Unicode characters, because they are not permitted by

2) Domain validation.
At present the Unicode versions of TLDs are not included in the DomainValidator tables, and
the code does not convert Unicode domains to punycode in order to check against the punycode
There is no point doing either of these until the Regex issues are sorted.
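
As a rough illustration of the conversion step mentioned in part 2, the JDK's java.net.IDN class can turn a Unicode host name into its punycode (ASCII) form, which could then be looked up in ASCII-only tables. This is only a sketch: the class name IdnSketch, the helper toAsciiDomain and the host "bücher.example" are made up for illustration, and nothing here is taken from the DomainValidator code.

```java
import java.net.IDN;
import java.util.Locale;

public class IdnSketch {

    /**
     * Hypothetical helper: converts a host name that may contain Unicode
     * characters to its ASCII (punycode) form and lower-cases the result.
     * IDN.toASCII throws IllegalArgumentException if the input cannot be converted.
     */
    static String toAsciiDomain(String host) {
        return IDN.toASCII(host).toLowerCase(Locale.ENGLISH);
    }

    public static void main(String[] args) {
        // "b\u00fccher.example" is "bücher.example", written with an escape
        // so the source file stays ASCII-only.
        System.out.println(toAsciiDomain("b\u00fccher.example"));
        // A plain ASCII host passes through unchanged.
        System.out.println(toAsciiDomain("www.apache.org"));
    }
}
```

With a step like this, the TLD tables would only need the punycode entries, but whether that is the right approach for DomainValidator is a separate question.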

Presumably the intention is to extend RFC3986 so that Alpha characters can now include Unicode
Similarly for Alphanumerics, though there may be some exceptions. We need to identify the relevant RFCs.
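
To show what widening "Alpha" to Unicode could look like at the regex level, here is a small sketch using java.util.regex. By default \p{Alpha} only matches ASCII letters; with the Pattern.UNICODE_CHARACTER_CLASS flag (Java 7 and later) it follows the Unicode Alphabetic property instead. The class and method names are hypothetical and the patterns are deliberately simplistic; they are not the expressions used by UrlValidator, and any exceptions required by the RFCs are not modelled.

```java
import java.util.regex.Pattern;

public class UnicodeAlphaSketch {

    // Default behaviour: \p{Alpha} is the POSIX class, i.e. ASCII letters only.
    static final Pattern ASCII_ALPHA = Pattern.compile("\\p{Alpha}+");

    // Same expression, but with Unicode-aware predefined character classes.
    static final Pattern UNICODE_ALPHA =
            Pattern.compile("\\p{Alpha}+", Pattern.UNICODE_CHARACTER_CLASS);

    /** True if the whole string consists of ASCII letters. */
    static boolean asciiOnly(String s) {
        return ASCII_ALPHA.matcher(s).matches();
    }

    /** True if the whole string consists of Unicode alphabetic characters. */
    static boolean unicodeAware(String s) {
        return UNICODE_ALPHA.matcher(s).matches();
    }

    public static void main(String[] args) {
        String label = "d\u00fc"; // "dü", escaped to keep the source ASCII-only
        System.out.println(asciiOnly(label));    // rejected by the ASCII-only class
        System.out.println(unicodeAware(label)); // accepted once Unicode classes are enabled
    }
}
```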

> UrlValidator rejects url with german umlaut
> -------------------------------------------
>                 Key: VALIDATOR-235
>                 URL:
>             Project: Commons Validator
>          Issue Type: Bug
>    Affects Versions: 1.3.1 Release
>            Reporter: Brian Preuß
> e.g. http://www.dü
